Code Monkey home page Code Monkey logo

sc-cnn's Introduction

SC-CNN : Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems

Thanks to StyleSpeech, we built up our codes based on Link

  1. LibriTTS dataset (train-clean-100 and train-clean-360) is used.
  2. Sampling rate is set to 16000Hz.
  3. This is the implementation of SC-StyleSpeech.
    For SC-TransferTTS, please refer to the branch SC-TransferTTS

Materials

Prerequisites

  • Clone this repository.
  • Install python requirements. Please refer requirements.txt

Preparing

  1. Run
python prepare_align.py --data_path [LibriTTS DATAPATH]

for some preparations.

  1. Montreal Forced Aligner (MFA) is used to obtain the alignments between the utterances and the phoneme sequences. 1-1. Download MFA following the command in the website. 1-2. Run the below codes
$ conda activate aligner
$ mfa model download acoustic english_mfa
$ mfa align ......LibriTTS/wav16 lexicon.txt english_us_arpa .........LibriTTS/Textgrid
  1. Run
python preprocess.py

2-0. Check input&output data paths.

Training

python train.py
  1. Change default settings --data_path [Preprocessed LibriTTS DATAPATH] --save_path [Experiment SAVEPATH]
  2. You can change hyperparameters of SC-CNN (kernel_size, channels), or other model configurations in configs/config.json

Inference

  1. Mel generation
python synthesize.py --checkpoint_path [CKPT PATH] --ref_audio [REF AUDIO PATH] --text [INPUT TEXT]

sc-cnn's People

Contributors

hcy71o avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.