dhgrs / chainer-ClariNet
A Chainer implementation of ClariNet.
Hi,
I have trained both teacher/student networks.
The teacher has very nice quality, although generation takes far too long (about 6 minutes per 1 s of audio on a GPU).
The student results sound a bit robotic and have "noise" where there should be silence.
Do you have any tips on that? (I haven't changed any parameters besides the path to the trained teacher model.)
UPDATE:
Here are my examples for two short (~1 s) files:
clarinet-results.zip
Thanks.
Hello
I would like to experiment with a 48 kHz sampling frequency.
What parameter settings would be good?
I also want to feed in acoustic features computed outside the script.
Is there a good way to do that?
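As a rough starting point for 48 kHz, one common approach is to scale the frame-level parameters so their durations in seconds stay the same as at the original rate. A minimal sketch, assuming hypothetical parameter names (sr, n_fft, hop_length, win_length) that may not match params.py:

```python
# Hypothetical sketch (the names sr, n_fft, hop_length, win_length are
# illustrative and may not match params.py): scale sample-count parameters so
# the window and hop durations in seconds stay the same at the new rate.
base = {"sr": 22050, "n_fft": 1024, "hop_length": 256, "win_length": 1024}

def scale_params(params, target_sr):
    """Scale frame-level sample counts by the ratio of sampling rates."""
    ratio = target_sr / params["sr"]
    scaled = {"sr": target_sr}
    for key in ("n_fft", "hop_length", "win_length"):
        scaled[key] = int(round(params[key] * ratio))
    return scaled

print(scale_params(base, 48000))  # hop_length 256 -> 557 (~11.6 ms either way)
```

Whether the network architecture (e.g. upsampling factors) also needs adjusting depends on how the conditioning is upsampled, so treat this as a first guess only.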
Hi,
the ClariNet paper also describes a Text-to-Wave architecture for end-to-end TTS.
Do you have any suggestions on what I would need for full TTS once the student network is trained?
Should I use a pre-trained model such as Tacotron 2 or Deep Voice 3 to produce mel-spectrograms? Or something else entirely?
Thanks!
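For reference, the usual bridge in such a two-stage pipeline is that the text-to-mel frontend and the vocoder must agree on the mel-spectrogram format (channel count, frame hop, normalization). A toy sketch with entirely hypothetical function names, not the repo's API:

```python
import numpy as np

# Hypothetical sketch: synthesize_mel stands in for a text-to-mel frontend
# (e.g. Tacotron 2); the trained ClariNet student would play the vocoder role.
def synthesize_mel(text, n_mels=80, frames_per_char=6):
    # Dummy frontend: returns a zero mel spectrogram of a plausible shape.
    return np.zeros((n_mels, len(text) * frames_per_char), dtype=np.float32)

mel = synthesize_mel("Hello world")
# The channel count and frame hop must match what the student was trained on,
# otherwise the conditioning upsampling will not line up with the audio.
assert mel.shape == (80, 66)
# audio = student_vocoder(mel)  # hypothetical call to the trained student
```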
Hi,
is there a reason why
StudentGaussianIAF/teacher_params.py
has length=12000
and
StudentGaussianIAF/params.py
has length=24000
while all the other preprocessing parameters stay the same?
Thanks.
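For context, assuming length counts raw audio samples and LJSpeech's 22,050 Hz rate, the two settings correspond to different training-segment durations:

```python
# Assumption: `length` counts raw audio samples; LJSpeech is sampled at 22050 Hz.
sr = 22050
for length in (12000, 24000):
    print(f"length={length} -> {length / sr:.2f} s per training segment")
# length=12000 -> 0.54 s, length=24000 -> 1.09 s
```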
Hi,
Thanks for sharing.
It seems there is an extra sampling step after the student network at https://github.com/dhgrs/chainer-ClariNet/blob/master/StudentGaussianIAF/net.py#L58 to get student samples. However, Algorithm 1
in https://arxiv.org/pdf/1807.07281.pdf uses z from the last IAF flow as the student samples. What is the motivation for making this change?
Thanks,
Jian
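To make the two options concrete, here is a NumPy-only toy sketch (not the repo's Chainer code) of using the IAF output directly versus drawing one extra sample from the predicted output Gaussian; the parameter values are invented:

```python
import numpy as np

rng = np.random.default_rng(0)
z0 = rng.standard_normal(4)    # base noise fed into the flow
mu, log_s = 0.1, -1.0          # toy per-sample Gaussian parameters from the last flow

# Option 1 (Algorithm 1 in the paper): the transformed noise IS the sample.
z = z0 * np.exp(log_s) + mu

# Option 2 (the extra step asked about): redraw from N(mu, s^2) with fresh noise.
extra = mu + np.exp(log_s) * rng.standard_normal(4)

# Both have the same marginal distribution, but Option 2 breaks the
# deterministic dependence on z0 (and hence the gradient path through it).
assert z.shape == extra.shape == (4,)
```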
Hi,
I trained the AutoregressiveWaveNet using the command from README.md:
python train.py -g 0
Then I tried to generate audio using the command from README.md:
python generate.py -i ../../LJSpeech-1.1/wavs/LJ001-0001.wav -o result.wav -m 2018_09_27_16_03_22/snapshot_iter_500000 -g 0
The generated audio was completely silent; do you have any tips on what could have gone wrong?
Thanks.
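One quick way to tell whether the output is truly silent (all-zero samples) or just very quiet is to inspect the peak amplitude. A sketch assuming the result is a 16-bit PCM mono WAV:

```python
import wave

import numpy as np

def peak_amplitude(path):
    """Return the peak absolute sample value of a 16-bit PCM WAV file."""
    with wave.open(path, "rb") as f:
        frames = f.readframes(f.getnframes())
    samples = np.frombuffer(frames, dtype=np.int16)
    return int(np.abs(samples).max()) if samples.size else 0

# A truly silent file peaks at (or near) 0; a quiet-but-alive one is well above.
# print(peak_amplitude("result.wav"))
```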
Hello.
I would like to experiment with other acoustic features that have already been extracted from speech.
I have tried several approaches, but so far none work well.
Could you tell me if there is a good way to do this?
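A common pattern for this is to precompute the features offline (e.g. as .npy files) and load them in place of the script's extractor. A sketch under the assumption that the network conditions on a (channels, frames) float32 array; the function name and layout are illustrative, not the repo's API:

```python
import numpy as np

# Assumptions: features were saved with np.save and should condition the
# network as a (channels, frames) float32 array; names are illustrative only.
def load_precomputed_features(path, n_channels=80):
    feats = np.load(path)
    if feats.ndim != 2:
        raise ValueError("expected a 2-D feature array")
    # Accept (frames, channels) files by transposing to (channels, frames).
    if feats.shape[0] != n_channels and feats.shape[1] == n_channels:
        feats = feats.T
    return feats.astype(np.float32)
```

Whatever layout is chosen, it must match the shape the dataset loader feeds to the conditioning network, so check against the script's own extraction code.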
I want to analyze the samples, but it seems they cannot be downloaded from nana-music. Is there any chance you could share your samples in a different way?
Thanks,
Hi,
Do you have any plans for this?
thanks