Hey guys, the latest commits from today seem to break the evaluation code. More precisely <

eval.py: Evaluation broken (tacotron issue, open)

kyubyong commented on July 28, 2024

from tacotron.

Comments (3)

Kyubyong commented on July 28, 2024

Technically speaking, the mel scale is not exactly the same as a log scale. See https://en.wikipedia.org/wiki/Mel_scale. The paper says they use a mel spectrogram and a linear-scale log magnitude (spectrogram). So spectrogram2wav converts the predicted magnitude to the waveform. It has nothing to do with the mel spectrogram.
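To make the mel vs. log point concrete, here is a rough sketch using one commonly cited Hz-to-mel formula (the one given on the Wikipedia page linked above). The function name `hz_to_mel` is just for illustration and is not taken from this repo.

```python
import math

def hz_to_mel(freq_hz):
    """One common Hz-to-mel conversion (illustrative helper, not from this repo)."""
    return 2595.0 * math.log10(1.0 + freq_hz / 700.0)

# Compare the mel value with a plain log10 of the frequency.
# Mel grows almost linearly at low frequencies and only becomes
# log-like at high frequencies, so the two scales are not the same.
for f in (100.0, 200.0, 1000.0, 2000.0, 8000.0):
    print(f, round(hz_to_mel(f), 1), round(math.log10(f), 3))
```

Doubling 100 Hz to 200 Hz nearly doubles the mel value, while doubling 4000 Hz to 8000 Hz adds far less than that in proportion, which is why mel is only roughly logarithmic.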

There are two reasons, in my opinion, why people care about whether we apply the logarithm to the magnitude in training. First, we, or at least I, don't fully understand why it is useful. Second, in practice it needs our attention because padding happens in three places: reducing frames, dynamic padding, and convolution with same padding.

For now I don't know why there's a problem when you set use_log_magnitude to False. I'll check soon.

Kyubyong commented on July 28, 2024

Oh now I know the reason. For the plain magnitude we must not allow negative numbers because of the power. I simply clipped the value to zero. https://github.com/Kyubyong/tacotron/blob/master/eval.py#L70
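A minimal sketch of why the clipping matters, assuming the magnitude is raised to a fractional power before waveform reconstruction. The exponent `1.5`, the function name `sharpen_magnitude`, and the sample values are made up for illustration and are not copied from eval.py.

```python
def sharpen_magnitude(magnitudes, power=1.5):
    """Clip negative predictions to zero, then raise to `power`.

    Hypothetical helper: `power=1.5` is an assumed value, not the repo's setting.
    """
    clipped = [max(m, 0.0) for m in magnitudes]
    return [m ** power for m in clipped]

# Made-up network output; a linear output layer can produce negative values.
predicted = [0.25, -0.04, 1.0]

# Without clipping, a negative base with a fractional exponent is not a real
# number. Plain Python returns a complex value here.
unclipped = predicted[1] ** 1.5
print(type(unclipped))

# With clipping, every value stays a non-negative real number.
print(sharpen_magnitude(predicted))
```

With log magnitude this does not come up, because the exponential that undoes the log is always positive.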

chief7 commented on July 28, 2024

Well, you're right when you say that mel isn't exactly log. And to be honest, my explanation isn't much more than a first guess. I didn't check every part of the code and I agree with you: I'm not quite sure what the log stuff is all about.

But even after your latest commits, I can't generate non-silent audio from my model if use_log_magnitude is False.
