Comments (3)
Technically speaking, the mel scale is not exactly the same as a log scale. See https://en.wikipedia.org/wiki/Mel_scale. The paper says they use a mel spectrogram and a linear-scale log magnitude (spectrogram). So spectrogram2wav converts the predicted magnitude to the waveform. It has nothing to do with the mel spectrogram.
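To illustrate the first point, here is a minimal sketch comparing the mel scale with a pure log scale. It assumes the common formula m = 2595 * log10(1 + f / 700) from the Wikipedia page linked above (other mel conventions exist), and the helper name hz_to_mel is only for illustration, not something from this repository.

```python
import math

def hz_to_mel(f_hz):
    """Convert Hz to mel using m = 2595 * log10(1 + f / 700).

    This is one common convention (an assumption here); other variants exist.
    """
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)

# Frequencies that double at every step.
freqs = [100.0, 200.0, 400.0, 800.0, 1600.0, 3200.0, 6400.0]

# On a pure log scale, each doubling adds the same step.
log_steps = [math.log2(b) - math.log2(a) for a, b in zip(freqs, freqs[1:])]

# On the mel scale, the step per doubling grows with frequency:
# roughly linear at low frequencies, roughly logarithmic at high ones.
mel_steps = [hz_to_mel(b) - hz_to_mel(a) for a, b in zip(freqs, freqs[1:])]

for f, step in zip(freqs[1:], mel_steps):
    print(f"up to {f:6.0f} Hz: mel step = {step:7.1f}")
```

The log steps are all equal, while the mel steps keep growing, which is why the two scales should not be treated as interchangeable.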
The reason why people care about whether we apply the logarithm to the magnitude in training is twofold, in my opinion. First, we, or at least I, don't fully understand why it is useful. Second, in practice it needs our attention because padding happens in three places: frame reduction, dynamic padding, and convolution with same padding.
For now I don't know why there's a problem when you set use_log_magnitude to False. I'll check soon.
from tacotron.
Oh, now I know the reason. For the plain magnitude we must not allow negative numbers because of the power operation applied to it. I simply clipped the value to zero. https://github.com/Kyubyong/tacotron/blob/master/eval.py#L70
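A minimal sketch of why the clipping matters, not the actual code in eval.py. It assumes the predicted magnitude is raised to a fractional power (1.2 is used here as an illustrative value) and that the log branch is undone with exp; the helper name sharpen_magnitude and its parameters are hypothetical.

```python
import numpy as np

def sharpen_magnitude(predicted, power=1.2, use_log_magnitude=False):
    """Hypothetical helper: turn a network output into a non-negative
    magnitude, then raise it to `power`."""
    predicted = np.asarray(predicted, dtype=np.float64)
    if use_log_magnitude:
        # Assumption: the log is undone with exp, whose output is always positive.
        magnitude = np.exp(predicted)
    else:
        # A plain-magnitude prediction can dip below zero, so clip it to zero.
        magnitude = np.clip(predicted, 0.0, None)
    return magnitude ** power

predicted = np.array([-0.05, 0.0, 0.3, 1.0])

# Without clipping, a negative base with a fractional exponent yields nan.
with np.errstate(invalid="ignore"):
    unclipped = predicted ** 1.2

clipped = sharpen_magnitude(predicted)

print("unclipped:", unclipped)
print("clipped:  ", clipped)
```

A single nan in the magnitude can spread through the later inverse transform, so clipping first keeps the whole array finite.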
Well, you're right that mel isn't exactly log. And to be honest, my explanation isn't much more than a first guess. I didn't check every part of the code, and I agree with you: I'm not quite sure what the log stuff is all about.
But even after your latest commits, I can't generate non-silent audio from my model if use_log_magnitude is False.