Comments (7)
what is your language?
from styletts2.
It depends on how much speech data you have. If you only have 10-20 hours of data like LJSpeech, then you need millions of sentences in PL-BERT to have a good performance. Otherwise, if you have like thousands of hours speech data, probably you don’t even need a pretrained PL-BERT to begin with.
from styletts2.
Thanks, Is there any PL
what is your language?
For Mandarin and English mixed TTS
from styletts2.
@JohnHerry What do you mean by mixed? Like a sentence contains both English and Mandarin, or just something trained on the mixture of English and Mandarin datasets?
from styletts2.
@JohnHerry What do you mean by mixed? Like a sentence contains both English and Mandarin, or just something trained on the mixture of English and Mandarin datasets?
@yl4579 I am sorry for my ill expression. I mean one sentence contains both English and Mandarin. Eg. "我们来讨论一下这个 paper idea"。 The PLM should be based on Mandarin phonemes but also support English phonemes. I made an issure in the phonemizer project and got that its "language_switch" function is limited and the output may contains some faults when the input text is bilingual.
And what I concern about also, is the alphabeta of bilingual phonemes. As what you said in the StyleTTS-issure10, You had tried to map Mandarin-PinYin into IPA characters but not work. I guess that may be raised in that the Alphabeta in your PinYin-IPA mapping, is not the same with the Alphabeta used in the https://github.com/bootphon/phonemizer . So if the output of phonemizer is what the PLM takes, which means they are sharing the same phoneme Alphabeta, then your PinYin-IPA mapping will not work. I did not find the way to get its IPA alphabeta that support cross-lingual phoneme transcription from the phonemizer project. Do you have any ideas how to get such an Alphabeta?
from styletts2.
@JohnHerry Actually I have trained a multilingual model in English, Mandarin and Japanese and it works quite well with all the settings I provided in yl4579/StyleTTS#10. The PL-BERT was also pre-trained on the Wikipedia corpora of these 3 languages, where I used word-level tokenizers for all languages and merged shared graphemes for Japanese and Chinese (for example, both /ɽˈʲokˈai/ and /lˈjaʊtɕˈjɛ/ correspond to the grapheme 了解). It is well compatible with phonemizer as the English IPA was generated by it.
As for your question regarding mixed language, I'm not sure how to get training data for this. The PL-BERT I pre-trained was trained separately on mixed datasets but not mixed languages in a single sentence. However, if your speech data contains a lot of samples of this type it probably would work with this pre-trained PL-BERT as well.
from styletts2.
Thanks for the information, it is helpfull.
from styletts2.
Related Issues (20)
- styletts2 inference pip package HOT 1
- Current code doesn't work with hifigan HOT 4
- Testing foundation layer needed!
- Noise on long sentences HOT 1
- Some of FineTuning has this error HOT 5
- Using a smaller Hifigan HOT 1
- An Error From LJspeech Dataset HOT 2
- Stage 2 training bug (after joint training) HOT 7
- Speech-to-speech possible? HOT 4
- stage1 training issue HOT 6
- 你好,请问模型支持流式tts吗? HOT 1
- Fine-tuning worsens the quality of speech-synthesis. HOT 6
- When start firtst_train give errors. I have 96 Gb Ram and 3 P40/24GB/ 1 T4 /16GB/ ?? HOT 10
- Train a zero-shot voice adaptation model for a different accent/language HOT 1
- Finetuning kernel size issue HOT 1
- Preparing text and data HOT 2
- training single speaker different accent/language results HOT 3
- Training a Japanese model, pitch accent and IPA HOT 1
- OOD data for LibriTTS-460 training? HOT 1
- Chinese data HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from styletts2.