Code Monkey home page Code Monkey logo

Comments (7)

gitmylo avatar gitmylo commented on June 3, 2024

Nice, for proper credit, you should create a pull request to https://github.com/gitmylo/Voice-cloning-quantizers, which has an index of voice cloning quantizers. Just add it to the models.json

from bark-voice-cloning-hubert-quantizer.

junwchina avatar junwchina commented on June 3, 2024

pr created.

from bark-voice-cloning-hubert-quantizer.

Maverick1983 avatar Maverick1983 commented on June 3, 2024

@junwchina after trained model, how can use for clone voice?

from bark-voice-cloning-hubert-quantizer.

junwchina avatar junwchina commented on June 3, 2024

@junwchina after trained model, how can use for clone voice?

You can use tts-generation-webui to clone and test voice. Remember to replace the qunatizer model to your trained model.

clone voice

from bark-voice-cloning-hubert-quantizer.

Naozumi520 avatar Naozumi520 commented on June 3, 2024

@junwchina

I'm happy to see we can actually add a new language to bark. Can you please guide me to add an language to it (Cantonese)? I have few issue about this:

  1. bark-data-gen generate training data yes. But for unsupported language we cannot use it directly. How did u overcome this issue and use your own dataset to train your Japanese quantizer?

  2. How should the dataset looks like?

  3. Other steps?

I'll be super grateful if you can help, thank you.

Update: My discord is .naozumi, it'll be easier for me to talk in there!

from bark-voice-cloning-hubert-quantizer.

junwchina avatar junwchina commented on June 3, 2024

bark-data-gen is a tool used to generate training data and train your quantizer model. The models we use to generate training data come from Bark itself, so It should support Chinese/Cantonese very well.

You need to follow these steps:

  • Generate semantic data from text(Chinese/Cantonese content).
  • Generate wav files from above semantic data.
  • Train your Cantonese model from wavs and semantic data. Wav files is input of this model, semantic data is the output.

For more details, you can check my train script .

from bark-voice-cloning-hubert-quantizer.

junwchina avatar junwchina commented on June 3, 2024

I am going to close this issue. It's better to create new issue If you still have other problems.

from bark-voice-cloning-hubert-quantizer.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.