Comments (10)

adiyoss commented on August 16, 2024 16

Hi @jainal09,
We are working on it! :) I believe we will be able to upload some models soon.
Will update of course :)

from svoice.

jainal09 commented on August 16, 2024 8

I don't understand why this is a problem, because I have seen NVIDIA open-source their NeMo ASR model that was trained on the WSJ dataset: https://ngc.nvidia.com/catalog/models/nvidia:wsj_quartznet_15x5

ostapstephan commented on August 16, 2024 2

@adiyoss Is there any update on when the pretrained model trained on LibriSpeech will be available?
Edit:
Could you please provide some information about the resources required to train the model to completion? Figure 4 in the paper shows training over 60 hours but does not mention the hardware used. Was this on a single GPU or on hundreds of them?

jeffshee commented on August 16, 2024 2

@adiyoss Hi, thanks for the repo. Is there any update on pretrained models?

hevinyu commented on August 16, 2024 1

Great job. Looking forward to the pre-trained models.

adiyoss commented on August 16, 2024

Hi @rafaelvalle,
Unfortunately, since the model was trained on the WSJ dataset, which is not publicly available, we cannot legally release pre-trained models.
However, if you have access to this dataset, you can train the model yourself (all relevant details are in the repo). Another option is to train on the LibriMix dataset (based on LibriSpeech).

FarisHijazi commented on August 16, 2024

If anyone does train a model on any other dataset, PLEASE do upload it and help us out. Thanks!

AlexeyBoiler commented on August 16, 2024

> Hi @jainal09,
> We are working on it! :) I believe we will be able to upload some models soon.
> Will update of course :)

Hello. Do you plan to train on the LibriMix dataset (based on LibriSpeech)?

nshreyasvi commented on August 16, 2024

Hello, I tried to run the voice separation using the trained models available at https://ngc.nvidia.com/catalog/models/nvidia:wsj_quartznet_15x5 but got the following error:

    in deserialize_model
        klass = package['class']
    KeyError: 'class'

Do you know how to fix this error and run voice separation on a custom .wav file?

RHTNT commented on August 16, 2024

A pretrained model would be great :)
