
Comments (4)

mravanelli avatar mravanelli commented on May 19, 2024

Hi,
this repository is mainly intended for speech recognition. You are probably talking about the other repository, where we used SincNet for speaker ID (https://github.com/mravanelli/SincNet). To address another task, you have to change the datasets and labels. To assign each sentence to the right label, you have to modify the dictionary "TIMIT_labels.npy", as you pointed out. When you change the task, it can be very important to properly tune the hyperparameters of the model (e.g., cw_len, cnn_N_filt, cnn_len_filt, fc_lay, lr) to make them more suitable for the new task.
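A minimal sketch of building such a label dictionary for a new task. It assumes the dictionary maps each audio file path to an integer class ID and is stored as a pickled Python dict in a .npy file; the file names, the emotion classes, and the output name "my_labels.npy" are hypothetical, so check the format against what the training script actually loads.

```python
import os
import tempfile

import numpy as np


def build_label_dict(utterances):
    """Map each utterance path to an integer class ID.

    `utterances` is a list of (path, class_name) pairs. Class names are
    sorted so the name -> ID mapping is reproducible across runs.
    """
    classes = sorted({name for _, name in utterances})
    class_to_id = {name: idx for idx, name in enumerate(classes)}
    lab_dict = {path: class_to_id[name] for path, name in utterances}
    return lab_dict, class_to_id


# Hypothetical file list for an emotion task (not taken from any real dataset).
utterances = [
    ("train/clip_0001.wav", "happy"),
    ("train/clip_0002.wav", "angry"),
    ("test/clip_0003.wav", "happy"),
]

lab_dict, class_to_id = build_label_dict(utterances)

# np.save pickles the dict inside a 0-d object array.
out_path = os.path.join(tempfile.mkdtemp(), "my_labels.npy")
np.save(out_path, lab_dict)

# Reading it back requires allow_pickle=True; .item() unwraps the dict.
loaded = np.load(out_path, allow_pickle=True).item()
```

The number of distinct classes (here `len(class_to_id)`) would also have to match the size of the model's output layer.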
Please let me know if you manage to get it working!

Thank you!

from pytorch-kaldi.

tsly123 avatar tsly123 commented on May 19, 2024

Hi,
Thank you for your reply. The repo instructions are very informative. I will get back to you when I am able to run my fusion models.

Again, thank you for your time.
tsly

from pytorch-kaldi.

tsly123 avatar tsly123 commented on May 19, 2024

Hi,
I apologize for this, but after struggling with Kaldi ASR (I'm new to Kaldi), I realized that my EmotiW dataset, which contains only *.avi files, can't be prepared as instructed in the TIMIT tutorial, which requires other mandatory files (as stated in Kaldi for Dummies), such as text, lexicon, spk2utt, etc.

Is there another way to do the data preparation and alignment myself, such as preparing pre-extracted features and labels in a form that is compatible with pytorch-kaldi?
I've tried running the Librispeech s5 recipe and other free datasets with Kaldi to see how the prepared data is structured, but I always got errors. I've also looked at the kaldi-io-for-python repo and thought that the features could be converted to ark files with it, but I don't know how to do this for the labels and alignments.
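A minimal sketch of writing part of a Kaldi data directory (wav.scp, utt2spk, spk2utt) by hand for video files. It assumes ffmpeg is installed, that each wav.scp entry may be a command ending in "|" that decodes the audio track to 16 kHz mono WAV on stdout, and that utterance IDs are prefixed with the speaker ID; the IDs and paths are hypothetical. It does not produce the text file, the lexicon, or the alignments.

```python
import os
import tempfile
from collections import defaultdict


def write_kaldi_data_dir(data_dir, records):
    """Write wav.scp, utt2spk and spk2utt for (utt_id, spk_id, avi_path) records."""
    os.makedirs(data_dir, exist_ok=True)
    # Kaldi expects these files to be sorted by utterance ID.
    records = sorted(records)
    spk2utt = defaultdict(list)

    wav_scp_path = os.path.join(data_dir, "wav.scp")
    utt2spk_path = os.path.join(data_dir, "utt2spk")
    with open(wav_scp_path, "w") as wav_scp, open(utt2spk_path, "w") as utt2spk:
        for utt_id, spk_id, avi_path in records:
            # Trailing "|" marks the entry as a command whose stdout is the WAV data.
            wav_scp.write(
                f"{utt_id} ffmpeg -loglevel error -i {avi_path} "
                f"-f wav -ac 1 -ar 16000 - |\n"
            )
            utt2spk.write(f"{utt_id} {spk_id}\n")
            spk2utt[spk_id].append(utt_id)

    # spk2utt is the inverse mapping: one line per speaker, listing its utterances.
    with open(os.path.join(data_dir, "spk2utt"), "w") as f:
        for spk_id in sorted(spk2utt):
            f.write(f"{spk_id} {' '.join(spk2utt[spk_id])}\n")


# Hypothetical records; the speaker ID is used as a prefix of the utterance ID.
records = [
    ("spk1-clip2", "spk1", "/data/emotiw/clip2.avi"),
    ("spk1-clip1", "spk1", "/data/emotiw/clip1.avi"),
    ("spk2-clip1", "spk2", "/data/emotiw/clip3.avi"),
]
data_dir = os.path.join(tempfile.mkdtemp(), "data", "train")
write_kaldi_data_dir(data_dir, records)
```

With these files in place, feature extraction could in principle run as in a standard recipe, while the labels would still have to come from an alignment step or from a separately prepared label file.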

Thank you for your time.
tsly

from pytorch-kaldi.

mravanelli avatar mravanelli commented on May 19, 2024

from pytorch-kaldi.
