Code Monkey home page Code Monkey logo

utterancepit-speech-separation's Introduction

UtterancePIT-Speech-Separation

According to funcwj's uPIT, the training code supporting multi-gpu is written, and the Dataloader is reconstructed.

If you want to see the funcwj code, this is his repository link.

uPIT-for-speech-separation

Demo Pages: Results of pure speech separation model

Accomplished goal

  • Support Multi-GPU Training
  • Use the Dataloader Method That Comes With Pytorch
  • Provide Pre-Training Models

Python Library Version

  • Pytorch==1.3.0
  • tqdm==4.32.1
  • librosa==0.7.1
  • scipy==1.3.0
  • numpy==1.16.4
  • PyYAML==5.1.1

How to Using This Repository

  1. Generate dataset using create-speaker-mixtures.zip with WSJ0 or TIMI

  2. Prepare scp file(The content of the scp file is "filename path")

     python create_scp.py
  3. Prepare cmvn(Cepstral mean and variance normalization (CMVN) is a computationally efficient normalization technique for robust speech recognition.).

     #Calculated by the compute_cmvn.py script: 
     python compute_cmvn.py ./tt_mix.scp ./cmvn.dict
  4. Modify the contents of yaml, mainly to modify the scp address, cmvn address. At the same time, the number of num_spk in run_pit.py is modified.

  5. Training:

    sh train.sh
  6. Inference:

    sh test.sh
    

Reference

  • Kolbæk M, Yu D, Tan Z H, et al. Multitalker speech separation with utterance-level permutation invariant training of deep recurrent neural networks[J]. IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), 2017, 25(10): 1901-1913.
  • https://github.com/funcwj/uPIT-for-speech-separation

utterancepit-speech-separation's People

Contributors

jusperlee avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.