Code Monkey home page Code Monkey logo

timit-preprocessor's Introduction

TIMIT Preprocessor

timit-preprocessor extract mfcc vectors and phones from TIMIT dataset for advanced use on speech recognition.

Overview

The TIMIT corpus of read speech is designed to provide speech data for acoustic-phonetic studies and for the development and evaluation of automatic speech recognition systems. More information on website or Wiki

Installation

Note that to install Kaldi first by following the instructions in INSTALL.

(1)
go to tools/ and follow INSTALL instructions there.

(2) go to src/ and follow INSTALL instructions there.

After running the scripts instructed by INSTALL in tools/, there will be reminder as followed. Go and run it.

Kaldi Warning: IRSTLM is not installed by default anymore. If you need IRSTLM, use the script extras/install_irstlm.sh

After ensuring kaldi installation, we can start by running

git clone https://github.com/orbxball/timit-preprocessor.git

Preprocessing

Steps

  1. Run ./convert_wav.sh only in the first time after cloning this repo.

  2. python3 parsing.py -h to see instructions parsing timit dataset for phone labels and raw intermediate files in folder data/material/.

  3. ./extract_mfcc.sh to extract mfcc vectors into .scp and .ark files.

Finally, there's a folder called data/ which contains all the outcomes in the belowing directory structure:

data/
|-- material
|   |-- test.lbl
|   `-- train.lbl
`-- processed
    |-- test.39.cmvn.ark
    |-- test.39.cmvn.scp
    |-- test.extract.log
    |-- train.39.cmvn.ark
    |-- train.39.cmvn.scp
    `-- train.extract.log

If you want to do further operations, there's a good repo called kaldi-io-for-python.

Contact

Feel free to contact me if there's any problems.

License

BSD 3-Clause License (2017), Jun-You Liu

timit-preprocessor's People

Contributors

orbxball avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.