Code Monkey home page Code Monkey logo

wht2020's Projects

aesrc2020 icon aesrc2020

Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech Recognition Challenge (AESRC).

ast icon ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

audio icon audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

awesome-diarization icon awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

ds-tdnn icon ds-tdnn

Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch

ecapa-tdnn icon ecapa-tdnn

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

kaldi icon kaldi

This is the official location of the Kaldi project.

open-speech-corpora icon open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

paddlespeech icon paddlespeech

Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

pytorch-book icon pytorch-book

PyTorch tutorials and fun projects including neural talk, neural style, poem writing, anime generation (《深度学习框架PyTorch:入门与实战》)

s3prl icon s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

speecht5 icon speecht5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

tuning_playbook icon tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

voiceprint icon voiceprint

A simple model implemented with tensorflow for voiceprint

wespeaker icon wespeaker

Research and Production Oriented Speaker Recognition Toolkit

zhvoice icon zhvoice

Chinese voice corpus. 中文语音语料,语音更加清晰自然,包含8个开源数据集,3200个说话人,900小时语音,1300万字。

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.