Code Monkey home page Code Monkey logo

shiwanglei's Projects

adaspeech icon adaspeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

adaspeech2 icon adaspeech2

AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data

adaspeech2-1 icon adaspeech2-1

AdaSpeech2 based on https://github.com/rishikksh20/AdaSpeech2

awesome-diarization icon awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

coqui-tts icon coqui-tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

deep-speaker icon deep-speaker

Deep Speaker: an End-to-End Neural Speaker Embedding System https://arxiv.org/pdf/1705.02304.pdf

kaldi icon kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

nlp-svm icon nlp-svm

网易云音乐曲风分类机器人

pyannote-audio icon pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding

pyaudioanalysis icon pyaudioanalysis

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

quickgitclone icon quickgitclone

这是一个帮助快速克隆git仓库+生成、下载网站二维码+选中右击跳转百度的Chrome插件

simpleder icon simpleder

A lightweight library to compute Diarization Error Rate (DER).

speaker-identification-using-gmms icon speaker-identification-using-gmms

It uses GMM to train a speaker identification model. The training and testing has been done on subset (34 speakers) from VoxForge data corpus.

speakerdiarization_rnn_cnn_lstm icon speakerdiarization_rnn_cnn_lstm

Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should state when speaker starts and ends. In this project, we analyze given audio file with 2 channels and 2 speakers (on separate channels).

spectralcluster icon spectralcluster

Python re-implementation of the spectral clustering algorithm in the paper "Speaker Diarization with LSTM"

uis-rnn icon uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

v-vector-tf icon v-vector-tf

Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"

vae icon vae

a simple vae and cvae from keras

vbx icon vbx

Variational Bayes HMM over x-vectors diarization on DIHARD II

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.