Code Monkey home page Code Monkey logo

gongchenghhu's Projects

asr_guided_tacotron icon asr_guided_tacotron

Use las to enhance the performance of tacotron, especially at the lack of the speaker labels.

auorange icon auorange

Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet

bigcidian icon bigcidian

Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.

bit-rnn icon bit-rnn

Quantize weights and activations in Recurrent Neural Networks.

cnngraph icon cnngraph

A tool that automatically extracts network structures from Tensorflow model files

complexeventextraction icon complexeventextraction

A concept and obvious expression pattern collection of Chinese compound event extraction which then be evolved into ComplexEventGraph,本项目提出了中文复合事件的概念与显式模式,包括条件事件、因果事件、顺承事件、反转事件等事件抽取,并形成事理图谱。

comprehensive-e2e-tts icon comprehensive-e2e-tts

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS

comprehensive-transformer-tts icon comprehensive-transformer-tts

A Non-Autoregressive Transformer based TTS, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS.

cross-lingual-voice-cloning icon cross-lingual-voice-cloning

Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.

cs420-zeroshot-tts-korean icon cs420-zeroshot-tts-korean

[Autumn 2019] [KAIST CS420] Transfer Learning from Speaker Verification to Zero-Shot Multispeaker Korean Text-To-Speech Synthesis

emotion2vec icon emotion2vec

Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

emotional-vits icon emotional-vits

无需情感标注的情感可控语音合成模型,基于VITS

emov-db icon emov-db

The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems

erisha icon erisha

ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for which no expressive speech corpus is available.

espnet icon espnet

End-to-End Speech Processing Toolkit

expressivetacotron icon expressivetacotron

This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building prosody encoder.

fac-via-ppg icon fac-via-ppg

Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)

fastspeech icon fastspeech

The Implementation of FastSpeech based on pytorch.

fastspeech2 icon fastspeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

fcl-taco2 icon fcl-taco2

Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021

fg-transformer-tts icon fg-transformer-tts

Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.

gantts icon gantts

PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.