tsaifangsheng,github

adversarial-examples-pytorch

Implementation of Papers on Adversarial Examples

A TensorFlow implementation of AnimeGAN for fast photo animation! This is the open-source implementation of the paper 「AnimeGAN: a novel lightweight GAN for photo animation」, which uses the GAN framework to transform real-world photos into anime images.

animegan2-pytorch

PyTorch implementation of AnimeGANv2

animeganv2

[Open Source]. The improved version of AnimeGAN. Landscape photos/videos to anime

attentions

PyTorch implementation of some attentions for Deep Learning Researchers.

attentions-in-tacotron

audioldm

AudioLDM: Generate speech, sound effects, music and beyond, with text.

audiolm-pytorch

Implementation of AudioLM, a SOTA language modeling approach to audio generation out of Google Research, in PyTorch

auxiliaryasr

Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)

bark

🔊 Text-Prompted Generative Audio Model

bddm

BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis

binary-gen-models

binauralspeechsynthesis

bvae-tts

Official implementation of BVAE-TTS

byte2speech

cargan

Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"

cdiffuse

Conditional Diffusion Probabilistic Model for Speech Enhancement

comprehensive-transformer-tts

A non-autoregressive Transformer-based text-to-speech system, supporting a family of SOTA transformers with supervised and unsupervised duration modeling. This project grows with the research community, aiming to achieve the ultimate TTS.

compressed-tts

Compressed version of Tacotron 2 using Tensor Train + Waveglow.

controlnet

Let us control diffusion models!

cvss

CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus

deepstory

Deepstory turns a text/generated text into a video where the character is animated to speak your story using his/her voice.

diffsptk

A differentiable version of SPTK

diffusion_distiller

🚀 PyTorch Implementation of "Progressive Distillation for Fast Sampling of Diffusion Models" (v-diffusion)

diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

eets

A PyTorch implementation of EETS: End-to-End Adversarial Text-to-Speech

erisha

ERISHA is a multilingual, multispeaker expressive speech synthesis framework. It can transfer expressivity to the voice of a speaker for whom no expressive speech corpus is available.

expressivetacotron

This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, and GST, VAE, GMVAE, and X-vectors for building the prosody encoder.

fastdiff

PyTorch Implementation of FastDiff (IJCAI'22)
