tsaifangsheng Goto Github PK
Type: User
Type: User
Implementation of Papers on Adversarial Examples
Implementation of the AlignTTS
A Tensorflow implementation of AnimeGAN for fast photo animation ! This is the Open source of the paper 「AnimeGAN: a novel lightweight GAN for photo animation」, which uses the GAN framwork to transform real-world photos into anime images.
PyTorch implementation of AnimeGANv2
[Open Source]. The improved version of AnimeGAN. Landscape photos/videos to anime
PyTorch implementation of some attentions for Deep Learning Researchers.
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)
🔊 Text-Prompted Generative Audio Model
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis
N/A
Official implementation of BVAE-TTS
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
Conditional Diffusion Probabilistic Model for Speech Enhancement
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS
Compressed version of Tacotron 2 using Tensor Train + Waveglow.
Let us control diffusion models!
CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus
Deepstory turns a text/generated text into a video where the character is animated to speak your story using his/her voice.
A differential version of SPTK
🚀 PyTorch Implementation of "Progressive Distillation for Fast Sampling of Diffusion Models(v-diffusion)"
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speech
ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for which no expressive speech corpus is available.
This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building prosody encoder.
PyTorch Implementation of FastDiff (IJCAI'22)
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.