wendongj Goto Github PK
Name: wendong
Type: User
Bio: work with attention
Name: wendong
Type: User
Bio: work with attention
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Simulation data from VCTK Corpus (version 0.92) for direction of arrival (DoA) estimation, and detailed data simulation process.
Vector (and Scalar) Quantization, in Pytorch
Video Swin Transformer - PyTorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
基于PyTorch的VITS-BigVGAN的tts中文模型,加入韵律预测模型。
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support streaming out!
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
本项目使用了EcapaTdnn模型实现的声纹识别
Voice Activity Detection based on Deep Learning & TensorFlow
This is official repository of new SOTA diffusion models based method for speech enhancement
VRT: A Video Restoration Transformer (official repository)
Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053
An ODE-based generative neural vocoder using Rectified Flow
WaveRNN Vocoder + TTS
Production first, nn-based on-device signal processing toolkit.
Production First and Production Ready End-to-End Text-to-Speech Toolkit
EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction
The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", which is submitted to Information Fusion.
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.