whitefu Goto Github PK
Type: User
Bio: speech synthesis & voice conversion & speech enhancement
Type: User
Bio: speech synthesis & voice conversion & speech enhancement
VALL-E-X-Trainer
将任意人的音色转换为成千上万种不同音色
Versatile audio super resolution (any -> 48kHz) with AudioSR.
Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
⚡ LLaMA-2 model experiment
Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列
A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy.
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
unofficial vits2-TTS implementation in pytorch
vits chinese, tts chinese, tts mandarin 史上训练最简单,音质最好的语音合成系统
Open source voice labeling application
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Zero-Shot Speech Editing and Text-to-Speech in the Wild
VoiceSmith makes training text to speech models easy.
Official implementation of VQMIVC: One-shot Voice Conversion @ Interspeech 2021
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
Unofficial Pytorch Implementation of WaveGrad2
Production First and Production Ready End-to-End Speech Recognition Toolkit
Text Normalization & Inverse Text Normalization
Whisper realtime streaming for long speech-to-text transcription and translation
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Family of instruction-following LLMs powered by Evol-Instruct: WizardLM, WizardCoder
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
YAYI 2 是中科闻歌研发的新一代开源大语言模型,采用了超过 2 万亿 Tokens 的高质量、多语言语料进行预训练。(Repo for YaYi 2 Chinese LLMs)
This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headless browser, like other selenium based solutions do!
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.