whitefu Goto Github PK

followers: 39.0 following: 431.0 repos: 487.0 gists: 0.0

Type: User

Bio: speech synthesis & voice conversion & speech enhancement

whitefu's Projects

2019-bdci-financialentitydiscovery

2019 BDCI互联网金融新实体发现

academicodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

ace_phonemes

a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine

This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.

ai-audio-startups

Community list of startups working with AI in audio and music technology

ai-song-cover-rvc

All in One Version : Youtube WAV Download, Separating Vocal, Splitting Audio, Training, and Inference Using Google Colab

ai-video-translation

A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.

alpaca-lora

Instruct-tune LLaMA on consumer hardware

amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

apnet2

Source code of APNet2, a vocoder

athena

an open-source implementation of sequence-to-sequence based speech processing engine

atlas

Principled instruction dataset on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171

attentions-in-tacotron

atvgnet

CVPR 2019

audio-dataset

Audio Dataset for training CLAP and other models

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

audio-pipeline

audio-preprocess

Preprocess Audio for training

audio-preprocessing-scripts

数据集制作-从录播到伴奏分离到切片脚本

audio-retrieval

Implementation of "Audio Retrieval with Natural Language Queries", INTERSPEECH 2021, PyTorch

audio-slicer

Python script that slices audios with silence detection

audio2face

http://www.facegood.cc

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

whitefu Goto Github PK

whitefu's Projects

Recommend Projects

Recommend Topics

Recommend Org