taalua's Projects

lpcnet_torch

Torch version of LPCNet

maskcyclegan-vc

Implementation of Kaneko et al.'s MaskCycleGAN-VC model for non-parallel voice conversion.

melnet

TensorFlow implementation of MelNet

mir-svc

Unsupervised WaveNet-based Singing Voice Conversion Using Pitch Augmentation and Two-phase Approach

mixture-of-experts

PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

msaf

Music Structure Analysis Framework

mtl-speaker-embeddings

Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" presented at Interspeech 2021

naver-ai-hackathon-speech

2019 Clova AI Hackathon: Speech - Rank 12 / Team Kai.Lib

neural-hmm

Neural HMMs are all you need (for high-quality attention-free TTS)

nn-vad

Simple DNN-based VAD

nonparaseq2seqvc_code

Implementation of non-parallel sequence-to-sequence VC

normalizing-flows

PyTorch implementation of normalizing flow models

parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch

pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

rnn-sound-classification

RNN implementation with TensorFlow (LSTM) to classify variable-length sound sequences

sova-tts-engine

speakerverifiaction-pytorch

Speaker Verification using PyTorch

speech-transformer

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

speech2singing

Implementation of the speech-to-singing paper from Interspeech 2020.

speechsubjectivetest

Speech (audio) subjective evaluation system

ssl_anti-spoofing

This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation".

ssqueezepy

Synchrosqueezing, wavelet transforms, and time-frequency analysis in Python

stereoeeg2speech

Code for a seq2seq architecture with Bahdanau attention designed to map stereotactic EEG data from human brains to spectrograms, using PyTorch Lightning.

taalua

Config files for my GitHub profile.

tacotron-wavernn

TTS (Tacotron + WaveRNN)

transferlearning-clvc

Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion

tt-vae-gan

Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source to that of another.

vectorquantizedcpc

Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion

voice-conversion

Open-source Octave toolbox for voice conversion

voicesmith

[WIP] VoiceSmith makes training text-to-speech models easy.
