Code Monkey home page Code Monkey logo

taalua's Projects

maskcyclegan-vc icon maskcyclegan-vc

Implementation of Kaneko et al.'s MaskCycleGAN-VC model for non-parallel voice conversion.

melnet icon melnet

MelNet-Tensorflow implementation

mir-svc icon mir-svc

Unsupervised WaveNet-based Singing Voice Conversion Using Pitch Augmentation and Two-phase Approach

mixture-of-experts icon mixture-of-experts

PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

msaf icon msaf

Music Structure Analysis Framework

mtl-speaker-embeddings icon mtl-speaker-embeddings

Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" presented at Interspeech 2021

neural-hmm icon neural-hmm

Neural HMMs are all you need (for high-quality attention-free TTS)

parallelwavegan icon parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

pytorch_xvectors icon pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

speech-transformer icon speech-transformer

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

ssl_anti-spoofing icon ssl_anti-spoofing

This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation".

ssqueezepy icon ssqueezepy

Synchrosqueezing, wavelet transforms, and time-frequency analysis in Python

stereoeeg2speech icon stereoeeg2speech

Code for a seq2seq architecture with Bahdanau attention designed to map stereotactic EEG data from human brains to spectrograms, using the PyTorch Lightning.

taalua icon taalua

Config files for my GitHub profile.

transferlearning-clvc icon transferlearning-clvc

Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion

tt-vae-gan icon tt-vae-gan

Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source to that of another.

vectorquantizedcpc icon vectorquantizedcpc

Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion

voicesmith icon voicesmith

[WIP] VoiceSmith makes training text to speech models easy.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.