
Aby Louw's Projects

a3t

Code for paper A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing

apnet2

Source code of APNet2, a vocoder

audioseal

Localized watermarking for AI-generated speech audio, with state-of-the-art robustness and a very fast detector

autovc

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

consistencyvc-voive-conversion

Uses a jointly trained speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

convnext_tts

Unofficial implementation of ConvNeXt-TTS powered by lightning and Rye

covarep

A Cooperative Voice Analysis Repository for Speech Technologies

crank

Non-parallel Voice Conversion

dectalk

Modern builds for the 90s/00s DECtalk text-to-speech application.

dhasa2023_styleguide

Style guide for the Digital Humanities Association of Southern Africa (DHASA) fourth conference, 2023.

dissc

Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units". https://arxiv.org/abs/2212.09730

esphome-for-deye

ESPHome component for integrating the Deye sun-12k-sg04lp3 into Home Assistant

flet

Flet enables developers to easily build realtime web, mobile and desktop apps in Python. No frontend experience required.

fragmentvc

Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention

freevc

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

glow_tts

An implementation of the GlowTTS model. Several modes are added: speaker embedding, prosody encoder (GST), and gradient reversal.

graphneuraltts

TTS via embedding Structural Graphs (HRGs) that capture linguistic information

hubert

HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

idgan

Official PyTorch implementation of ID-GAN: High-Fidelity Synthesis with Disentangled Representation by Lee et al., 2020.

ims-toucan

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. The development objectives are simplicity, modularity, controllability and multilinguality.

istftnet

iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
