abylouw Goto Github PK
Name: Aby Louw
Type: User
Name: Aby Louw
Type: User
Code for paper A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing
Source code of APNet2, a vocoder
Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector
Audio Watermarking
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
An awesome README template to jumpstart your projects!
Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion
Unofficial implementation of ConvNeXt-TTS powered by lightning and Rye
A Cooperative Voice Analysis Repository for Speech Technologies
Non-parallel Voice Conversion
Modern builds for the 90s/00s DECtalk text-to-speech application.
Grapheme to phoneme conversion with deep learning.
VAE GAN modified from Descript Audio Codec, which replaces the RVQ with VAE
Style guide for the Digital Humanities Association of Southern Africa (DHASA) fourth conference, 2023.
Reference-aware automatic speech evaluation toolkit
Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units". https://arxiv.org/abs/2212.09730
Esphome component for Deye sun-12k-sg04lp3 to implement into home assistant
Flet enables developers to easily build realtime web, mobile and desktop apps in Python. No frontend experience required.
Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.
Implementation and experiments of graph embedding algorithms.
TTS via embedding Structural Graphs (HRGs) that capture linguistic information
HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
Official PyTorch implementation on ID-GAN: High-Fidelity Synthesis with Disentangled Representation by Lee et al., 2020.
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.