hbwu-ntu

followers: 119, following: 108, repos: 41, gists: 0

Type: User

Company: National Taiwan University

Bio: 🏠 Ph.D. student at NTU working on speech processing and machine learning. 💻 Contributor of S3PRL.

Location: Seattle, WA, US

Blog: https://hbwu-ntu.github.io/

hbwu-ntu's Projects

adain-vc

An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization".

advattacksasvspoof

This is the implementation of the paper "Adversarial Attacks on Spoofing Countermeasures of Automatic Speaker Verification".

amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

asv-anti-spoofing-with-res2net

Implementation of the paper: Replay and Synthetic Speech Detection with Res2Net architecture (ICASSP 2021) https://arxiv.org/abs/2010.15006

audiodecbenchmark

Audio Codec Benchmark

audiowmark

Audio Watermarking

autospeech

[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang

cmgan

Conformer-based Metric GAN for speech enhancement

conv-tasnet-pytorch

A PyTorch implementation of Conv-TasNet

deepcomplexcrn

deepcomplexunetpytorch

Implementation of Deep Complex UNet Using PyTorch

distillloss

dns_mos_calculate

Code for calculating DNS_MOS.

ecapa-tdnn

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)

face_trainer

Face embedding trainer

facodec

Training code for FAcodec presented in NaturalSpeech3

fullsubnet-plus

The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".

gagnet

This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for Single-channel Speech Enhancement", which was accepted by Elsevier Applied Acoustics.