yzyouzhang

followers: 108 following: 76 repos: 27 gists: 0

Name: You Zhang

Type: User

Company: University of Rochester

Bio: PhD Student at Audio Information Research Lab @ UR

Twitter: yzyouzhang

Location: NY, US

Blog: https://yzyouzhang.com

You Zhang's Projects

air-asvspoof

Official implementation of the SPL paper "One-class Learning Towards Synthetic Voice Spoofing Detection"

asvspoof2021_air

Official implementation of our ASVspoof 2021 paper, "UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021"

audio_research_in_us

For students who would like to apply for RA, PhD, or postdoc positions in audio research.

awesome-audio-visual

A curated list of different papers and datasets in various areas of audio-visual processing

awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

chcochleagram

Cochleagram generation code in PyTorch

cs61bsp18-proj2-byog

Project BYoG for UCB course CS61B Data Structures Spring 2018

diffgan-tts

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

empirical-channel-cm

Official Implementation of our Interspeech 2021 paper "An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure Systems"

espnet

End-to-End Speech Processing Toolkit

fastdiff

PyTorch Implementation of FastDiff (IJCAI'22)

flowtron

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

hbas_chapter_voice3

Official implementation of the handbook chapter "Generalizing Voice Presentation Attack Detection to Unseen Synthetic Attacks and Channel Variation"

hrtf_field

Official implementation of the ICASSP 2023 paper "HRTF Field: Unifying Measured HRTF Magnitude Representation with Neural Fields"

hrtf_field_norm

Official Implementation of our WASPAA 2023 paper "Mitigating Cross-Database Differences for Learning Unified HRTF Representation"

info159-lhw4-chatbot

A PyTorch chatbot for INFO159 Natural Language Processing

mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

online-recurrent-extreme-learning-machine

Online-Recurrent-Extreme-Learning-Machine (OR-ELM) for time-series prediction, implemented in Python

phaseantispoofing_interspeech

Official repository of the Interspeech 2023 paper "Phase perturbation improves channel robustness for speech spoofing countermeasures"

samo

SAMO: Speaker Attractor Multi-Center One-Class Learning for Voice Anti-Spoofing

sasv_pr

Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"

serve

Serve, optimize and scale PyTorch models in production

singfake

Official Repository for "SingFake: Singing Voice Deepfake Detection"

speechemotionavlearning

speechtasks

This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent speech tool development, and speech applications.
