Code Monkey home page Code Monkey logo

You Zhang's Projects

air-asvspoof icon air-asvspoof

Official implementation of the SPL paper "One-class Learning Towards Synthetic Voice Spoofing Detection"

asvspoof2021_air icon asvspoof2021_air

Official implementation of our ASVspoof 2021 paper, "UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021"

awesome-audio-visual icon awesome-audio-visual

A curated list of different papers and datasets in various areas of audio-visual processing

awesome-diarization icon awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

diffgan-tts icon diffgan-tts

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

empirical-channel-cm icon empirical-channel-cm

Official Implementation of our Interspeech 2021 paper "An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure Systems"

espnet icon espnet

End-to-End Speech Processing Toolkit

fastdiff icon fastdiff

PyTorch Implementation of FastDiff (IJCAI'22)

flowtron icon flowtron

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

hbas_chapter_voice3 icon hbas_chapter_voice3

Official implementation of the handbook chapter "Generalizing Voice Presentation Attack Detection to Unseen Synthetic Attacks and Channel Variation"

hrtf_field icon hrtf_field

Official implementation of the ICASSP 2023 paper "HRTF Field: Unifying Measured HRTF Magnitude Representation with Neural Fields"

hrtf_field_norm icon hrtf_field_norm

Official Implementation of our WASPAA 2023 paper "Mitigating Cross-Database Differences for Learning Unified HRTF Representation"

mellotron icon mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

phaseantispoofing_interspeech icon phaseantispoofing_interspeech

Official repository of the Interspeech 2023 paper "Phase perturbation improves channel robustness for speech spoofing countermeasures"

samo icon samo

SAMO: SPEAKER ATTRACTOR MULTI-CENTER ONE-CLASS LEARNING FOR VOICE ANTI-SPOOFING

sasv_pr icon sasv_pr

Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"

serve icon serve

Serve, optimize and scale PyTorch models in production

singfake icon singfake

Official Repository for "SingFake: Singing Voice Deepfake Detection"

speechtasks icon speechtasks

This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent speech tool development, and speech applications.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.