Code Monkey home page Code Monkey logo

Hi there 👋

  • 🔭 I’m currently working on speech and natural language processing, especially large-scale pre-trained models.

  • 🎓 I obtained my Ph.D. degree at Beihang University, China. Now, I am a senior researcher at Microsoft Research Asia.

  • 📫 How to reach me: Wu.Yu at microsoft.com

  • 📄 Here are my selected publications:

    • Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
      • Chengyi Wang, Sanyuan Chen, Yu Wu (Corresponding author) , Ziqiang Zhang, Long Zhou, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei.
      • A language model based TTS system, which could clone your voice with a 3-second recording.
      • Demo and Paper
      • VALL-E X a cross-lingual version VALL-E that can help anyone speak a foreign language in their own voice.
    • WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
    • Response Generation by Context-aware Prototype Editing
      • Yu Wu, Furu Wei, Shaohan Huang, Yunli Wang, Zhoujun Li, Ming Zhou.
      • [Accepted in AAAI 2019] [code]
      • The first paper studies prototype based response generation.
    • Sequential Matching Network: A New Architecture for Multi-turn Response Selection in Retrieval-Based Chatbots
      • Yu Wu, Wei Wu, Chen Xing, Ming Zhou, Zhoujun Li.
      • [Accepted in ACL 2017] [code]
      • The first paper studies multi-turn response selection.

MarkWuNLP's github stats

Yu Wu (吴俣)'s Projects

academicpage icon academicpage

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

data4stylizeds2s icon data4stylizeds2s

The repository contains resources of our paper published at AAAI 2020 ``A Dataset for Low-Resource Stylized Sequence-to-Sequence Generation "

demucs icon demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

dgk_lost_conv icon dgk_lost_conv

dgk_lost_conv 中文对白语料 chinese conversation corpus

hed-dlg-truncated icon hed-dlg-truncated

Hierarchical Encoder Decoder RNN (HRED) with Truncated Backpropagation Through Time (Truncated BPTT)

kehnn icon kehnn

Source code of Knowledge Enhanced Hybrid Neural Network for Text Matching

nlp-progress icon nlp-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

nmtlab icon nmtlab

A Pytorch-based Neural Machine Translation Framework for Research

parlai icon parlai

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

responseedit icon responseedit

Resources of our paper at AAAI-19 ``Response Generation by Context-aware Prototype Editing"

semanticmask icon semanticmask

The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"

seqgan icon seqgan

Implementation of Sequence Generative Adversarial Nets with Policy Gradient

tacntn icon tacntn

Source code of Response Selection with Topic Clues for Retrieval-based Chatbots

unilm icon unilm

UniLM AI - Large-scale Self-supervised Pre-training across Tasks, Languages, and Modalities

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.