
semanticmask's Introduction

Hi there 👋

  • 🔭 I'm currently working on speech and natural language processing, especially large-scale pre-trained models.

  • 🎓 I obtained my Ph.D. degree at Beihang University, China. Now, I am a senior researcher at Microsoft Research Asia.

  • 📫 How to reach me: Wu.Yu at microsoft.com

  • 📄 Here are my selected publications:

    • Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
      • Chengyi Wang, Sanyuan Chen, Yu Wu (Corresponding author) , Ziqiang Zhang, Long Zhou, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei.
      • A language model based TTS system, which could clone your voice with a 3-second recording.
      • Demo and Paper
      • VALL-E X: a cross-lingual version of VALL-E that can help anyone speak a foreign language in their own voice.
    • WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
    • Response Generation by Context-aware Prototype Editing
      • Yu Wu, Furu Wei, Shaohan Huang, Yunli Wang, Zhoujun Li, Ming Zhou.
      • [Accepted in AAAI 2019] [code]
      • The first paper to study prototype-based response generation.
    • Sequential Matching Network: A New Architecture for Multi-turn Response Selection in Retrieval-Based Chatbots
      • Yu Wu, Wei Wu, Chen Xing, Ming Zhou, Zhoujun Li.
      • [Accepted in ACL 2017] [code]
      • The first paper to study multi-turn response selection.


semanticmask's People

Contributors

cywang97, markwunlp


semanticmask's Issues

how much the final loss?

Hi,
Thanks to your paper and this GitHub repo, I have re-implemented this project, but my final result is a WER of 3.5 on clean and 9.0 on other, and the final loss is about 30. What was your final loss?
Thank you in advance for your help.

Best wishes,
Ma

how to generate the new json file

I have generated the json file with espnet. But SemanticMask needs the start and end information added to the json file. How should I do this? Without it, espnet/utils/io_utils.py can't read the start and end info from "y_feats_dict". Thank you.

pre-trained RNN language model

Hello, I want to download the ESPnet pre-trained RNN language model, but the link does not work. How can I download the pre-trained RNN language model? Thanks very much!

Which value is used to fill the masked part?

Hi Dr.Wu,
I'm trying to reproduce the experimental results in the SemanticMask paper. I'm a little confused about the value used to fill the masked part. In the paper, it looks like you use zero to fill the masked part. But in this repo, the code fills it with the mean value of the feature. Which value should I use?
Thanks!
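
For readers comparing the two options raised in this question, here is a minimal sketch of filling a masked span with either zeros or the mean. It is an illustration only, not the repo's code: the function name `mask_span`, its parameters, and the choice of a scalar mean over the whole utterance are assumptions made for this example.

```python
import numpy as np


def mask_span(feats, start, end, fill="mean"):
    """Return a copy of feats (frames x dims) with frames [start, end) filled.

    Hypothetical helper for illustration; not taken from the SemanticMask repo.
    """
    out = feats.copy()
    if fill == "mean":
        # Assumption: a single scalar mean over the whole utterance.
        out[start:end] = feats.mean()
    elif fill == "zero":
        out[start:end] = 0.0
    else:
        raise ValueError(f"unknown fill mode: {fill}")
    return out


# Toy input: 4 frames, 3 feature dimensions.
feats = np.arange(12, dtype=float).reshape(4, 3)
masked_mean = mask_span(feats, 1, 3, fill="mean")
masked_zero = mask_span(feats, 1, 3, fill="zero")
```

If the input features have already been mean-normalized, the mean is close to zero and the two fill modes give nearly the same result; whether that applies here depends on the preprocessing actually used.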
