Code Monkey home page Code Monkey logo

Sang-Hoon Lee

I will join the Department of Software and Computer Engineering at Ajou University as an Assistant Professor in Mar. 2024 (SAIL, Speech AI Lab.). I'm currently a postdoctoral researcher in AI Research Center, Korea University, Seoul, South Korea. I received the Ph.D. degree in the Department of Brain and Cognitive Engineering, Korea University in 2023. In March 2016, I started my integrated M.S.&Ph.D. in Pattern Recognition & Machine Learning (PRML) Lab at the Korea University in Seoul, Korea, under the supervision of Seong-Whan Lee.

πŸ‘€ Research Interests

  • Speech Synthesis (2019~, HierSpeech++, DDDM-VC)
  • Audio Generation (2023~, DDDM-Mixer)
  • Singing Voice Synthesis (2022~, MIDI-Voice, HiddenSinger)
  • Speech-to-Speech Translation (2023~, TranSentence)
  • Brain-Computer Interface (2019~2020, Brain-to-Speech System)
  • Reinforcement Learning (2017~2018, AI Curling Robot Curly)

βœ” News

  • 2024.04: One paper has been accepted to TASLP (DiffProsody)
  • 2024.01: One paper has been accepted to ICASSP 2024 (LIMMITS'24, ICASSP SP Grand Challenges)
  • 2023.12: One paper has been accepted to TASLP (Fre-Painter)
  • 2023.12: Two papers have been accepted to ICASSP 2024 (TranSentence, MIDI-Voice)
  • 2023.12: One paper has been accepted to AAAI 2024 (DDDM-VC)
  • 2023.11: We release HierSpeech++, Zero-shot Speech Synthesis models for Zero-shot TTS, Zero-shot VC, and Speech Super-resolution. [Demo] [Code] [Gradio]
  • 2023.06: We release HiddenSinger for High-quality singing voice synthesis system. This project was funded by Netmarble AI Center, Netmarble Corp. in 2022.

πŸŽ‰ Publications

Arxiv

2024

2023

2022

2021

~2020

Patents (KR)

  • "METHOD TO TRANSFORM VOICE,", 10-2439022, 29, Aug., 2022.
  • "METHOD AND APPARTUS FOR VOICE CONVERSION BY USING NEURAL NETWORK," 10-2340486, 14, Dec., 2021.
  • "SYSTEM AND METHOD FOR CURLING SWEEPING CONTROL," 10-2257358, 21, May, 2021.
  • "APPARATUS AND METHOD FOR RECOMMENDATION OF CURLING GAME STRATEGY USING DEEP LEARNING," 10-2045567, 11, Nov., 2019.
  • "APPARATUS AND METHOD FOR DELIVERY AND SWEEPING AT CURLING GAME," 10-1948713, 11, Feb., 2019.

✨ Educations

2016.03-2023.02: Integrated M.S.&Ph.D, Dept. of Brain and Cognitive Engineering, Korea University

2012.03-2016.02: B.S, Dept. of Life Science, Dongguk University

🎁 Awards and Services

Reviewer: NeurIPS 2023, ICLR 2024, ICASSP 2024, ICML 2024, IEEE/ACM Transactions on Audio, Speech, and, Language Processing

2022.02.25: Paper Award (Multi-SpectroGAN: High-Diversity and High-Fidelity Spectrogram Generation with Adversarial Style Combination for Speech Synthesis), Korea University

πŸŽ™Invited Talks

2024.06.07: Speech Synthesis, 제2회AIμœ΅ν•©μ›Œν¬μˆ, Ajou University

2024.05.24: Speech Language Model for Generative AI, KSCS2024

2023.08.18: Towards Unified Speech Synthesis for Text-to-Speech and Voice Conversion, Deepbrain AI

2023.08.11: Towards Unified Speech Synthesis for Text-to-Speech and Voice Conversion, Workshop on Brain and Artificial Intelligence 2023

2023.06.20: HierSpeech: Bridging the Gap between Text and Speech by Hierarchical Variational Inference using Self-supervised Representations for Speech Synthesis, Top Conference Session in KCC2023

2022.08.19: VoiceMixer: Adversarial Voice Style Mixup, AIGS Symposium 2022

2022.07.01: VoiceMixer: Adversarial Voice Style Mixup, Top Conference Session in KCC2022

2021.12.02: Voice Conversion, Netmarble

2021.07.29: Speech Synthesis and Voice Conversion, Neosapience

Sang-Hoon Lee's Projects

bigvgan icon bigvgan

Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.