ishine's Projects
Speech to Sing project
The project for speech translation
S2T-Perceiver and Dynamic Latent Access
Speech2Vec Reality Check
[NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing" by Yonggan Fu, Yang Zhang, Kaizhi Qian, Zhifan Ye, Zhongzhi Yu, Cheng-I Lai, Yingyan Lin
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
PyTorch implementation of Structured State Space for Sequence Modeling (S4), based on Annotated S4.
(CVPR 2023)SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
A modified version of LPCNet
SALMONN: Speech Audio Language Music Open Neural Network
This is the public repository for eigenvector-based SALSA features for polyphonic sound event localization and detection.
[ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation
语音识别模型
Speech Separation
AddressSanitizer, ThreadSanitizer, MemorySanitizer
SC-CNN: An Effective Style Conditioning Method for Zero-Shot Text-to-Speech Systems
Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E
SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections
SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
用scrapy重写爬取新浪网站最新新闻即评论并更新以前的评论
爬取各个听书平台的专辑
Reproduction of "Scyclone" with PyTorch
computational graphs in C