Jiajie Chen's Projects
Async WebSocket Client. Advantage: Flexible Lighter and Faster
Audio Unit example using AVAudioEngine and Audiobus.
speech enhancement\speech seperation\sound source localization
Open source reference implementation of ITU-T P.1204.3
自己收集的一些电子书籍
A deep learning framework for Speech-Music discrimination of continuous audio streams
Coding Practice Using
Making large AI models cheaper, faster and more accessible
计算机前沿课笔记
WHU ComputerGraphic homework2
Note and lab of Computer Systems A Programer's Perspective 3e
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
open source、high performance、industrial rtsp streaming server,a lot of optimization on streaming relay,KeyFrame cache,RESTful,and web management,also EasyDarwin support distributed load balancing,a simple streaming media cloud platform architecture.高性能开源RTSP流媒体服务器,基于go语言研发,维护和优化:RTSP推模式转发、RTSP拉模式转发、录像、检索、回放、关键帧缓存、秒开画面、RESTful接口、WEB后台管理、分布式负载均衡,基于EasyDarwin构建出了一套基础的流媒体云视频平台架构!
ESC-50: Dataset for Environmental Sound Classification
End-to-End Speech Processing Toolkit
Github of the FaceForensics dataset
FFmpeg Debug Script for QP Values
Python draw figure codes
GNN learning project
Just for learning
实时语音转文字,以及录音文件转文字
ITU-T Rec. P.1203 Implementation
kaldi-asr/kaldi is the official location of the Kaldi project.
关于机器学习的内容
Map Api Development Project.