Duo MA's Projects
Adaptive Attention Span in Transformers
HMM, CTC, RNN-Transducer, forward-backward algorithm
Implementation of AudioLM, a Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
A self-supervised learning framework for audio-visual speech
百度网盘命令行工具。The terminal utility for Baidu Network Disk.
更纯粹、更高压缩率的Tokenizer
A complete computer science study plan to become a software engineer.
CUDA 编程指南学习
Clustering-based methods for overlapping diarization
assignments for e6870 ASR class
My solution to course E6870 (Speech Recognition) of Columbia University.
End-to-End Neural Diarization
End-to-End Speech Processing Toolkit
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
FAIR Sequence Modeling Toolkit 2
fairseq_speechtext project focus on dataset and model part of multi-modual pretraining(i.e: speech and text) for research.
Fast and memory-efficient exact attention
A C++ standalone library for machine learning
Large, modern dataset for speech recognition
Training data simulation
This is the official location of the Kaldi project.