wendongj,wendong,github

Practice on cifar100(ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext,ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet, NasNet, Residual Attention Network, SENet, WideResNet)

pytorch-speech-features

pytorch_lightning_template_for_beginners

A pytorch template for beginners based on pytorch_lightning

pywsj0-mix

wsj0-{2, 3, 4, 5} mix generation scripts, in Python.

research-and-analysis-of-speech-enhancement-or-dereverberation

This repository contains some material of speech enhancement and dereverberation. On the one hand, I summarize this work for my further understanding. On the other hand, I hope that all beginners or masters interested in speech enhancement can ask me questions and make progress together. A lot of my summary is not very good, I hope you put forward corrections!

resgrad

Unofficial implementation of ResGrad as a new and high quality TTS model

room-simulation

Supporting code for the paper "A study on more realistic room simulation for far-field keyword spotting".

rvae-em

Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function"

sc-cnn

SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems

sc_vall-e

Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E

sdcm

seed-tts-eval

self_attention_alignment

Deep model with built-in self-attention alignment for acoustic echo cancellation, Pytorch implement

sensevoice

Multilingual Voice Understanding Model

sgmse

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

sinet

Unofficial Tensorflow 2 implementation of SINet: Extreme Lightweight Portrait Segmentation Networks with Spatial Squeeze Modules and Information Blocking Decoder