wangtianrui,Tianrui Wang (王天锐),github

paper2gui

Convert AI papers to GUIs, making it easy and convenient for everyone to use cutting-edge artificial intelligence technology.

perceptualaudio

Perceptual Metrics of Audio - perceptually relevant loss function. DPAM and CDPAM

phase-hpss

Code for harmonic/percussive source separation with MadTwinNet and phase recovery

phasen

An unofficial PyTorch implementation of Microsoft's PHASEN

poolformer

PoolFormer: MetaFormer is Actually What You Need for Vision

pygsound

Impulse response generation based on a state-of-the-art geometric sound propagation engine.

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

pysepm

Python implementation of performance metrics in Loizou's Speech Enhancement book

pytorch-pcen

PyTorch reimplementation of per-channel energy normalization for audio.

quadtreeattention

QuadTree Attention for Vision Transformers (ICLR2022)

robust-e2e-asr

This repository contains the code for our upcoming paper "An Investigation of End-to-End Models for Robust Speech Recognition" at ICASSP 2021.

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

sddnet

Rough implementation of the paper "A Simultaneous Denoising and Dereverberation Framework with Target Decoupling". On the DNS-2020 dataset, the DNSMOS is 3.42 after the first stage and 3.47 after the second stage.

semetrics

Speech Enhancement Metrics (PESQ, CSIG, CBAK, COVL)

MATLAB-based deep learning toolkit that supports arbitrary directed acyclic graphs (DAGs). Supports DNN, LSTM, and CNN layers and many signal processing layers. Includes recipes and examples of using the tool for various tasks.

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector

sounds_pretreatment

Implementations of sound preprocessing and feature extractors, with notes

specaugment-plus

A PyTorch implementation of the paper "SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification"

speech-emotion-analyzer

The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

speech-enhancement

Collection of papers, datasets and tools on the topic of Speech Dereverberation and Speech Enhancement

speech_enhancement_mmse-stsa

A statistical model-based Speech Enhancement Using MMSE-STSA

speech_recognition

Chinese speech recognition

sru

Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)

stationary-wavelet-packet-transform

An implementation of the stationary wavelet packet transform on top of PyWavelets
