

Perceptual Contrast Stretching on Target Feature for Speech Enhancement

This repo is only dedicated to the post-processing PCS.

Catalog

Introduction
PCS-tools
Scoring-tools
Online Post-processing PCS Demo
Citation
References

"PCS is derived based on the critical band importance function and applied to modify the targets of the SE model."
"It can also be used as a post-processing (PP) method to further sharpen the structure of enhanced speech and suppress residual noise."
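
As a rough illustration of the post-processing idea quoted above, the sketch below scales each frequency bin of a compressed magnitude spectrum by a per-bin weight and then inverts the compression. It is a minimal sketch under stated assumptions, not the code shipped in this repo: the `log1p` compression, the function name `pcs_stretch`, and the weight values are hypothetical placeholders. The real weights are derived from the critical band importance function, as described in the paper.

```python
import math


def pcs_stretch(magnitudes, weights):
    """Stretch one frame of spectral magnitudes with per-bin weights.

    Assumption: magnitudes are compressed with log(1 + x), scaled by the
    weight of their frequency bin, and then expanded back with exp(x) - 1.
    """
    if len(magnitudes) != len(weights):
        raise ValueError("magnitudes and weights must have the same length")
    stretched = []
    for mag, weight in zip(magnitudes, weights):
        compressed = math.log1p(mag)                        # compress
        stretched.append(math.expm1(weight * compressed))   # scale, then expand
    return stretched


# Hypothetical values for illustration only: four frequency bins of one frame,
# with the last bin treated as perceptually more important.
frame = [0.0, 0.5, 2.0, 2.0]
weights = [1.0, 1.0, 1.0, 1.2]
out = pcs_stretch(frame, weights)
print(out)
```

In this sketch a weight of 1.0 leaves a bin unchanged, while a weight above 1.0 raises the magnitude in that bin and so increases the contrast between it and its neighbors. The phase of the enhanced signal would be kept as-is when converting back to a waveform.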

More details can be found in the paper: http://arxiv.org/abs/2203.17152 (arXiv preprint; accepted by INTERSPEECH 2022)


Enhanced audio is generated by several baseline models, and post-processing PCS is then applied to each model's output.
The experimental results are as follows:

Some examples are shown below:

PCS-tools

The post-processing PCS tools are in the /PCS folder.
Use them to post-process enhanced audio with PCS.

Scoring-tools

Speech metric scores were computed with /speech_metrics.

Online Post-processing PCS Demo

https://lojoffy-pcs-online-demo-main-luu0rc.streamlitapp.com/

Citation:

If you find the code useful in your research, please cite:

@article{chao2022perceptual,
  title={Perceptual Contrast Stretching on Target Feature for Speech Enhancement},
  author={Chao, Rong and Yu, Cheng and Fu, Szu-Wei and Lu, Xugang and Tsao, Yu},
  journal={Proc. of INTERSPEECH},
  year={2022}
}

References:

SEGAN:

arXiv: https://arxiv.org/pdf/1703.09452.pdf

Wiener filter:

Wikipedia: https://en.wikipedia.org/wiki/Wiener_filter

Transformer T(c) / T(nc):

arXiv: https://arxiv.org/pdf/2006.10296.pdf

CRNN:

arXiv: https://arxiv.org/pdf/1805.00579.pdf

MetricGAN+:

arXiv: https://arxiv.org/pdf/2104.03538.pdf
From SpeechBrain: https://huggingface.co/speechbrain/metricgan-plus-voicebank

DPT-FSNet:

arXiv: https://arxiv.org/pdf/2104.13002.pdf
Reproduced and denoted as DPT*
