Code Monkey home page Code Monkey logo

robustger's Introduction

Large Language Models are Efficient Learners of Noise-Robust Speech Recognition

[Paper] [Data] [Model]

This work extends the latest ASR generative error correction (GER) benchmark to noise-robust ASR with a Robust HyPoradise dataset, and it proposes a language-space denoising approach for GER that has achieved a new breakthrough.

Conda Environment Configuration

Our code is built based on lit-gpt, please refer to official tutorial to build a conda environment. Then, please install the required packages using following command:

pip install -r requirements.txt

Code

  • Model code: lit_gpt/robust_ger.py;
  • Training script: finetune.sh;
  • Inference script: infer.sh;

To run the training or inference script, you need to enter the scripts (including .sh and the called .py files) and modify all the absolute paths of data, model, and experiment directory to be your own (Hint: search for "~/RobustGER"). Then, directly run the .sh script using bash command.

Models

  • For LLMs, please refer to tutorial for configuration steps, which support many mainstream LLMs like LLaMA-2;
  • For well-trained adapter checkpoints, please refer to our HuggingFace repo.

Dataset

We have released our Robust HyPoradise dataset at HuggingFace.

References

@inproceedings{hu2024large,
  title={Large Language Models are Efficient Learners of Noise-Robust Speech Recognition},
  author={Hu, Yuchen and Chen, Chen and Yang, Chao-Han Huck and Li, Ruizhe and Zhang, Chao and Chen, Pin-Yu and Chng, Eng Siong},
  booktitle={International Conference on Learning Representations},
  year={2024}
}

@inproceedings{chen2023hyporadise,
  title={HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models},
  author={Chen, Chen and Hu, Yuchen and Yang, Chao-Han Huck and Siniscalchi, Sabato Marco and Chen, Pin-Yu and Chng, Eng Siong},
  booktitle={Advances in Neural Information Processing Systems},
  year={2023}
}

robustger's People

Contributors

yuchen005 avatar

Stargazers

 avatar Yizirui Fang avatar kk avatar  avatar Xiaolei avatar Alex Wu avatar Kento Nozawa avatar Sakshi Kumar avatar  avatar Tower avatar  avatar Mengjie Du avatar Jiaming Zhou avatar  avatar Yuanhao Li avatar A Normal Programmer avatar 酸菜鱼 Mai Yu avatar Elaine Tan avatar 幻城 avatar Xuper avatar wym keith avatar iacker avatar Mostafa Samy avatar  avatar  avatar  avatar 刘译蓬 avatar James avatar  avatar Maynor avatar 白马非马 avatar Ether Line avatar Steven Nelson avatar Neo Flow avatar xy avatar  avatar 龙佚 avatar ProgrammerUnknown avatar  avatar  avatar WangYiChen avatar zhangYin avatar Nate River avatar  avatar  avatar  avatar porschebz avatar  avatar DooDay avatar Jaime Lee avatar  avatar  avatar guanglinmei avatar Z avatar Chris avatar 0xLemon avatar  avatar wannature avatar  avatar Nikita Zhenev avatar Jeff Carpenter avatar Xuanru Zhou avatar phurich avatar Alef Iury avatar  avatar  avatar ZXMu avatar Ana Nomie avatar Kim Lee avatar Yuanli Xiao avatar Chen Xue avatar 宇化成仙 avatar 女装研究家『雯饕』 avatar 幼馴染 avatar Asmodee Digital avatar 香茵波克 avatar Control System Technologies avatar 察哈尔滨 avatar Simon Cai avatar notegeek avatar  avatar Phan Tường Lân avatar  avatar  avatar Thiri Kyaw avatar Jayakumar Barn avatar Gerth Broberg avatar Zoff Peng avatar Ming Chen avatar 无论何时都选择我 avatar 天使小宝贝 avatar Dan avatar The Coder Cat avatar 青年科学家 avatar No.67 avatar 木漏れ日 avatar int const change; avatar Senior Algorithm Developer avatar kkk avatar Zhida Chen avatar

Watchers

Nickolay V. Shmyrev avatar Ewald Enzinger avatar Kostas Georgiou avatar Sofian Mejjoute avatar  avatar  avatar

robustger's Issues

Token-level noise embedding

Hi,

Thank you for releasing your implementation. Can you please share scripts to compute token-level noise embedding as mentioned in the paper?

When will the code be updated

This is a good piece of work, thanks to the author, but there are still some errors in the code. When will the code be updated?

ROBUST HYPORADISE DATASET training splits

Hello,

Great work! I wanted to ask for the ROBUST HYPORADISE DATASET training splits. I checked the HF page linked in README, and it looks like only the test sets were released. Any help would be appreciated!

Thank You again!

Error while running

Hello,

Great work! I face this error while running finetune.sh:

from lit_gpt.model import KVCache, RoPECache, apply_rope
ImportError: cannot import name 'RoPECache' from 'lit_gpt.model'

any help would be appreciated!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.