Topic: asr Goto Github
Some thing interesting about asr
Some thing interesting about asr
asr,A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui
User: abhirooptalasila
asr,OpenAI Whisper ASR Webservice API
User: ahmetoner
Home Page: https://ahmetoner.github.io/whisper-asr-webservice
asr,📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)
User: ailln
Home Page: https://www.dovolopor.com/cn2an
asr,Offline speech recognition for Android with Vosk library.
Organization: alphacep
asr,Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Organization: alphacep
asr,an open-source implementation of sequence-to-sequence based speech processing engine
Organization: athena-team
Home Page: https://athena-team.readthedocs.io
asr,faster_whisper GUI with PySide6
User: cheshirecc
asr,🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Organization: coqui-ai
Home Page: https://coqui.ai
asr,DELTA is a deep learning based natural language and speech processing platform.
Organization: delta-ml
Home Page: https://delta-didi.readthedocs.io/
asr,INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
User: dmitryryumin
asr,Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
User: freewym
asr,Open tools and data for cloudless automatic speech recognition
User: gooofy
asr,End-to-end ASR/LM implementation with PyTorch
User: hirofumi0810
asr,:speech_balloon: An On-Premises, Streaming Speech Recognition System
User: iceychris
Home Page: https://news.ycombinator.com/item?id=25099847
asr,This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.
User: innovatorved
Home Page: https://innovatorved-whisper-api.hf.space/
asr,This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headless browser, like other selenium based solutions do!
User: jdepoix
asr,HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
User: jonatasgrosman
asr,Speech-to-text server framework with next-gen Kaldi
Organization: k2-fsa
Home Page: https://k2-fsa.github.io/sherpa
asr,Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift
Organization: k2-fsa
Home Page: https://k2-fsa.github.io/sherpa/onnx/index.html
asr,A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
User: kaituoxu
asr,Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Organization: linto-ai
asr,WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
User: m-bain
asr,Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
User: mahmoudashraf97
asr,pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
User: mravanelli
asr,SincNet is a neural architecture for efficiently processing raw audio samples.
User: mravanelli
asr,A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Organization: nvidia
Home Page: https://docs.nvidia.com/nemo-framework/user-guide/latest/overview.html
asr,Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
Organization: openspeech-team
Home Page: https://openspeech-team.github.io/openspeech/
asr,Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Organization: paddlepaddle
Home Page: https://paddlespeech.readthedocs.io
asr,On-device streaming speech-to-text engine powered by deep learning
Organization: picovoice
Home Page: https://picovoice.ai/
asr,On-device speech-to-text engine powered by deep learning
Organization: picovoice
Home Page: https://picovoice.ai/
asr,Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
User: purfview
asr,A Python wrapper for Kaldi
Organization: pykaldi
Home Page: https://pykaldi.github.io
asr,Open STT
User: snakers4
asr,Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
User: snakers4
asr,[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
User: sooftware
asr,Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
User: sooftware
Home Page: https://sooftware.github.io/kospeech/
asr,A PyTorch-based Speech Toolkit
Organization: speechbrain
Home Page: http://speechbrain.github.io
asr,Chinese text normalization for speech processing
Organization: speechio
asr,The official repository of the Eesen project
Organization: srvk
Home Page: http://arxiv.org/abs/1507.08240
asr,Lingvo
Organization: tensorflow
asr,Production First and Production Ready End-to-End Speech Recognition Toolkit
Organization: wenet-e2e
Home Page: https://wenet-e2e.github.io/wenet/
asr,🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目,支持ChatGPT多轮对话能力,还可能是首个支持脑机交互的开源智能音箱项目。
User: wzpan
Home Page: https://wukong.hahack.com/
asr,html5 js 录音 mp3 wav ogg webm amr g711a g711u 格式,支持pc和Android、iOS部分浏览器、Hybrid App(提供Android iOS App源码)、微信,提供ASR语音识别转文字 H5版语音通话聊天示例 DTMF编码解码
User: xiangyuecn
Home Page: https://xiangyuecn.github.io/Recorder/
asr,Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。
User: yeyupiaoling
asr,基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows,Linux下训练和预测,支持Nvidia Jetson开发板预测。
User: yeyupiaoling
Home Page: https://yeyupiaoling.blog.csdn.net/article/details/102904306
asr,基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型
User: yeyupiaoling
asr,Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment
User: yeyupiaoling
asr,语音识别理论,论文和PPT
User: zw76859420
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.