tugstugi / mongolian-speech-recognition
Mongolian speech recognition with PyTorch
Home Page: https://www.chimege.mn/
CommonVoice localization: https://pontoon.mozilla.org/mn/common-voice
Hi tugstugi, thanks for sharing your work on speech recognition. I have some questions about the code:
Predictions on the validation set look already good:
EXPECTED:
аливаа цус хувцсан дээр үсрэхэд цус үсэрсэн хэсгийг та нар ариун газарт угаагтун
PREDICTED:
аливаа ус хусан ээр үсэрэхэ ус үсэрсан хэсгийг та нар ариун газарт угаагтун
Now, increase the network model size and add some dropout to see whether the above mistakes can be fixed.
Could you explain why logfbank from python_speech_features needs to be used? (For example, why was mfcc not used?)
Can I assume that the default values of winlen, winstep, preemph and so on are the best values? (I don't really understand how to tune them.)
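For readers with the same question, here is a minimal sketch of what these parameters control. It assumes the documented python_speech_features defaults (samplerate=16000, winlen=0.025, winstep=0.01, preemph=0.97). The helper names `preemphasis` and `num_frames` are hypothetical, and this is plain Python written for illustration, not the library's own code. MFCCs are obtained by applying a DCT to log filterbank energies, so logfbank is simply the less processed of the two representations. Whether that is the reason it was chosen here is not stated in this thread.

```python
import math


def preemphasis(signal, coeff=0.97):
    """Boost high frequencies: y[0] = x[0], y[t] = x[t] - coeff * x[t-1]."""
    if not signal:
        return []
    return [signal[0]] + [
        signal[t] - coeff * signal[t - 1] for t in range(1, len(signal))
    ]


def num_frames(num_samples, samplerate=16000, winlen=0.025, winstep=0.01):
    """Number of analysis windows produced for a signal of num_samples samples.

    winlen is the window length in seconds, winstep the hop between
    consecutive windows in seconds.
    """
    frame_len = int(round(winlen * samplerate))    # 400 samples at 16 kHz
    frame_step = int(round(winstep * samplerate))  # 160 samples at 16 kHz
    if num_samples <= frame_len:
        return 1
    return 1 + math.ceil((num_samples - frame_len) / frame_step)


if __name__ == "__main__":
    # One second of 16 kHz audio with the default window and step.
    print(num_frames(16000))
    # Halving winstep roughly doubles the number of feature frames.
    print(num_frames(16000, winstep=0.005))
```

A smaller winstep gives the network more frames per second of audio at a higher compute cost, which is one reason these values are usually tuned on validation error rather than assumed optimal.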
Did you try these languages? Thank you.
Hello,
I wanted to run your work and study it a little, but I got stuck at the installation step. 1) In the Dockerfile, is workspace your own file? If not, is workspace a file that should be in the GitHub repository?
2) I am also getting an Apex error:
Traceback (most recent call last):
  File "train.py", line 12, in <module>
    import apex
  File "/Library/Frameworks/Python.framework/Versions/3.7/lib/python3.7/site-packages/apex/__init__.py", line 18, in <module>
    from apex.interfaces import (ApexImplementation,
  File "/Library/Frameworks/Python.framework/Versions/3.7/lib/python3.7/site-packages/apex/interfaces.py", line 10, in <module>
    class ApexImplementation(object):
  File "/Library/Frameworks/Python.framework/Versions/3.7/lib/python3.7/site-packages/apex/interfaces.py", line 14, in ApexImplementation
    implements(IApex)
  File "/Library/Frameworks/Python.framework/Versions/3.7/lib/python3.7/site-packages/zope/interface/declarations.py", line 706, in implements
    raise TypeError(_ADVICE_ERROR % 'implementer')
TypeError: Class advice impossible in Python3. Use the @Implementer class decorator instead.
Could you help me with the installation?
Thank you.
The storage bucket used to pull the Mongolian Bible dataset no longer has the Mongolian version available for download.
If anyone still has a copy of the original .zip files, I would be eternally grateful.
ERROR conda.cli.main_run:execute(33): Subprocess for 'conda run ['python3', 'dl_mbspeech.py']' command failed. (See above for error)
downloading https://s3.us-east-2.amazonaws.com/bible.davarpartners.com/Mongolian/01_Genesis.zip...
extracting '01_Genesis.zip'...
2MB [00:00, 766.57MB/s]
Traceback (most recent call last):
  File "/Users/xd/Code/mongolian-speech-recognition/datasets/dl_mbspeech.py", line 37, in <module>
    zipfile = ZipFile(bible_book_file_path)
  File "/opt/homebrew/Caskroom/miniforge/base/lib/python3.9/zipfile.py", line 1257, in __init__
    self._RealGetContents()
  File "/opt/homebrew/Caskroom/miniforge/base/lib/python3.9/zipfile.py", line 1324, in _RealGetContents
    raise BadZipFile("File is not a zip file")
zipfile.BadZipFile: File is not a zip file
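The traceback shows that the download step saved something that is not a zip archive. As a hedged sketch, a download script could check the file before extracting and report what the server actually returned. The helper name `open_zip_checked` is hypothetical and is not part of dl_mbspeech.py; only the standard-library `zipfile.is_zipfile` and `zipfile.ZipFile` calls are used.

```python
import zipfile


def open_zip_checked(path, preview_bytes=200):
    """Open a zip archive, or fail with a message showing the file's first bytes.

    When a remote file is gone, servers often answer with an error page
    that gets saved under the expected .zip name. Showing the start of the
    file makes that situation obvious.
    """
    if not zipfile.is_zipfile(path):
        with open(path, "rb") as f:
            head = f.read(preview_bytes)
        raise RuntimeError(
            f"{path} is not a zip archive; it starts with: {head!r}"
        )
    return zipfile.ZipFile(path)
```

This does not bring the dataset back; it only turns the BadZipFile error into a message that says what was downloaded instead.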
Hi, thank you for your great work. I wonder why you think the Noam scheduler works well for this case (Mongolian).
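For context, the Noam schedule is the learning-rate rule from the Transformer paper: a linear warmup followed by inverse square-root decay. The sketch below shows the formula only. The values of `model_size`, `warmup_steps` and `factor` are illustrative assumptions, not the settings used in this repository.

```python
def noam_lr(step, model_size=512, warmup_steps=4000, factor=1.0):
    """Learning rate at a given optimizer step under the Noam schedule.

    lr = factor * model_size^-0.5 * min(step^-0.5, step * warmup_steps^-1.5)
    The rate rises linearly for warmup_steps steps, peaks there, and then
    decays proportionally to 1 / sqrt(step).
    """
    step = max(step, 1)  # avoid division by zero at step 0
    return factor * model_size ** -0.5 * min(
        step ** -0.5, step * warmup_steps ** -1.5
    )


if __name__ == "__main__":
    for s in (1, 1000, 4000, 16000):
        print(s, noam_lr(s))
```

The schedule is independent of the language being recognized; it depends only on the step count and the chosen constants.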
AttributeError: module 'librosa' has no attribute 'output'
This error occurs, so I downgraded as shown below and it now works:
pip3 install librosa==0.6.3
pip3 install numba==0.48
The network already outputs recognizable text after 30 minutes, or 10 epochs, of training:
expected:
аливаа цус хувцсан дээр үсрэхэд цус үсэрсэн хэсгийг та нар ариун газарт угаагтун
predicted:
аааааааааааалллллллливвваааааааааааааааааа ууусссс ххххууввссаанн гэээрррррррррр үүсссэррррррххх ттуусссуррссрррссссаннн хххээссссггийгг ттаааааааааааааааааааа ннаарррррррр ааааааааааааааааааааарррииинннннн гггаааааааааааааарррррртт ууггааааааааааааааааааагтттүүнннррррааааа
To collapse the repeated characters and to choose the most likely word sequence, we need to train a language model using KenLM.
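The collapsing step on its own can be sketched without a language model. The snippet below is a minimal greedy CTC-style collapse: merge consecutive repeats, then drop the blank symbol. It assumes the network emits one label per frame and uses a blank label, written here as `_`. That symbol and the function name `ctc_collapse` are assumptions for illustration and are not taken from this repository. Choosing the most likely word sequence is the part where a KenLM model would come in, and it is not shown.

```python
from itertools import groupby


def ctc_collapse(frame_labels, blank="_"):
    """Greedy CTC collapse of a per-frame label sequence.

    1. Merge runs of identical consecutive labels into one label.
    2. Remove the blank label, which separates genuine double letters.
    """
    merged = [label for label, _run in groupby(frame_labels)]
    return "".join(label for label in merged if label != blank)


if __name__ == "__main__":
    print(ctc_collapse("ааа_лл_ии_вв_аааа_а"))
```

Without a blank between them, two identical letters in a row cannot be told apart from one long letter, which is why the blank label matters for words with doubled characters.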
Dear Erdene-Ochir Tuguldur,
You are doing a great job! I also mean your TTS project.
The code is so plain and training is so quick, yet the solutions are powerful and effective.
May I know your best loss results for this project?
I recently developed a Kaldi ASR solution for my language.
So I would like to know whether it is possible to reach similar results with your speech recognition project.
Thank you in advance!
Add 5000 sentences to CommonVoice
Hi Erdene-Ochir Tuguldur,
thanks for sharing your work.
I am trying to start training for the first time (Windows 10, Conda, single GPU), but I am getting this runtime error:
RuntimeError:
An attempt has been made to start a new process before the
current process has finished its bootstrapping phase.
This probably means that you are not using fork to start your
child processes and you have forgotten to use the proper idiom
in the main module:
    if __name__ == '__main__':
        freeze_support()
        ...
The "freeze_support()" line can be omitted if the program
is not going to be frozen to produce an executable.
I can see that I need to add some kind of guard in train.py to avoid recursive subprocess creation, but I couldn't find the exact place.
Please advise. Thank you in advance.
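On Windows, child processes are started by re-importing the main module, so any top-level code that launches workers runs again in every child and triggers this error. A minimal sketch of the idiom from the error message follows. It assumes the worker-starting code of the script (for example a data loader with worker processes) can be moved into a function. `square` and `main` are hypothetical stand-ins and are not taken from train.py.

```python
import multiprocessing as mp


def square(x):
    """Stand-in for work done in a child process; must live at module level."""
    return x * x


def main():
    """Everything that starts worker processes goes inside this function."""
    with mp.Pool(processes=2) as pool:
        return pool.map(square, range(4))


if __name__ == "__main__":
    # Only the parent process enters this block. Children that re-import
    # the module skip it, so they do not start workers of their own.
    mp.freeze_support()  # only needed for frozen Windows executables
    print(main())
```

For a training script, the same pattern means wrapping the existing top-level code in a function and calling it under the guard at the bottom of the file.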