
asr-evaluation's Introduction

asr_evaluation


Python module for evaluating ASR hypotheses (i.e. computing word error rate and word recognition rate).

This module depends on the editdistance project, for computing edit distances between arbitrary sequences.

The formatting of this program's output is loosely modeled on the align.c program commonly used within the Sphinx ASR community. The evaluation may run a bit faster if neither instances nor confusions are printed.

Please let me know if you have any comments, questions, or problems.

Output

The program outputs three standard measurements (an illustrative sketch of how they are computed follows this list):

  • Word error rate (WER)
  • Word recognition rate (the number of matched words in the alignment divided by the number of words in the reference).
  • Sentence error rate (SER) (the number of incorrect sentences divided by the total number of sentences).
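
The sketch below illustrates how these three figures can be derived from a word-level alignment. It is not the module's internal code (the module relies on its editdistance dependency for the alignment), and the helper names are made up for the example.

def align_counts(ref, hyp):
    """Return (edit_errors, matched_words) for two word lists, using a standard
    Levenshtein DP plus a backtrace over one optimal alignment."""
    n, m = len(ref), len(hyp)
    d = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        d[i][0] = i
    for j in range(m + 1):
        d[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,          # deletion
                          d[i][j - 1] + 1,          # insertion
                          d[i - 1][j - 1] + sub)    # match or substitution
    # Backtrace to count matched words along one optimal alignment.
    i, j, matches = n, m, 0
    while i > 0 or j > 0:
        sub = 0 if (i > 0 and j > 0 and ref[i - 1] == hyp[j - 1]) else 1
        if i > 0 and j > 0 and d[i][j] == d[i - 1][j - 1] + sub:
            matches += 1 - sub
            i, j = i - 1, j - 1
        elif i > 0 and d[i][j] == d[i - 1][j] + 1:
            i -= 1
        else:
            j -= 1
    return d[n][m], matches

def evaluate(pairs):
    """pairs: iterable of (reference_words, hypothesis_words) tuples."""
    errors = matches = ref_tokens = bad_sentences = sentences = 0
    for ref, hyp in pairs:
        e, m = align_counts(ref, hyp)
        errors += e
        matches += m
        ref_tokens += len(ref)
        sentences += 1
        bad_sentences += 1 if e else 0
    wer = errors / ref_tokens          # word error rate
    wrr = matches / ref_tokens         # word recognition rate
    ser = bad_sentences / sentences    # sentence error rate
    return wer, wrr, ser

For example, evaluate([("i have a dog".split(), "i have dog".split())]) returns (0.25, 0.75, 1.0): one deletion out of four reference words, three matched words, and one erroneous sentence out of one.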

Installing & uninstalling

The easiest way to install is using pip:

pip install asr-evaluation

Alternatively you can clone this git repo and install using distutils:

git clone git@github.com:belambert/asr-evaluation.git
cd asr-evaluation
python setup.py install

To uninstall with pip:

pip uninstall asr-evaluation

Command line usage

For command line usage, see:

    wer --help

It should display something like this:

usage: wer [-h] [-i | -r] [--head-ids] [-id] [-c] [-p] [-m count] [-a] [-e]
           ref hyp

Evaluate an ASR transcript against a reference transcript.

positional arguments:
  ref                   Reference transcript filename
  hyp                   ASR hypothesis filename

optional arguments:
  -h, --help            show this help message and exit
  -i, --print-instances
                        Print all individual sentences and their errors.
  -r, --print-errors    Print all individual sentences that contain errors.
  --head-ids            Hypothesis and reference files have ids in the first
                        token? (Kaldi format)
  -id, --tail-ids, --has-ids
                        Hypothesis and reference files have ids in the last
                        token? (Sphinx format)
  -c, --confusions      Print tables of which words were confused.
  -p, --print-wer-vs-length
                        Print table of average WER grouped by reference
                        sentence length.
  -m count, --min-word-count count
                        Minimum word count to show a word in confusions.
  -a, --case-insensitive
                        Down-case the text before running the evaluation.
  -e, --remove-empty-refs
                        Skip over any examples where the reference is empty.
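
For example, to print every sentence with its errors while down-casing the text first, the documented -i and -a flags can be combined (the file names below are placeholders):

wer -i -a ref.txt hyp.txt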

Contributing and code of conduct

For contributions, it's best to use GitHub issues and pull requests. Proper testing and documentation are suggested.

Reasonable conduct is expected, in particular as specified by the Contributor Covenant.

asr-evaluation's People

Contributors

belambert, davidnemeskey, selaselah, shingo22, sourcery-ai[bot]


asr-evaluation's Issues

SER and WRR return different values between the pip and src versions

This is a really nice project! I noticed that WRR and SER return wrong values. I installed asr-evaluation:

  • asr-evaluation==2.0.2 (pip install)

I have a simple example for reproducing the problem. I also cloned this repository and tested it directly; the returned values look correct.

Is my usage wrong, or is this a bug?

  • hyp.txt
i have dog
did you pen
hello tom
hello tom
hello tom
hello tom
hello tom
hello tom
hello tom
hello tom
hello tom
hello tom
hello tom
  • ref.txt
i have a dog
do you have a pen
hello tom
hello tom
hello tom
hello tom
hello tom
hello tom
hello tom
hello tom
hello tom
hello tom
hello tom
wer ref.txt hyp.txt
Sentence count: 13
WER:    14.286% (         4 /         28)
WRR:    96.429% (        27 /         28)
SER:   100.000% (        13 /         13)
cd asr_evaluation
python __main__.py ../../ref.txt ../../hyp.txt                                                                         
Sentence count: 13
WER:    12.903% (         4 /         31)
WRR:    87.097% (        27 /         31)
SER:    15.385% (         2 /         13)

Python API

Is there a Python API that can be called from within Python?
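
Pending a documented Python API, one workaround is to invoke the wer console script from Python. The sketch below assumes only the command-line interface shown above; the file names are placeholders.

import subprocess

# Run the `wer` console script on two placeholder files and capture its report.
result = subprocess.run(
    ["wer", "ref.txt", "hyp.txt"],
    capture_output=True, text=True, check=True,
)
print(result.stdout)  # the Sentence count / WER / WRR / SER lines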

SER wrong for simple example

It's probably me but with the following two input files:

ref.txt

the crazy frog jumps over the lazy dog
the crazy frog jumps over the lazy dog
the frog jumps over the lazy dog
the crazy frog jumps over the lazy dog
the crazy frog jumps over the lazy dog extended

and hyp.txt

the crazy frog jumps over the lazy dog
the frog jumps over the lazy dog
the crazy frog jumps over the lazy dog
the craz frog jumps over the lazy dog
the crazy frog jumps over the lazy dog

Running the wer command:

wer ref.txt hyp.txt

Gives me the result:

Sentence count: 5
WER:    10.000% (         4 /         40)
WRR:    92.500% (        37 /         40)
SER:    80.000% (         4 /          5)

For some reason it would seem that only 4 sentences in hyp.txt are recognized?

Environment

$ pip show asr-evaluation
Name: asr-evaluation
Version: 2.0.2
Summary: Evaluating ASR (automatic speech recognition) hypotheses, i.e. computing word error rate.
Home-page: UNKNOWN
Author: Ben Lambert
Author-email: [email protected]
License: LICENSE.txt
Location: /home/sfalk/miniconda3/envs/t2t/lib/python3.5/site-packages
Requires: termcolor, edit-distance
Required-by: 

wer -p raises IndexError

When trying to print the table of average WER grouped by reference sentence length, I get the following error:

Traceback (most recent call last):
  File "/home/david/miniconda3/bin/wer", line 11, in <module>
    load_entry_point('asr-evaluation', 'console_scripts', 'wer')()
  File "/home/david/asr-evaluation/asr_evaluation/__main__.py", line 59, in main
    other_main(args)
  File "/home/david/asr-evaluation/asr_evaluation/asr_evaluation.py", line 82, in main
    print_wer_vs_length()
  File "/home/david/asr-evaluation/asr_evaluation/asr_evaluation.py", line 371, in print_wer_vs_length
    avg_wers = list(map(lambda x: (x[0], mean(x[1])), values))
  File "/home/david/asr-evaluation/asr_evaluation/asr_evaluation.py", line 371, in <lambda>
    avg_wers = list(map(lambda x: (x[0], mean(x[1])), values))
IndexError: list index out of range

Running Python 3.7.7.

Paper Citations

Can you name a paper on which this ASR evaluation is based?

thank you very much

issues about calculating WER and WRR using python2.7

@belambert
Very useful package; however, there might be a small mistake when calculating the WER and WRR under Python 2.7:
I think 'error_count' and 'match_count' should be converted to type 'float' when using Python 2.7.

asr_evaluation/asr_evaluation.py (lines 65-66)

print('WRR: {0:f} % ({1:10d} / {2:10d})'.format(100 * match_count / ref_token_count, match_count, ref_token_count))
print('WER: {0:f} % ({1:10d} / {2:10d})'.format(100 * error_count / ref_token_count, error_count, ref_token_count))
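
A sketch of the fix suggested above: under Python 2.7, true division has to be forced, either with a __future__ import or with an explicit float cast. The counter variables come from the surrounding function in asr_evaluation.py.

# Option 1: put this at the top of asr_evaluation.py so that / performs true division.
from __future__ import division

# Option 2: cast one operand explicitly before dividing.
print('WRR: {0:f} % ({1:10d} / {2:10d})'.format(
    100 * float(match_count) / ref_token_count, match_count, ref_token_count))
print('WER: {0:f} % ({1:10d} / {2:10d})'.format(
    100 * float(error_count) / ref_token_count, error_count, ref_token_count))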

Anyway, it's a good package!

Sphinx format problem when using -id argument

In Sphinx format the hypothesis file has the following form:

hypothesis_text (file_id score)

while transcription lines have the form:

transcription_text (file_id)

So when I run the wer command the following error occurs:

$ wer transcriptions hypothesis -id
Reference and hypothesis IDs do not match! ref="(data_005)" hyp="-7716)"
File lines in hyp file should match those in the ref file.

I think this occurs because the score field has not been taken into account: the file id of the transcription is compared to the score rather than to the file id of the hypothesis.
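
Until the score field is handled, one workaround is to strip it from the hypothesis file before scoring. The sketch below assumes the formats quoted above (the score is the second token inside the trailing parentheses); hypothesis.noscore is a made-up output name.

import re

# Rewrite a Sphinx-format hypothesis file so that the trailing "(file_id score)"
# becomes "(file_id)", matching the reference transcription format.
with open("hypothesis") as fin, open("hypothesis.noscore", "w") as fout:
    for line in fin:
        fout.write(re.sub(r"\((\S+)\s+\S+\)\s*$", r"(\1)", line.rstrip()) + "\n")

The rewritten file can then be scored as before: wer transcriptions hypothesis.noscore -id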
