gary083 / gan_harmonized_with_hmms Goto Github PK

Code：Completely Unsupervised Speech Recognition By A Generative Adversarial Network Harmonized With Iteratively Refined Hidden Markov Models

Home Page: https://arxiv.org/abs/1904.04100

Python 38.11% Shell 52.82% Perl 9.07%

automatic-speech-recognition unsupervised-learning

gan_harmonized_with_hmms's People

Contributors

Stargazers

Watchers

Forkers

raywu0123 kkulczak themidwestcanapps darongliu jcarlosneto

gan_harmonized_with_hmms's Issues

PyTorch code version high FER in completely unsupervised settings?

@SungFeng-Huang I have been playing with your pytorch code version for the first iteration of GAN training using oracle bounds or the provided "gas" bounds. It seems that I wasn't able to achieve the FER numbers reported in the paper, even under the "matching" setting between audio and text. While the oracle bounds gave somewhat more reasonable results, the FER was still in the low 30s%, with a final PER of 30.5%. With the provided unsupervised "gas" bounds, the FER was much much higher than the results in the paper. The FER I could obtain was 70-85%, and the best PER I could get was 65%.

I compared the gas bounds with the oracle bounds and the r-value seemed reasonable (81.77) under a 2-frame (standard 25ms/10ms shift) tolerance window.

I noticed that in the commit comments you also mentioned this issue. Have you ever figured out the reason?

Thanks for the help!

GAS boundaries

How did you extract the GAS phone boundaries from the data set?
I found this repository https://github.com/allyoushawn/timit_gas
I modified their decoder code to output the boundaries in a pickle file. The code worked but the PER was very bad compared to the one generated by the original uns_bnd files located in ./data/timit_gas.

Running this repo in colab

Any tips on what should I do to get this code running on colab?

gary083 / gan_harmonized_with_hmms Goto Github PK

gan_harmonized_with_hmms's People

Contributors

Stargazers

Watchers

Forkers

gan_harmonized_with_hmms's Issues

PyTorch code version high FER in completely unsupervised settings?

GAS boundaries

Running this repo in colab

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent