ProjectLog

Project log

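Some basics of speech recognition. This project focuses on scoring isolated words and giving feedback on their pronunciation; the features will be implemented step by step.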

Because the project needs to be deployed to a mini program, it has the domain www.kaldi-speech.cn

The current work is to implement DTW and HMM-DNN from scratch and compare their efficiency. Later on, Kaldi will be used for further model training.

Current directions and progress

Word scoring

Implementation approach

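  1. MFCC basics
    python_speech_features official documentation
    Extracting MFCCs with python_speech_features
    Walkthrough of the MFCC extraction process
    Assorted MFCC tutorials
    Dynamic time warping
    DTW
    Implementation of MFCC and DTW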

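    Result: this approach cannot express a relative score quantitatively. What it ultimately yields is the average matching value between the two feature-vector sequences, which falls short of the intended usage standard, so it can be dropped.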

  2. A brief explanation of HMM
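
The MFCC + DTW approach from item 1 can be sketched roughly as follows. This is a minimal illustration and not the project's actual code: the function names `euclidean` and `dtw_average_cost` are hypothetical, and the feature frames are assumed to be plain lists of floats (in practice they would come from an MFCC extractor such as python_speech_features). It returns the average frame distance along the best alignment, which is the kind of "average matching value" described in the result note.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two feature frames of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dtw_average_cost(ref, test):
    """Average per-step distance along the cheapest DTW alignment path.

    ref and test are sequences of feature frames (assumed here to be
    lists of floats, e.g. one MFCC vector per frame).
    """
    n, m = len(ref), len(test)
    if n == 0 or m == 0:
        raise ValueError("both sequences need at least one frame")

    inf = float("inf")
    # cost[i][j]: minimal accumulated distance aligning ref[:i] with test[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    # steps[i][j]: number of aligned frame pairs on that cheapest path
    steps = [[0] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = euclidean(ref[i - 1], test[j - 1])
            # Cheapest predecessor: skip a ref frame, skip a test frame,
            # or advance both (diagonal move).
            prev_cost, prev_steps = min(
                (cost[i - 1][j], steps[i - 1][j]),
                (cost[i][j - 1], steps[i][j - 1]),
                (cost[i - 1][j - 1], steps[i - 1][j - 1]),
            )
            cost[i][j] = prev_cost + d
            steps[i][j] = prev_steps + 1

    return cost[n][m] / steps[n][m]
```

The returned value is a distance in feature space with no natural upper bound or scale, so it says which of two attempts is closer to the reference but does not map directly onto a score, which is consistent with the conclusion recorded under item 1.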

Sentence scoring

The focus is on fluency and related aspects.
