Code Monkey home page Code Monkey logo

offline_stt's Introduction

Offline Speech to Text Engine

Overview

  • This application depicts an offline speech to text engine that consists of a limited set of vocabulary
  • The speech recognition is done by a trained model.
  • I used tensorflowjs to achieve this
    • In particular, I followed this tutorial for the speech recognition implementation
  • Upon implementation of the speech recognition, the offline capabilities are enabled by deploying the web app as a PWA
    • All required static resources are cached
    • I used a known caching strategy to achieve this.

Build and deploy

  1. Clone repo
  2. Change logo if need be
  3. npm install serve OR npm install http-server
    • This is to serve static files from a given directory

Goal

  • Going forward, I have plans to do either of the following:
    1. Extrapolate from this initial implementation and train my own model with a larger set of vocabulary
    2. Extrapolate from this implementation and train my own model using data acquired from Common Voice Project with hopes of full English speech recognition

Caveat

  • My knowledge in both ML and PWA deployment is beginner at best. Hence, the project is based heavily on a variety of resources.
  • Implementation has not been thoroughly vetted and is meant to serve as a proof-of-concept

Extra References

offline_stt's People

Contributors

ashwin2397 avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.