Code Monkey home page Code Monkey logo

keyword-spotting-system's Introduction

Keyword Spotting System

This project involves building a Convolutional Neural Network (CNN) architecture for Keyword Spotting, specifically trained on the Google Speech Commands dataset with an impressive Test Accuracy of 90%. Additionally, the model has been deployed as an API using Flask, and the API has been integrated into a frontend built in ReactJS.

CNN Architecture

The Convolutional Neural Network is a deep learning model well-suited for audio-related tasks. For keyword spotting, it has proven to be effective in recognizing specific words or phrases within audio clips.

The architecture of the CNN used for this project is as follows:

image

The CNN architecture is designed to extract relevant features from the audio input and learn to differentiate between different keywords.

Model Performance

The trained CNN has achieved an impressive Test Accuracy of 90% on the Google Speech Commands dataset. The high accuracy is a testament to the effectiveness of the chosen architecture and the quality of the training data.

API Deployment with Flask

To make the trained model accessible and usable, it has been deployed as an API using Flask. Flask is a popular web framework in Python that allows us to create a simple and efficient API for serving predictions from the model.

With the API, users can send audio clips as requests to the server, and the server will respond with predictions regarding the presence of the specific keyword in the audio.

Integration with ReactJS Frontend

To provide a user-friendly interface, the API has been integrated with a frontend built in ReactJS. ReactJS is a popular JavaScript library for building interactive user interfaces.

Users can interact with the frontend, record or upload audio clips, and send them to the Flask API for keyword spotting predictions. The results are then displayed back to the user in a user-friendly manner.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

Special thanks to the authors of the Google Speech Commands dataset and the open-source community for their contributions to the tools used in this project.

keyword-spotting-system's People

Contributors

snehagupta1907 avatar kali-decoder avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.