Code Monkey home page Code Monkey logo

huzaifa7524 / whisper_small_openai_finetuned_on_arabic_language Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 64 KB

Arabic Speech Recognition with Whisper: Fine-tune the Whisper model from OpenAI for Arabic speech recognition tasks. This repository provides scripts and resources for collecting, preprocessing, and fine-tuning an Arabic speech dataset, as well as evaluating and deploying the trained model for inference.

License: MIT License

Jupyter Notebook 100.00%

whisper_small_openai_finetuned_on_arabic_language's Introduction

Arabic Speech Recognition with Whisper

Overview

This project aims to perform Arabic speech recognition using the Whisper model, developed by OpenAI. We fine-tune the Whisper model on an Arabic speech dataset, leveraging the Hugging Face Transformers library. The trained model can transcribe Arabic speech audio into text with high accuracy.

How it Works

  1. Data Preparation: We start by collecting and preparing an Arabic speech dataset. The dataset should contain audio files along with their corresponding transcriptions.

  2. Fine-Tuning: We fine-tune the pre-trained Whisper model on the Arabic dataset using supervised learning. During fine-tuning, the model learns to map input audio features to text transcriptions.

  3. Evaluation: After fine-tuning, we evaluate the trained model's performance on a separate test dataset. We measure metrics such as Word Error Rate (WER) to assess the accuracy of the model's transcriptions.

  4. Inference: Once the model is trained and evaluated, it can be used for real-world inference tasks. Given a new Arabic speech audio file, the model can transcribe the audio into text.

Key Components

  • Whisper Model: The core of the project is the Whisper model, which is a deep learning model specifically designed for speech recognition tasks.
  • Hugging Face Transformers: We leverage the Transformers library from Hugging Face, which provides a user-friendly interface for working with state-of-the-art deep learning models, including Whisper.
  • Training Script: We provide a training script that automates the process of fine-tuning the Whisper model on the Arabic dataset.
  • Evaluation Script: We also provide an evaluation script to measure the performance of the trained model using standard metrics.
  • Inference Script: Finally, we offer an inference script that allows users to transcribe Arabic speech audio using the trained model.

Usage

To use the project, follow these steps:

  1. Prepare the Arabic speech dataset.
  2. Fine-tune the Whisper model on the dataset using the provided training script.
  3. Evaluate the trained model using the evaluation script.
  4. Use the trained model for inference tasks using the inference script.

Learn More

For a detailed guide on fine-tuning the Whisper model and other advanced techniques, read the blog post on Hugging Face's blog.

whisper_small_openai_finetuned_on_arabic_language's People

Contributors

huzaifa7524 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.