Arabic Speech Recognition with Whisper

Overview

This project aims to perform Arabic speech recognition using the Whisper model, developed by OpenAI. We fine-tune the Whisper model on an Arabic speech dataset, leveraging the Hugging Face Transformers library. The trained model can transcribe Arabic speech audio into text with high accuracy.

How it Works

Data Preparation: We start by collecting and preparing an Arabic speech dataset. The dataset should contain audio files along with their corresponding transcriptions.
Fine-Tuning: We fine-tune the pre-trained Whisper model on the Arabic dataset using supervised learning. During fine-tuning, the model learns to map input audio features to text transcriptions.
Evaluation: After fine-tuning, we evaluate the trained model's performance on a separate test dataset. We measure metrics such as Word Error Rate (WER) to assess the accuracy of the model's transcriptions.
Inference: Once the model is trained and evaluated, it can be used for real-world inference tasks. Given a new Arabic speech audio file, the model can transcribe the audio into text.

Key Components

Whisper Model: The core of the project is the Whisper model, which is a deep learning model specifically designed for speech recognition tasks.
Hugging Face Transformers: We leverage the Transformers library from Hugging Face, which provides a user-friendly interface for working with state-of-the-art deep learning models, including Whisper.
Training Script: We provide a training script that automates the process of fine-tuning the Whisper model on the Arabic dataset.
Evaluation Script: We also provide an evaluation script to measure the performance of the trained model using standard metrics.
Inference Script: Finally, we offer an inference script that allows users to transcribe Arabic speech audio using the trained model.

Usage

To use the project, follow these steps:

Prepare the Arabic speech dataset.
Fine-tune the Whisper model on the dataset using the provided training script.
Evaluate the trained model using the evaluation script.
Use the trained model for inference tasks using the inference script.

Learn More

For a detailed guide on fine-tuning the Whisper model and other advanced techniques, read the blog post on Hugging Face's blog.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Arabic_fine_tune_whisper (1).ipynb		Arabic_fine_tune_whisper (1).ipynb
LICENSE		LICENSE
README.md		README.md
whisperonarabic.ipynb		whisperonarabic.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Arabic Speech Recognition with Whisper

Overview

How it Works

Key Components

Usage

Learn More

About

Releases

Packages

Languages

License

Huzaifa7524/Whisper_small_openai_finetuned_on_arabic_language

Folders and files

Latest commit

History

Repository files navigation

Arabic Speech Recognition with Whisper

Overview

How it Works

Key Components

Usage

Learn More

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages