Annotated reciters #1

OmarKhled · 2023-11-02T02:21:00Z

The main limitation of the Tafakor Generator right now is the lack of annotated reciters. To move through the captions in a video in sync with the recitation, the audio files need to be annotated. Annotation here means pairing each mp3 file with data that indicates when each individual word begins and ends.
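To make the idea of "annotation" concrete, here is a minimal sketch of what per-word timing data could look like and how a caption renderer might look up the word being recited at a given moment. The `WordTiming` type, the `word_at` helper, the transliterated words, and the timings are all illustrative assumptions, not the format the quran.com API or Tafakor actually uses.

```python
from bisect import bisect_right
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class WordTiming:
    """Hypothetical annotation record: one word and its span in the mp3."""
    text: str      # the word as shown in the caption
    start_ms: int  # when the word begins, in milliseconds
    end_ms: int    # when the word ends, in milliseconds


def word_at(timings: List[WordTiming], t_ms: int) -> Optional[WordTiming]:
    """Return the word recited at t_ms, or None if t_ms falls in a gap.

    Assumes `timings` is sorted by start_ms and spans do not overlap.
    """
    starts = [w.start_ms for w in timings]
    i = bisect_right(starts, t_ms) - 1  # last word starting at or before t_ms
    if i >= 0 and t_ms < timings[i].end_ms:
        return timings[i]
    return None


# Made-up timings for illustration only.
timings = [
    WordTiming("bismi", 0, 480),
    WordTiming("allahi", 520, 1100),
    WordTiming("ar-rahmani", 1150, 2000),
]

current = word_at(timings, 600)
print(current.text if current else "(silence)")
```

With data in this shape, highlighting the active caption word for a video frame is a single lookup by the frame's timestamp.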

Currently, these annotated audio files are sourced from the quran.com API. However, its list of annotated reciters is short. To address this, we need to expand the list of reciters with annotated audio files.

This expansion can be achieved with ASR (Automatic Speech Recognition) models, which can annotate audio files from additional reciters automatically.
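As a rough sketch of how ASR output could be turned into annotations: the helper below flattens a Whisper-style transcription result (segments containing per-word `start`/`end` times in seconds) into a list of word timings in milliseconds. The function name `words_from_asr`, the output keys, and the sample data are assumptions for illustration. The commented-out call shows one way such a result might be produced with the `openai-whisper` package, where `reciter.mp3` is a hypothetical file. In practice the recognized words would likely still need to be aligned against the known text of the verse.

```python
from typing import Dict, List


def words_from_asr(result: Dict) -> List[Dict]:
    """Flatten a Whisper-style result into word timings in milliseconds."""
    words = []
    for segment in result.get("segments", []):
        for w in segment.get("words", []):
            words.append({
                "text": w["word"].strip(),          # drop padding whitespace
                "start_ms": round(w["start"] * 1000),
                "end_ms": round(w["end"] * 1000),
            })
    return words


# One possible way to obtain `result` (requires the openai-whisper package):
#   import whisper
#   model = whisper.load_model("small")
#   result = model.transcribe("reciter.mp3", language="ar", word_timestamps=True)

# Hand-written sample in the same shape, with made-up words and timings.
sample = {
    "segments": [
        {"words": [
            {"word": " bismi", "start": 0.0, "end": 0.48},
            {"word": " allahi", "start": 0.52, "end": 1.1},
        ]},
    ]
}

annotations = words_from_asr(sample)
print(annotations)
```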

Previous Work:
Here are some sources of previous related work in this area:

For reference, there is also a Whisper ASR notebook:

Whisper ASR Notebook

OmarKhled · 2023-11-02T02:51:45Z

https://github.com/tarekeldeeb/DeepSpeech-Quran

OmarKhled · 2023-12-17T02:39:41Z

Decided to proceed with the DeepSpeech-Quran repo and build on the Mozilla model. The current trained model presented by Mr. Tarek el deep has fair accuracy and manages to detect most of the words, but around 25% of the words are either not detected or misdetected.
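One common way to put a number on this kind of accuracy is the word error rate (WER): the word-level edit distance between the known text and the model's transcript, divided by the number of reference words. The sketch below is a generic implementation with placeholder words, offered as an assumption about how the figure could be measured, not as the method used to arrive at the estimate above.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # reference word missing from transcript
                curr[j - 1] + 1,     # extra word in transcript
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# Placeholder words: one of four reference words is misrecognized.
wer = word_error_rate("w1 w2 w3 w4", "w1 w2 wX w4")
print(wer)  # 0.25
```

Tracking this value per reciter before and after further training would show whether the retrained model actually improves.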

Inshaa-Allah, I intend to train the model on the data of Islam Sobhy.

OmarKhled changed the title ~~More Rectires~~ Annotated reciters Nov 2, 2023

OmarKhled added the enhancement New feature request label Jan 1, 2024
