Annotated reciters #1

Open
OmarKhled opened this issue Nov 2, 2023 · 2 comments
Labels
enhancement New feature request

Comments

@OmarKhled
Owner

OmarKhled commented Nov 2, 2023

The main current limitation of the Tafakor Generator is the lack of annotated reciters. To step through the captions in a video in sync with the recitation, the audio files need annotation: each mp3 file is paired with data indicating when each individual word begins and ends.
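To make the annotation idea concrete, here is a minimal sketch of what per-word timing data could look like and how a caption renderer might look up the word being recited at a given moment. The `WordTiming` class, its field names, the `word_at` helper, and the sample values are all hypothetical illustrations, not the quran.com API format or the Tafakor Generator's actual code.

```python
from bisect import bisect_right
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class WordTiming:
    """Hypothetical annotation record: one word and its time span in the audio."""
    word: str
    start_ms: int  # when the word begins, in milliseconds
    end_ms: int    # when the word ends (exclusive), in milliseconds


def word_at(timings: List[WordTiming], t_ms: int) -> Optional[WordTiming]:
    """Return the word being recited at t_ms, or None during a pause.

    Assumes timings are sorted by start_ms and do not overlap.
    """
    starts = [w.start_ms for w in timings]
    # Index of the last word whose start is <= t_ms (or -1 if none).
    i = bisect_right(starts, t_ms) - 1
    if i >= 0 and t_ms < timings[i].end_ms:
        return timings[i]
    return None


# Placeholder data for illustration only.
timings = [
    WordTiming("word-1", 0, 480),
    WordTiming("word-2", 520, 1100),
    WordTiming("word-3", 1100, 1900),
]

current = word_at(timings, 600)
print(current.word if current else "(pause)")
```

With data in this shape, highlighting a caption word per video frame reduces to one lookup per frame time.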

Currently, these annotated audio files are sourced from the quran.com API. However, its list of annotated reciters is short, so we need to expand the list of reciters with annotated audio files.

This expansion can be achieved with ASR (Automatic Speech Recognition) models, which can automatically annotate audio files from additional reciters.

Previous Work:
Here are some sources of previous related work in this area:

For reference, see also the Whisper ASR notebook:

@OmarKhled OmarKhled changed the title from "More Rectires" to "Annotated reciters" Nov 2, 2023
@OmarKhled
Owner Author

I decided to proceed with the Deepspeech-Quran repo and build on the Mozilla model. The current trained model presented by Mr. Tarek el deep has fair accuracy: it detects most of the words, but around 25% of the words are either not detected or misdetected.

Inshaa-Allah, I intend to train the model on the data of Islam Sobhy.
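One way a figure like the roughly 25% of missed or misdetected words could be tracked across training runs is a word error rate: the word-level edit distance between the known verse text and the model's transcript, divided by the number of reference words. This is only a sketch under that assumption; the thread does not say how the 25% was measured, and the function names and sample tokens below are hypothetical.

```python
from typing import List


def word_errors(reference: List[str], hypothesis: List[str]) -> int:
    """Word-level Levenshtein distance between reference and hypothesis."""
    # prev[j] = distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hypothesis) + 1))
    for i, ref_word in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hypothesis, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word not detected
                curr[j - 1] + 1,     # extra word in the transcript
                prev[j - 1] + cost,  # match, or misdetected word
            ))
        prev = curr
    return prev[-1]


def word_error_rate(reference: List[str], hypothesis: List[str]) -> float:
    """Fraction of reference words that were missed, added, or misdetected."""
    if not reference:
        raise ValueError("reference must contain at least one word")
    return word_errors(reference, hypothesis) / len(reference)


# Placeholder tokens for illustration only.
reference = ["w1", "w2", "w3", "w4"]
hypothesis = ["w1", "wX", "w3"]  # one misdetected word, one missed word
print(f"{word_error_rate(reference, hypothesis):.0%}")
```

Running the same measurement before and after training on a new reciter's data would show whether the fine-tuned model actually improves on the baseline.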

@OmarKhled OmarKhled added the enhancement New feature request label Jan 1, 2024