This project focuses on transcribing Urdu audio files into text using state-of-the-art Automatic Speech Recognition (ASR) techniques. The task involved processing 63 Urdu audio files in WAV format. The primary technologies utilized include the Wav2Vec2 model for ASR and the Google Speech Recognition API for audio transcription.
To access and run the code in a Google Colab environment, simply open the provided Jupyter notebook:
Follow these steps to run the code on Google Colab:
-
Open the Jupyter notebook in Google Colab by clicking the link above.
-
Click on the "Open in Colab" button at the top of the notebook.
-
Run each cell sequentially to execute the code.
-
Ensure you have the required dependencies installed. If not, install them using the provided commands in the notebook.
-
The notebook will transcribe Urdu audio files to text, and the results will be displayed within the notebook.
The notebook will generate the transcribed text within itself. Check the output cells for the transcribed text corresponding to each audio file.
Contributions are welcome! Feel free to open issues or submit pull requests. Please follow the guidelines outlined in the CONTRIBUTING.md file.
This project is licensed under the MIT License.
For any inquiries or collaborations, please contact:
- GitHub: zahidparviz
- Email: [pervaizzahid55@gmail.com]