Skip to content

📼 A streamlit web interface designed to extract words from video/audio files into text • Python, FFmpeg, Whisper, YT-DLP

License

Notifications You must be signed in to change notification settings

sameemul-haque/TranscribeTool

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TranscribeTool

This project is a simple tool that allows users to upload a video or audio file and extract text from it. It utilizes Streamlit for the user interface and ffmpeg for audio extraction, Whisper API for speech recognition, and yt-dlp for retrieving audio from a video url.

  • Report issues here
  • Request features here

Usage

  1. Open https://transcribetool.streamlit.app/
  2. Upload a video | audio file or provide url of a youtube video.
  3. Wait for the transcription process to complete.
  4. View the text extracted from the file.

Installation

To run this project locally, follow these steps:

  1. Clone this repository to your local machine.
  2. Navigate to the project directory.
  3. Create and activate a virtual environment (optional but recommended):
python3 -m venv venv
source venv/bin/activate
  1. Install the required dependencies using pip:
pip install -r requirements.txt
  1. Run the following command to start the Streamlit app:
streamlit run main.py 
  1. Open your web browser and go to the URL provided by Streamlit.
  2. Upload a video | audio file or provide url of a video.
  3. Wait for the transcription process to complete.
  4. View the text extracted from the file.

preview

Acknowledgements

This project uses the following libraries:

License

This project is licensed under the GPL-3.0 License - see the LICENSE file for details.