Universal Video Summarizer is a python based QT5 desktop application that automates summarizing videos. The application utilizes yt-dlp to download videos, then ASR to transcribe it and LLMs to summarize/process the transcript according to user directives.
- Python 3.8+
ffmpeginstalled on your system.
pip install -r requirements.txtpython main.py- Enter the URL or local path of the video you want to summarize.
- Select the desired summarization method.
- Click the
Summarizebutton to start the summarization process. - The application will display the summarized text in the text box.
- Click the
Settingsbutton. - Select one of many LLM models available on Hugging Face.
- Click the
Savebutton to save the settings.
| Model Name | Notes |
|---|---|
openai/whisper-large-v3 |
Best performance. |
distil-whisper/distil-large-v3 |
Reduced VRAM usage, comparable performance. |
| Model Name | Notes |
|---|---|
meta-llama/Llama-3.2-1B-Instruct |
Requires signing up and requesting access. |
meta-llama/Llama-3.2-3B-Instruct |
Larger version of the 1B, requires more VRAM to run fast. Requires signing up and requesting access. |
Qwen/Qwen2-1.5B-Instruct |
Has issues with hallucinations. |
- Summarize videos from YouTube, Twitch or any other online platform
yt-dlpsupports and local files. - Switch between different models for transcription and summarization.
- Create custom prompts for the summarization process.
Contributions are welcome! Feel free to fork this repository, submit issues, or create pull requests.