Subtitle generation using OpenAI's Whisper

This is a simple Streamlit UI for OpenAI's Whisper speech-to-text model. It lets you download and transcribe media from YouTube videos, playlists, or local files (each file is limited to 200 MB) with specific settings. You can then browse, filter, and search through your saved audio files. Feel free to raise an issue for bugs or feature requests, or send a PR.

Watch the demo on YouTube:

Setup

This was built and tested on Python 3.11 but should also work on Python 3.8+ (as with the original Whisper repo). You'll need to install ffmpeg on your system. Then install the requirements with pip.

# Install pytorch if you don't have it
# sudo conda install pytorch
pip install -r requirements.txt
pip install git+https://github.com/openai/whisper.git
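Before launching the app, it can help to confirm the two prerequisites mentioned above: a recent enough Python and an ffmpeg binary on PATH. The following is a minimal sketch, not part of this repository; the function name `check_environment` and its parameters are made up for illustration.

```python
import shutil
import sys


def check_environment(min_python=(3, 8), which=shutil.which, version=None):
    """Return a list of human-readable problems; an empty list means OK.

    `which` and `version` are injectable only to make the check easy to test.
    """
    version = tuple(sys.version_info[:2]) if version is None else tuple(version)
    problems = []
    if version < tuple(min_python):
        problems.append(
            f"Python {min_python[0]}.{min_python[1]}+ is required, "
            f"found {version[0]}.{version[1]}"
        )
    # Whisper calls the ffmpeg binary to decode media, so it must be on PATH.
    if which("ffmpeg") is None:
        problems.append("ffmpeg was not found on PATH")
    return problems


if __name__ == "__main__":
    issues = check_environment()
    print("Environment looks OK" if not issues else "\n".join(issues))
```

If the script reports a missing ffmpeg, install it with your system's package manager and run the check again.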

Usage

Once you're set up, you can run the app with:

streamlit run app/01_🏠_Home.py

This will open a new tab in your browser with the app. You can then select a YouTube URL or local file & click "Run Whisper" to run the model on the selected media.

If the tab doesn't open, go to http://localhost:8501 in your browser.

If you are not satisfied with the output, click '⚙️Settings' on the left to fine-tune how the Whisper model runs inference.

Important ⚙️Settings, for reference:

Model: the Whisper model size to use, default: medium
Language: the language of the transcription, default: zh (Chinese)
No Speech Threshold: how strictly non-speech segments are excluded, default: 0.4 (lower values are stricter)
Condition on previous text: whether the previously transcribed text is used as context for the next segment, default: True
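For readers who want to reproduce these settings outside the UI, here is a rough sketch of how they could be passed to the `openai-whisper` Python package. It assumes the settings correspond to the `language`, `no_speech_threshold`, and `condition_on_previous_text` arguments of Whisper's `transcribe()`; the helper names `build_transcribe_options` and `transcribe` are illustrative and are not taken from this app's code.

```python
def build_transcribe_options(
    language="zh",
    no_speech_threshold=0.4,
    condition_on_previous_text=True,
):
    """Collect the defaults listed above as keyword arguments for transcribe()."""
    if not 0.0 <= no_speech_threshold <= 1.0:
        raise ValueError("no_speech_threshold must be between 0 and 1")
    return {
        "language": language,
        "no_speech_threshold": no_speech_threshold,
        "condition_on_previous_text": condition_on_previous_text,
    }


def transcribe(path, model_name="medium", **overrides):
    """Run Whisper on one media file and return its result dictionary."""
    # Imported lazily so the options helper works without the package installed.
    import whisper

    model = whisper.load_model(model_name)
    return model.transcribe(path, **build_transcribe_options(**overrides))
```

For example, `transcribe("meeting.mp4", language="en")["text"]` would return the English transcript of a hypothetical local file named `meeting.mp4`.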

Hosting

If you want to host the app on a server with a dynamic IP, you can install ngrok to forward the port to a public address. You can then access the app from anywhere via a random URL such as https://b9f1-458-19-17-41.jp.ngrok.io

Register for an account and get your own authtoken from the ngrok website: https://dashboard.ngrok.com/get-started/your-authtoken
Install ngrok:

wget https://bin.equinox.io/c/bNyj1mQVY4c/ngrok-v3-stable-linux-amd64.tgz
sudo tar xvzf ngrok-v3-stable-linux-amd64.tgz -C /usr/local/bin
rm ngrok-v3-stable-linux-amd64.tgz

Put your ngrok authtoken into forward_port.sh.
Expose the app to the public by running bash forward_port.sh.
Look up the random URL by running python inspect_url.py, then open that URL in your browser.
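As an illustration of the last step, a running ngrok agent normally serves a local API at http://127.0.0.1:4040/api/tunnels that lists its active tunnels. The sketch below reads the public URL from that endpoint. It is an assumption that this resembles what inspect_url.py does; the function names are hypothetical.

```python
import json
from urllib.request import urlopen

# Default address of the local ngrok agent API (assumes ngrok's default config).
NGROK_API = "http://127.0.0.1:4040/api/tunnels"


def extract_public_urls(payload):
    """Return the public_url of every tunnel listed in the API response."""
    return [t["public_url"] for t in payload.get("tunnels", []) if "public_url" in t]


def fetch_public_urls(api_url=NGROK_API, timeout=5):
    """Ask the locally running ngrok agent which public URLs it exposes."""
    with urlopen(api_url, timeout=timeout) as response:
        return extract_public_urls(json.load(response))
```

Calling `fetch_public_urls()` only works while the ngrok process is running on the same machine; otherwise the request fails with a connection error.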

🚧 Under Construction:

Add Redis as the task queue

🔥You can try our demo here

Special thanks to for the server.

Changelog

See Commits for detailed changes.

Version summaries will be provided in Releases.

The changelog of the original version can be found in this file.

License

Whisper: MIT
Streamlit: Apache 2.0
Everything else: MIT

Reference

I forked the original version of the interface from https://github.com/hayabhay/whisper-ui.

Its authors did a great job of building a subtitle management system: a search engine, a transcript viewer, and settings.

The original version aims to demonstrate the power of Whisper, especially on short YouTube videos, for local use.

My goal is to provide a service that lets multiple clients make subtitles for long videos such as meeting recordings, courses, and movies.

