transcribe-anything

Input a local file or url and this service will transcribe it using Mozilla Deepspeech 0.9.3.

Build Status

Example

Example (cmd):
- transcribe_anything <YOUTUBE_URL> > out_subtitles.txt
- transcribe_anything <LOCAL.MP4/WAV> > out_subtitles.txt

Example (api):

from transcribe_anything.api import bulk_transcribe

urls = ['https://www.youtube.com/watch?v=Erk4_jFDjzQ']
def onresolve(url, sub): print(url, sub)
def onfail(url): print(f'Failed: {url}')
bulk_transcribe(urls, onresolve=onresolve, onfail=onfail)

Required: Install to current python environment

pip install https://github.com/Soebb/transcribe_anything/archive/main.zip
- The command transcribe_anything will magically become available.
transcribe_anything <YOUTUBE_URL> > out_subtitles.txt
-or- transcribe_anything <MY_LOCAL.MP4/WAV> > out_subtitles.txt

How does it work?

This program performs fetching using YouTube-dl for downloading videos from video services, and then stripping the audio track out.

static_ffmpeg is then called to transcode the audio track into a specific format that DeepSpeech requires.

Once the audio file has been prepared, pydeepspeech is called. This little utility automatically downloads the proper AI models and installs them into the proper path so that deepspeech can be called. It also partitions the input wav file into chunks, split at the parts of silence, in order to make processing go easier (DeepSpeech degrades performance significantly with longer audio clips, so they have to be kept short.)

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.github/workflows		.github/workflows
tests		tests
transcribe_anything		transcribe_anything
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

transcribe-anything

Build Status

Example

Required: Install to current python environment

How does it work?

About

Releases

Packages

Languages

License

Soebb/transcribe_anything

Folders and files

Latest commit

History

Repository files navigation

transcribe-anything

Build Status

Example

Required: Install to current python environment

How does it work?

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages