Extract audio from a video file and transcribe it with Whisper, OpenAI's open-source neural network that approaches human-level robustness and accuracy on English speech recognition.

Olney1/Video-File-AI-Transcription

Video Transcription with OpenAI's Whisper

This Python script extracts the audio track from a video file and transcribes it to text with OpenAI's Whisper model. Whisper is an open-source neural network that approaches human-level robustness and accuracy on English speech recognition. Learn more about Whisper here.

Features

  • Audio Extraction: Uses ffmpeg to extract the audio track from a video file.
  • Speech to Text: Uses the Whisper model to transcribe the extracted audio.
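
The audio extraction step can be sketched as follows. This is a minimal illustration rather than the script's actual code: the function names `build_ffmpeg_command` and `extract_audio` are hypothetical, and the 16 kHz mono WAV output is an assumed choice (Whisper resamples its input to that format anyway). It assumes `ffmpeg` is available on the `PATH`.

```python
import subprocess
from pathlib import Path


def build_ffmpeg_command(video_path, audio_path):
    """Return an ffmpeg argument list that writes the audio track as 16 kHz mono WAV.

    Hypothetical helper; the output format is an assumption, not taken from the script.
    """
    return [
        "ffmpeg",
        "-y",                    # overwrite the output file if it already exists
        "-i", str(video_path),   # input video
        "-vn",                   # drop the video stream
        "-acodec", "pcm_s16le",  # uncompressed 16-bit PCM
        "-ar", "16000",          # 16 kHz sample rate
        "-ac", "1",              # single (mono) channel
        str(audio_path),
    ]


def extract_audio(video_path, audio_path=None):
    """Run ffmpeg and return the path of the extracted audio file."""
    video_path = Path(video_path)
    if audio_path is None:
        # Default: same name as the video, with a .wav extension.
        audio_path = video_path.with_suffix(".wav")
    # check=True raises CalledProcessError if ffmpeg exits with an error.
    subprocess.run(build_ffmpeg_command(video_path, audio_path), check=True)
    return Path(audio_path)
```

Calling `extract_audio("talk.mp4")` would write `talk.wav` next to the video. Building the command as a list instead of a shell string avoids quoting problems with file names that contain spaces.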

Prerequisites

Before you begin, ensure you have the following installed:

  • Python 3.8 or higher (the openai-whisper package does not support older versions)
  • ffmpeg - For audio extraction from video
  • Whisper Python library - For transcribing audio to text

You can install the Whisper library using pip:

pip install openai-whisper
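
Once the prerequisites are installed, checking for them and transcribing an audio file might look like the sketch below. The helper names `missing_prerequisites` and `transcribe_audio` are hypothetical, and the `"base"` model size is an assumed default. `whisper.load_model` and `model.transcribe` are the calls documented by the openai-whisper package.

```python
import importlib.util
import shutil


def missing_prerequisites():
    """Return the names of prerequisites that cannot be found on this machine."""
    missing = []
    # ffmpeg must be reachable on the PATH.
    if shutil.which("ffmpeg") is None:
        missing.append("ffmpeg")
    # The openai-whisper package installs a module named "whisper".
    if importlib.util.find_spec("whisper") is None:
        missing.append("openai-whisper")
    return missing


def transcribe_audio(audio_path, model_name="base"):
    """Transcribe an audio file with Whisper and return the text.

    "base" is an assumed default; larger models are slower but usually more accurate.
    """
    import whisper  # imported here so the prerequisite check works without it

    model = whisper.load_model(model_name)  # downloads the weights on first use
    result = model.transcribe(str(audio_path))
    return result["text"].strip()
```

A caller could first confirm that `missing_prerequisites()` returns an empty list and then print `transcribe_audio("talk.wav")`, where `talk.wav` stands in for the audio file extracted earlier.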
