
community: add video transcript loaders using Whisper for enhanced video transcription #21426

Open
wants to merge 7 commits into master

Conversation

TeodorZlatanov

Description:
This pull request introduces six new loader classes to the community package, adding video-processing capabilities built on the Whisper model. The loaders transcribe video files into either segment- or paragraph-level text, and each returned document carries the transcription along with metadata such as the source, start time, and end time. Three backends are supported: Azure OpenAI, a local Whisper model, and the OpenAI API, each with both paragraph- and segment-based variants. A brief usage sketch follows the class list below.

New Classes:

  • AzureWhisperVideoParagraphLoader: Processes video files into paragraphs using Azure's Whisper API.
  • AzureWhisperVideoSegmentLoader: Processes video files into segments using Azure's Whisper API.
  • LocalWhisperVideoParagraphLoader: Transcribes local video files into paragraphs using the local Whisper model.
  • LocalWhisperVideoSegmentLoader: Transcribes local video files into segments using the local Whisper model.
  • OpenAIWhisperVideoParagraphLoader: Utilizes OpenAI's cloud-based Whisper API to transcribe videos into paragraphs.
  • OpenAIWhisperVideoSegmentLoader: Utilizes OpenAI's cloud-based Whisper API to transcribe videos into segments.
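For illustration, here is a minimal usage sketch of one of the local loaders. Only the class name is taken from this PR; the import path, the file_path constructor argument, and the exact metadata keys (start, end) are assumptions based on the usual LangChain document loader interface and the description above.

    # Hypothetical usage sketch: the class name comes from this PR, but the import
    # path, the file_path argument, and the metadata keys are assumptions.
    from langchain_community.document_loaders import LocalWhisperVideoParagraphLoader

    loader = LocalWhisperVideoParagraphLoader(file_path="lecture.mp4")  # assumed parameter name
    docs = loader.load()  # standard LangChain loader entry point

    for doc in docs:
        # Each Document is expected to carry one transcribed paragraph plus
        # metadata such as the source and start/end times (per the description).
        print(doc.metadata.get("source"), doc.metadata.get("start"), doc.metadata.get("end"))
        print(doc.page_content[:80])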

Dependencies:

  • Whisper: Required for local loaders. Install via pip:
    pip install openai-whisper
    
  • OpenAI: Required for Azure OpenAI and OpenAI loaders. Install via pip:
    pip install openai
    
  • FFmpeg: Required for preprocessing video files into an audio format that Whisper can process (a short preprocessing sketch follows this list). Install FFmpeg:
    # on Ubuntu or Debian
    sudo apt update && sudo apt install ffmpeg
    
    # on Arch Linux
    sudo pacman -S ffmpeg
    
    # on macOS using Homebrew (https://brew.sh/)
    brew install ffmpeg
    
    # on Windows using direct download:
    Download from https://ffmpeg.org/download.html and add the executable to your PATH.
    
    # on Windows using Chocolatey (https://chocolatey.org/)
    choco install ffmpeg
    
    # on Windows using Scoop (https://scoop.sh/)
    scoop install ffmpeg
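To show roughly how these dependencies fit together, here is a hedged sketch of the preprocessing the local loaders depend on: FFmpeg extracts an audio track from the video, and openai-whisper transcribes it into timestamped segments. The file names and the "base" model size are placeholders; the loaders in this PR may handle this differently internally.

    import subprocess
    import whisper  # provided by the openai-whisper package

    # Extract a 16 kHz mono WAV track from the video; Whisper consumes plain audio.
    subprocess.run(
        ["ffmpeg", "-y", "-i", "lecture.mp4", "-ar", "16000", "-ac", "1", "audio.wav"],
        check=True,
    )

    # Transcribe locally; "base" is a placeholder model size.
    model = whisper.load_model("base")
    result = model.transcribe("audio.wav")

    # Each segment carries start/end timestamps, which correspond to the
    # segment-level metadata described for the loaders above.
    for seg in result["segments"]:
        print(f'{seg["start"]:.1f}-{seg["end"]:.1f}: {seg["text"].strip()}')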

TeodorZlatanov (Author) commented:
@hwchase17 , just wanted to bring this PR to your attention.
