This repository has been archived by the owner on Jul 18, 2024. It is now read-only.


LeapfrogAI Whisper Backend

⚠️This repo is archived in favor of the LeapfrogAI monorepo: https://github.com/defenseunicorns/leapfrogai⚠️

Description

A LeapfrogAI API-compatible faster-whisper wrapper for audio transcription across CPU and GPU infrastructure.

Usage

See the instructions below to get the backend up and running. Then, use the LeapfrogAI API server to interact with the backend.

Instructions

The instructions in this section assume the following:

  1. Python 3.11.x is properly installed and configured, including its development tools
  2. The LeapfrogAI API server is deployed and running
GPU Variation

The following are additional assumptions for GPU inferencing:
  1. You have properly installed one or more NVIDIA GPUs and GPU drivers
  2. You have properly installed and configured the cuda-toolkit and nvidia-container-toolkit
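The GPU prerequisites above can be sanity-checked from a shell before attempting GPU inferencing. These commands assume the NVIDIA driver, cuda-toolkit, and nvidia-container-toolkit installs provide their usual CLI entry points:

```shell
# Driver install and GPU visibility
nvidia-smi

# CUDA toolkit version (from cuda-toolkit)
nvcc --version

# Container toolkit version (from nvidia-container-toolkit)
nvidia-ctk --version
```

If any of these commands is missing or errors out, revisit the corresponding installation before proceeding.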

Model Selection

The default model that comes with this backend in this repository's officially released images is a CTranslate2 converted version of OpenAI's Whisper Base.

Other Whisper model sizes and variants can be loaded into this backend by modifying the MODEL_NAME variable during image creation or Makefile command execution.
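As a sketch, the override might look like the following, assuming the Makefile and Dockerfile read MODEL_NAME from the environment or a build argument; the model identifier shown (Systran/faster-whisper-tiny) is illustrative, not a tested default:

```shell
# Fetch a different Whisper variant instead of the default Base model
MODEL_NAME=Systran/faster-whisper-tiny make fetch-model

# Or bake the override into an image at build time
docker build --build-arg MODEL_NAME=Systran/faster-whisper-tiny -t whisper-tiny .
```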

Run Locally

GPU Variation

The following additional variables must be exported for local GPU inferencing:
# enable GPU switch
export GPU_ENABLED=true

# point to the venv's local CUDA 11.8 Python libs
export LD_LIBRARY_PATH=${PWD}/.venv/lib64/python3.11/site-packages/nvidia/cublas/lib:${PWD}/.venv/lib64/python3.11/site-packages/nvidia/cudnn/lib

# Install FFMPEG locally
sudo apt install ffmpeg

# Setup Virtual Environment
make create-venv
source .venv/bin/activate
make requirements-dev

# Clone Model
make fetch-model

# Start Model Backend
make dev
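Once `make dev` is running, one way to smoke-test the backend is to list the gRPC services it exposes on its default port (50051, per the Docker section below). This assumes grpcurl is installed and the server has gRPC reflection enabled, which this README does not confirm:

```shell
grpcurl -plaintext localhost:50051 list
```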

Run in Docker

Local Image Build and Run

For local image building and running.

make docker-build
# without GPU, CPU-only
make docker-run
# with GPU
make docker-run-gpu

Remote Image Build and Run

For pulling a tagged image from the main release repository.

Where <IMAGE_TAG> is one of the released package versions found here.

docker pull ghcr.io/defenseunicorns/leapfrogai/whisper:<IMAGE_TAG>
# add the "--gpus all" and "-e GPU_ENABLED=true" flags for GPU inferencing
docker run -p 50051:50051 -d --name whisper ghcr.io/defenseunicorns/leapfrogai/whisper:<IMAGE_TAG>
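After starting the container, standard Docker commands can confirm the backend came up; the container name whisper and port mapping below come from the run command above:

```shell
# Confirm the container is running and port 50051 is mapped
docker ps --filter "name=whisper"

# Tail the backend logs for startup errors
docker logs -f whisper
```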