Whisper Diarization Experiment

Overview

This repository contains a Dockerized application for audio diarization using the Whisper model. It leverages NVIDIA CUDA for performance optimization and includes a Gradio-based web interface for easy interaction.

Features

NVIDIA CUDA Base: Utilizes NVIDIA CUDA for efficient processing.
Gradio Web Interface: Provides a user-friendly interface for audio file processing.
Audio Diarization: Implements diarization using the Whisper model for audio files.

Prerequisites

Docker
NVIDIA GPU with CUDA support

Installation

Clone the Repository:

git clone https://github.com/domtoro/whisper-diarization-experiment.git
cd whisper-diarization-experiment

Build the Docker Image:

docker build -t domtoro/whisper-diarization-experiment:0.0.1 .

Usage

Access the Gradio Interface: Open a web browser and navigate to http://localhost:7860 to use the Gradio interface.
Upload Audio File: Use the Gradio interface to upload an audio file for diarization.
View Results: After processing, the diarization results will be displayed on the Gradio interface.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
cache		cache
.dockerignore		.dockerignore
Dockerfile		Dockerfile
README.md		README.md
gradio-ui.py		gradio-ui.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cache

cache

.dockerignore

.dockerignore

Dockerfile

Dockerfile

README.md

README.md

gradio-ui.py

gradio-ui.py

Repository files navigation

Whisper Diarization Experiment

Overview

Features

Prerequisites

Installation

Usage

About

Languages

domtoro/whisper-diarization-experiment

Folders and files

Latest commit

History

Repository files navigation

Whisper Diarization Experiment

Overview

Features

Prerequisites

Installation

Usage

About

Topics

Resources

Stars

Watchers

Forks

Languages