OZEN toolkit, AI powered audio dataset helper.

OZEN is a small tool to help you process audio files to a LJ format.

Given a folder of files or a single audio file, it will extract the speech, transcribe using Whisper and save in the LJ format (wavs in wavs folder, train and valid txts).

INSTALLATION

Accept the license terms on https://huggingface.co/pyannote/segmentation 
Install Anaconda or setup your own environment and install requirements
git clone https://github.com/devilismyfriend/ozen-toolkit
run Set Up Ozen.bat

USAGE

Drag a folder or a file on the Drag_Here.bat to process it.

The first time you'll be prompted to provide an HuggingFace token, once you do a config file will be created where you can specifiy models to use, the validation/training data desired split and more.

Alternatively you can use ozen.py in cli.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
modules		modules
.gitattributes		.gitattributes
.gitignore		.gitignore
Drag_Here.cmd		Drag_Here.cmd
README.md		README.md
Set Up Ozen.bat		Set Up Ozen.bat
environment.yaml		environment.yaml
ozen.py		ozen.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OZEN toolkit, AI powered audio dataset helper.

INSTALLATION

USAGE

About

Releases

Packages

Languages

devilismyfriend/ozen-toolkit

Folders and files

Latest commit

History

Repository files navigation

OZEN toolkit, AI powered audio dataset helper.

INSTALLATION

USAGE

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages