This repository fine-tunes the Parakeet CTC speech model on conversational speech datasets using the Hugging Face transformers and datasets libraries.
git clone https://github.com/Deep-unlearning/Finetune-Parakeet.git
cd Finetune-Parakeet

Choose your preferred package manager:

Using UV (recommended)
uv venv .venv --python 3.10 && source .venv/bin/activate
uv pip install -r requirements.txt

Using pip
python3.10 -m venv .venv && source .venv/bin/activate
pip install --upgrade pip
pip install -r requirements.txt

If you swap in a different dataset, make sure that after loading it still has:
- an audio column (cast to Audio(sampling_rate=16000)), and
- a text column (the reference transcription).
If your dataset uses different column names, map them to audio and text before returning.
Run the training script:
uv run train.py

Logs and checkpoints will be saved under the outputs/ directory by default.
You can also run the training script with LoRA:
uv run train_lora.py

Happy fine-tuning Parakeet!