HKAB/whisper-finetune-vietnamese

Whisper Vietnamese finetuning

[In case you are looking for a Vietnamese ASR model, have a look at HKAB/whisper-finetune-1-notebook]

This repository contains:

  • Notebooks for finetuning, inference, and generating an N-gram language model.
  • Demos of Whisper and Wav2vec.

Installation

To use beam search with a language model (LM), install Whisper from my GitHub fork:

    pip install git+https://github.com/HKAB/whisper.git

Run

For training and inference:

    python finetune.py --model_name base \
                       --dataset vin100h

    python test.py --checkpoint_path path/to/ckpt \
                   --dataset vin100h \
                   --model_name base

To generate a language model with KenLM, use the notebook in the notebooks folder.

We share a checkpoint trained with these settings: base model, batch_size 1, gradient accumulation steps 10, epoch 14, lr 0.0001.
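With a batch size of 1 and 10 gradient accumulation steps, each optimizer update effectively sees 1 × 10 = 10 samples. The following is a minimal sketch of that schedule in plain Python; the function name `accumulation_schedule` and the run length of 25 batches are illustrative assumptions and are not taken from finetune.py.

```python
def accumulation_schedule(num_batches: int, accum_steps: int) -> list[int]:
    """Return the 1-based batch indices after which the optimizer would step."""
    if accum_steps < 1:
        raise ValueError("accum_steps must be >= 1")
    return [i for i in range(1, num_batches + 1) if i % accum_steps == 0]


batch_size = 1      # from the shared checkpoint settings
accum_steps = 10    # from the shared checkpoint settings
effective_batch_size = batch_size * accum_steps

# Hypothetical run over 25 batches (assumption, for illustration only).
update_points = accumulation_schedule(25, accum_steps)
print(effective_batch_size, update_points)
```

In this sketch the last 5 batches of the hypothetical run never trigger an update; whether a real training loop flushes such a partial accumulation depends on how the loop is written.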

Demo

Whisper

Wav2vec

Contributions

  • We finetune Whisper on a 100-hour speech dataset (VinBigdata-VLSP2020-100h).
  • We implement BeamSearchWithLM using KenLM, which showed positive results.
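One common way to combine a speech model with an n-gram LM during beam search is shallow fusion: each hypothesis is ranked by its speech-model log-probability plus a weighted LM log-probability and a length bonus. The sketch below illustrates that scoring rule only. It is not the code of BeamSearchWithLM, which may work differently, and the weights `alpha` and `beta`, the toy LM, and the candidate texts and scores are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Hypothesis:
    text: str
    asr_logprob: float  # log-probability from the speech model (made-up values)


def toy_lm_logprob(text: str) -> float:
    """Stand-in for an n-gram LM score; a real setup would query KenLM instead."""
    known_bigrams = {("xin", "chào"): -0.5, ("chào", "bạn"): -0.7}
    words = text.split()
    # Unknown bigrams get a fixed penalty of -5.0 (arbitrary choice).
    return sum(known_bigrams.get(pair, -5.0) for pair in zip(words, words[1:]))


def fused_score(hyp: Hypothesis, alpha: float = 0.5, beta: float = 0.1) -> float:
    """Shallow fusion: ASR score + alpha * LM score + beta * word count."""
    word_count = len(hyp.text.split())
    return hyp.asr_logprob + alpha * toy_lm_logprob(hyp.text) + beta * word_count


beam = [
    Hypothesis("xin chào bạn", asr_logprob=-2.3),
    Hypothesis("xin chao bạn", asr_logprob=-2.1),  # better ASR score, unlikely text
]
best = max(beam, key=fused_score)
print(best.text)
```

In this toy beam the second candidate has the higher speech-model score, but the LM term pushes the first candidate to the top, which is the effect LM-aware beam search is meant to have.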

Result

| Methods                  | Fleurs | Vin100h (Full) |
|--------------------------|--------|----------------|
| Whisper (base)           | 50.38% | 50.33%         |
| Finetune Whisper (base)  | 28.68% | 33%            |
| Whisper (large) one shot | -      | 26.87%         |
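To put the base-model rows in perspective, the relative change from finetuning can be computed directly from the table. This sketch assumes the percentages are error rates where lower is better (the table does not name the metric); the helper `relative_reduction` is a name introduced here for illustration.

```python
def relative_reduction(baseline: float, finetuned: float) -> float:
    """Relative reduction of an error rate, as a fraction of the baseline."""
    return (baseline - finetuned) / baseline


# Values copied from the results table (in percent): (Whisper base, finetuned base).
results = {
    "Fleurs": (50.38, 28.68),
    "Vin100h (Full)": (50.33, 33.0),
}

for dataset, (base, tuned) in results.items():
    print(f"{dataset}: {relative_reduction(base, tuned):.1%} relative reduction")
```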

About

Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM
