Skip to content

vbrydik/pyw2v2

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pyw2v2

Work in progress!

This module is a wrapper for Wav2Vec2 models, intended to accelerate ARS research.

Install

This module can be easily be installed with pip:

pip install pyw2v2

Examples

Different examples can be found here.

Fine-tuning CTC model example

This example will show how to load a pretrained model, load dataset, process dataset, and fine-tune CTC model.

Example configuration files can be found here.

from pyw2v2 import ModelCTC, DatasetPreprocessor
from pyw2v2.utils import load_config, load_custom_dataset_commonvoice_format

if __name__ == "__main__":
    # Load pretrained model
    model_config = load_config("../configs/ctc/default.yaml")
    model = ModelCTC(model_config)
    
    # Loading dataset in Common Voice format
    train_set = load_custom_dataset_commonvoice_format('../datasets/example', 'train')
    eval_set = load_custom_dataset_commonvoice_format('../datasets/example', 'test')

    # Set up dataset preprocessor
    dataproc_config = load_config("../configs/dataproc/default.yaml").data_proc
    data_processor = DatasetPreprocessor(dataproc_config)
    data_processor.processor = model.processor

    # Process data
    train_set = data_processor.process(train_set, dataproc_config.n_samples_train)
    eval_set = data_processor.process(eval_set, dataproc_config.n_samples_test)
    
    # Train/Fine-tune model
    model.train(train_set, eval_set)

Releases

No releases published

Packages

No packages published

Languages