Skip to content
/ wvmos Public

MOS score prediction by fine-tuned wav2vec2.0 model

Notifications You must be signed in to change notification settings

AndreevP/wvmos

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

WV-MOS

MOS score prediction by fine-tuned wav2vec2.0 model

Keywords: MOS-Net, MB-Net, PESQ, STOI, speech quality

Getting started

The package installation was tested with python3.9

pip install git+https://github.com/AndreevP/wvmos

Inference

from wvmos import get_wvmos
model = get_wvmos(cuda=True)

mos = model.calculate_one("path/to/wav/file") # infer MOS score for one audio 

mos = model.calculate_dir("path/to/dir/with/wav/files", mean=True) # infer average MOS score across .wav files in directory

Citation and Acknowledgment

This work was done for the deep learning course in Skolteh university by Pavel Andreev, Nikolay Patakin, Oleg Desheulin, Alexander Kagan and Arthur Bulanbaev. More details are described in paper https://arxiv.org/abs/2203.13086

About

MOS score prediction by fine-tuned wav2vec2.0 model

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages