This repository publicly shares the training and test data of TMHINT-QI version II, which was used as one of the tracks in the VoiceMOS Challenge 2023.
TMHINT-QI version II is an updated version of the original TMHINT-QI dataset. The version I evaluation set contains no unseen scenarios; version II addresses this concern by modifying the training set and adding unseen systems to the evaluation set.
The training set covers four acoustic conditions: clean, babble, white, and pink noise. It contains noisy, clean, and enhanced utterances from four enhancement systems: Minimum Mean Square Error (MMSE), Deep Denoising Autoencoder (DDAE), Fully Convolutional Network (FCN), and Transformer. In total, the training set comprises 11,053 utterances, each with a corresponding quality score (0-5) and intelligibility score (0-10).
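As a minimal sketch of how the labels might be consumed, the snippet below parses a hypothetical per-utterance label file and checks that scores fall in the ranges stated above (quality 0-5, intelligibility 0-10). The column names and CSV layout here are assumptions for illustration, not the actual file format shipped with the dataset.

```python
import csv
import io

# Hypothetical label layout (assumption, not the real file format):
# one row per utterance with filename, noise condition, enhancement
# system, quality score (0-5), and intelligibility score (0-10).
SAMPLE_LABELS = """filename,noise,system,quality,intelligibility
u0001.wav,babble,MMSE,3.2,8.5
u0002.wav,white,FCN,2.8,7.0
u0003.wav,clean,none,4.9,10.0
"""

def load_labels(text):
    """Parse label rows and sanity-check the documented score ranges."""
    rows = []
    for r in csv.DictReader(io.StringIO(text)):
        q = float(r["quality"])
        i = float(r["intelligibility"])
        # Scores outside the documented ranges indicate a malformed row.
        assert 0.0 <= q <= 5.0, f"quality out of range: {q}"
        assert 0.0 <= i <= 10.0, f"intelligibility out of range: {i}"
        rows.append({"filename": r["filename"], "noise": r["noise"],
                     "system": r["system"], "quality": q,
                     "intelligibility": i})
    return rows

labels = load_labels(SAMPLE_LABELS)
print(len(labels))  # 3
```

In practice the same loop would read the downloaded label file from disk; the range checks are a cheap guard against parsing errors before training an assessment model.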
Please refer to the following link for the updated training-set split, and download the corresponding utterances from the link below.
The TMHINT-QI version II test set covers five acoustic conditions: clean, babble, white, and pink noise, plus street noise as the unseen condition. It contains noisy, clean, and enhanced utterances from three seen enhancement systems (FCN, MMSE, and Transformer) and two unseen systems: Conformer-based Metric Generative Adversarial Network (CMGAN) and DEMUCS. In total, the test set comprises 1,960 utterances.
The TMHINT-QI version II test set can be downloaded at the following link, and the corresponding utterances can be downloaded here.
In the VoiceMOS Challenge 2023, our system (T02), based on an improved version of MOSA-Net, was the top performer among all submitted systems. A detailed description of our system can be found at the following link.
Please cite our paper if you use this dataset in your research.
@misc{zezario2023study,
title={A Study on Incorporating Whisper for Robust Speech Assessment},
author={Ryandhimas E. Zezario and Yu-Wen Chen and Szu-Wei Fu and Yu Tsao and Hsin-Min Wang and Chiou-Shann Fuh},
year={2023},
eprint={2309.12766},
archivePrefix={arXiv},
primaryClass={eess.AS}
}