CI-AVSR

Repository for the paper CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition and the corresponding new dataset, which is accepted in LREC 2022.

If you find our dataset or code useful, please cite this paper, thanks!

@article{Dai2022CIAVSRAC,
  title={CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition},
  author={Wenliang Dai and Samuel Cahyawijaya and Tiezheng Yu and Elham J. Barezi and Peng Xu and Cheuk Tung Shadow Yiu and Rita Frieske and Holy Lovenia and Genta Indra Winata and Qifeng Chen and Xiaojuan Ma and Bertram E. Shi and Pascale Fung},
  journal={ArXiv},
  year={2022},
  volume={abs/2201.03804}
}

Data

Version 1.0

For details of the dataset, please refer to the paper.

Clean Sets

The originally collected data, including train_clean.csv, valid_clean.csv, test_clean.csv.

Noise Augmented Sets

The noise augmented data, including train_noisy.csv, valid_noisy.csv, test_noisy.csv. In addition, there is also a out-of-domain test set test_noisy_ood.csv to evaluate the generalization of models, i.e. these noises are not in the training set.

Data Download

[CI-AVSR Dataset] We provide the processed CI-AVSR dataset with audios, image frames (25 per seconds), annotations, and augmentation.

[Raw Videos] Just in case, we also provide the unprocessed raw videos (you may not need to use them).

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
dataset		dataset
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
args_helper.py		args_helper.py
data_collator_ctc.py		data_collator_ctc.py
data_utils.py		data_utils.py
eval.py		eval.py
mm_wrapper.py		mm_wrapper.py
preprocess_data.py		preprocess_data.py
requirements.txt		requirements.txt
run_eval.sh		run_eval.sh
run_eval_mm.sh		run_eval_mm.sh
run_eval_mm_noise.sh		run_eval_mm_noise.sh
run_eval_noise.sh		run_eval_noise.sh
run_train.sh		run_train.sh
run_train_mm.sh		run_train_mm.sh
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CI-AVSR

Data

Clean Sets

Noise Augmented Sets

Data Download

About

Releases

Packages

Languages

License

HLTCHKUST/CI-AVSR

Folders and files

Latest commit

History

Repository files navigation

CI-AVSR

Data

Clean Sets

Noise Augmented Sets

Data Download

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages