AVSR-datasets

This repo contains three audio-visual speech recognition (AVSR) datasets: `AVLetters` [1], `AVDigits` [2] and `AVLetters2` [3]. The dataset files are all hosted on Google cloud.

References:

[1] Matthews, Iain, et al. "Extraction of visual features for lipreading." IEEE Transactions on Pattern Analysis and Machine Intelligence 24.2 (2002): 198-213.

[2] Hu, Di, and Xuelong Li. "Temporal multimodal learning in audiovisual speech recognition." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016.

[3] Cox, Stephen J., et al. "The challenge of multispeaker lip-reading." AVSP. 2008.
