#

vision-audio-subtitle-text

Here is 1 public repository matching this topic...

TXH-mercury / VAST

Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

dataset vision-language audio-language multimodal-foundation-model cross-modality-pretraining vision-audio-subtitle-text

Updated Mar 14, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the vision-audio-subtitle-text topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the vision-audio-subtitle-text topic, visit your repo's landing page and select "manage topics."