#
joint
Here are 4 public repositories matching this topic...
An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"
video
localization
caption
alignment
segmentation
coin
multimodality
joint
multimodal-sentiment-analysis
pretrain
pretraining
msrvtt
video-text-retrieval
video-text
video-language
youcookii
retrieval-task
caption-task
-
Updated
Jul 25, 2024 - Python
Improve this page
Add a description, image, and links to the joint topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the joint topic, visit your repo's landing page and select "manage topics."