Tracks trends in representation learning for multimodal machine learning (MMML).

kealennieh/MultiModal-Machine-Learning

MultiModal Machine Learning

1. Paper

2021

  1. [CVPR oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning paper code

  2. [ICML] ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision paper code

2020

  1. [NeurIPS] Large-Scale Adversarial Training for Vision-and-Language Representation Learning paper code

2019

  1. [NeurIPS] ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks paper code

  2. [EMNLP] LXMERT: Learning Cross-Modality Encoder Representations from Transformers paper code

  3. [arXiv] VisualBERT: A Simple and Performant Baseline for Vision and Language paper code

2018

  1. [TPAMI] Multimodal Machine Learning: A Survey and Taxonomy paper

2. Dataset

1.

3. Others

1. awesome-multimodal-ml

  • website
  • Reading list for research topics in multimodal machine learning
