A curated list of multimodal dialogue models and related resources.
Please feel free to open a pull request or an issue to add papers.

| Type | UIUO | MIUO | MIMO | L2V | V2L | Other |
|---|---|---|---|---|---|---|
| Explanation | Unimodal Input & Unimodal Output | Multimodal Input & Unimodal Output | Multimodal Input & Multimodal Output | Language to Vision | Vision to Language | Other types |

| Title | Venue | Type | Code | Star |
|---|---|---|---|---|
| InstructPix2Pix: Learning to Follow Image Editing Instructions | CVPR-Highlight | MIUO | PyTorch(Author) | |
| BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models | ICML | V2L | PyTorch(Author) | |
| | NeurIPS | UIUO | PyTorch(Author) | |

| Title | Venue | Type | Code | Star |
|---|---|---|---|---|
| Flamingo: a Visual Language Model for Few-Shot Learning | NeurIPS | MIUO | PyTorch(Author) | |
| | NeurIPS | UIUO | PyTorch(Author) | |
| | NeurIPS | UIUO | PyTorch(Author) | |

| Title | Venue | Type | Code | Star |
|---|---|---|---|---|
| | NeurIPS | UIUO | PyTorch(Author) | |
| | NeurIPS | UIUO | PyTorch(Author) | |
| | NeurIPS | UIUO | PyTorch(Author) | |

| Title | Venue | Type | Code | Star |
|---|---|---|---|---|
| | NeurIPS | UIUO | PyTorch(Author) | |
| | NeurIPS | UIUO | PyTorch(Author) | |
| | NeurIPS | UIUO | PyTorch(Author) | |

| Title | Venue | Type | Code | Star |
|---|---|---|---|---|
| | NeurIPS | UIUO | PyTorch(Author) | |
| | NeurIPS | UIUO | PyTorch(Author) | |
| | NeurIPS | UIUO | PyTorch(Author) | |

- Awesome-Multimodal-Large-Language-Models
- awesome-multimodal-ml
- Awesome-Multimodal-Research
- Awesome-Text-to-Image
- Awesome-Multimodal-LLM
- awesome-llm-and-aigc
- Awesome-Multimodal-Chatbot
- Awesome-Multimodal-LLM
- awesome-free-chatgpt
- awesome-generative-ai
- Awesome-LLM
- awesome-vision-and-language
- awesome-vision-language-pretraining-papers
- awesome-chatgpt-dataset