AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
-
Updated
Jun 19, 2024 - Python
AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
A Framework of Small-scale Large Multimodal Models
An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
Awesome multi-modal large language paper/project, collections of popular training strategies, e.g., PEFT, LoRA.
Open Platform for Embodied Agents
Contains code and documentation for our VANE-Bench paper.
A collection of resources on applications of multi-modal learning in medical imaging.
[CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation
The official evaluation suite and dynamic data release for MixEval.
A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo
An open-source implementation of LLaVA-NeXT.
This is the official repository of the paper "Multi-Modal and Multi-Agent Systems Meet Rationality: A Survey"
An official implementation of ShareGPT4V: Improving Large Multi-modal Models with Better Captions
ShareGPT4Omni: Towards Building Omni Large Multi-modal Models with Comprehensive Multi-modal Annotations
A curated list of awesome Multimodal studies.
The offical Implementation of "Instruction-Guided Visual Masking"
"Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"
Code for "Adapting Large Multimodal Models to Distribution Shifts: The Role of In-Context Learning"
Add a description, image, and links to the large-multimodal-models topic page so that developers can more easily learn about it.
To associate your repository with the large-multimodal-models topic, visit your repo's landing page and select "manage topics."