🦩 Visual Instruction Tuning with Polite Flamingo - training multi-modal LLMs to be both clever and polite! (AAAI-24 Oral)
Updated Dec 9, 2023 - Python
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.
mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigation
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
[Paper][Preprint 2023] Making Large Language Models Perform Better in Knowledge Graph Completion
A Gradio demo of MGIE
A Video Chat Agent with Temporal Prior
A PyTorch-based system for highly accurate drug-target interaction predictions utilizing multi-modal large language models to discern structural affinities in drug-target pairs.
An Easy-to-use Hallucination Detection Framework for LLMs.
This repository contains code to evaluate various multimodal large language models using different instructions across multiple multimodal content comprehension tasks.
Voice assistant built on multimodal LLMs: a fine-tuned LLaVA-NeXT (Mistral 7B) and PhoWhisper
This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"
Multimodal RAG and comparisons between language models. (Project for Deep Learning Module at the FHSWF)
Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs
[CVPR 2024] 🎬💭 Chat with over 10K frames of video!
Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"
Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".
[ACL 2024] An Easy-to-use Hallucination Detection Framework for LLMs.
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding