Code and data for the benchmark "Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models"
Updated Jun 18, 2024 - Python
Implementation of "Arcana: Improving Multi-modal Large Language Model through Boosting Vision Capabilities"
Multi-Modal Representational Learning for Social Media Popularity Prediction
Contains code and documentation for our VANE-Bench paper.
VideoHallucer: the first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)
FreeVA: Offline MLLM as Training-Free Video Assistant
Multimodal RAG and comparisons between language models. (Project for Deep Learning Module at the FHSWF)
A PyTorch-based system for highly accurate drug-target interaction predictions utilizing multi-modal large language models to discern structural affinities in drug-target pairs.
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
A Video Chat Agent with Temporal Prior
This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"
This repository contains code to evaluate various multimodal large language models using different instructions across multiple multimodal content comprehension tasks.
[ACL 2024] An Easy-to-use Hallucination Detection Framework for LLMs.
mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigation
Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".
Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"
Pressure-testing large video-language models (LVLMs): multimodal retrieval from an LVLM at any video length to measure accuracy