LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning
Updated Aug 8, 2025 - Python
The official repository of our work "Pensieve: Retrospect-then-Compare Mitigates Visual Hallucination"
FaceXBench: Evaluating Multimodal LLMs on Face Understanding
The official code repository for EgoOrientBench (CVPR 2025)
A comprehensive collection of open-source tools, code implementations, research notes, and practical tutorials for autonomous driving. Key focuses include AI/ML (reinforcement learning, Transformers, and multimodal LLMs with PyTorch), robotics (ROS integration and simulation), and theoretical foundations.