[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"
[NeurIPS 2024] AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
[NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment
[ICASSP 2024] The official repo for Harnessing the Power of Large Vision Language Models for Synthetic Image Detection
Code for VLM4Bio, a benchmark dataset of scientific question-answer pairs used to evaluate pretrained VLMs for trait discovery from biological images.
Proactive Content Moderation Using LLMs and VLMs