vision-foundation-model

Here are 16 public repositories matching this topic...

A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository aggregates surveys, blog posts, and research papers that explore how LMMs represent, transform, and align multimodal information internally.

  • Updated Jun 18, 2025

CAST is a method for semi-supervised instance segmentation that efficiently trains a compact model using both labeled and unlabeled data. This repository contains the implementation of our three-stage pipeline, showcasing contrastive adaptation and distillation techniques. 🐙🌟

  • Updated Jun 18, 2025
