Skip to content
View 9bow's full-sized avatar
🤔
doing fewer things better
🤔
doing fewer things better

Sponsoring

@ohmyzsh
@python
@pyenv
@Homebrew
@numfocus
@parkr
@ggerganov
@jart

Organizations

@shineware @mnms @PyTorchKorea @cloudbandwagon

Block or report 9bow

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

👀CV&VLM

13 repositories

[CVPR'22] Official PyTorch Implementation of "Collaborative Transformers for Grounded Situation Recognition"

Python 50 7 Updated Apr 9, 2023

[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"

Python 525 40 Updated Jan 27, 2024

Collection of AWESOME vision-language models for vision tasks

3,085 233 Updated Oct 14, 2025

Awesome-Remote-Sensing-Vision-Language-Models

191 10 Updated Apr 27, 2024

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Python 5,619 942 Updated Jan 12, 2026

KoLLaVA: Korean Large Language-and-Vision Assistant (feat.LLaVA)

Jupyter Notebook 297 30 Updated Sep 20, 2024

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,536 189 Updated Apr 2, 2025

mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)

Python 228 21 Updated Jul 21, 2023

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 6,430 1,230 Updated Jul 30, 2024

[AAAI 2024 Oral] AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models

Python 1,069 140 Updated Dec 20, 2023

Paper list and datasets for industrial image anomaly/defect detection (updating). 工业异常/瑕疵检测论文及数据集检索库(持续更新)。

3,307 290 Updated Feb 12, 2026

This repo contains annotated research papers that I found really good and useful

2,766 268 Updated Dec 24, 2025

📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

982 42 Updated Sep 27, 2025