@Q-Future

Visual Evaluation with Foundation Models

We are working towards a future in which a single foundation model serves as a multi-purpose expert for low-level visual perception and visual evaluation.

👁️‍🗨️ Low-level Visual Perception in the Foundation Model Era

🔖Aiming at next-era cornerstone research

Low-level Visual Perception | Multi-Modality Large Language Models | Visual Quality Assessment

📖Main Projects

  • Co-Instruct [ECCV 2024, Oral]: Homepage, Repo, Demo. An open-ended visual quality comparer (up to 4 images) and low-level visual assistant; an improved version of ②Q-Instruct [CVPR 2024].

  • Q-Align [ICML 2024]: Homepage, Repo, Demo. A unified visual scorer for images and videos, built via text-instructed alignment of multi-modality foundation models; it can be efficiently fine-tuned on additional datasets with consistently strong performance. State-of-the-art on IQA, VQA, and IAA.

  • Q-Instruct [CVPR 2024]: Homepage, Repo, 200K Dataset, Technical Report. A large-scale instruction-tuning dataset that improves the low-level perceptual abilities of foundation models.

  • Q-Bench+ [ICLR 2024, Spotlight]: Homepage, Repo, Data-Single, Data-Pair, Preprint. The first benchmark for foundation models on low-level vision.
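The "text-instructed alignment" behind Q-Align teaches the model to answer with a discrete rating word, then recovers a continuous score as the probability-weighted average of the rating levels. The sketch below illustrates that scoring step only; the level words follow the common five-level ITU convention, and the function name and example log-probabilities are illustrative assumptions, not the released API.

```python
import math

# Five discrete rating levels, mapped to scores 1..5 (assumed ordering).
LEVELS = ["bad", "poor", "fair", "good", "excellent"]

def rating_token_score(level_logprobs):
    """Turn per-level token log-probabilities into a scalar quality score.

    Applies a softmax restricted to the five level tokens, then takes the
    expectation of the level index, yielding a score in [1, 5].
    """
    probs = [math.exp(level_logprobs[level]) for level in LEVELS]
    total = sum(probs)
    probs = [p / total for p in probs]  # normalize over the level tokens only
    return sum((i + 1) * p for i, p in enumerate(probs))

# Example: a model fairly confident the image is "good" lands near 4.
example = {"bad": -6.0, "poor": -4.0, "fair": -1.5, "good": 0.0, "excellent": -2.0}
print(round(rating_token_score(example), 2))
```

Reading off the level-token probabilities rather than parsing free-form text is what makes the scorer differentiable-friendly and stable across datasets.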

🖋️Extension Projects

  • Q-Boost: Homepage. A discussion on boosting IQA performance for MLLMs that are not specifically aligned for IQA.

  • [Pending] Chinese-Q-Bench / 质衡: Homepage, Repo. The first attempt to test multilingual abilities on low-level vision.

Maintained by Teo Wu@Singapore and Zicheng Zhang@Shanghai.

Pinned

  1. Q-Align

    ③ [ICML 2024] [IQA, IAA, VQA] An all-in-one foundation model for visual scoring; can be efficiently fine-tuned on downstream datasets.

    Python · 445 stars · 26 forks

  2. Q-Bench

    ① [ICLR 2024, Spotlight] (GPT-4V / Gemini-Pro / Qwen-VL-Plus + 16 open-source MLLMs) A benchmark for multi-modality LLMs (MLLMs) on low-level vision and visual quality assessment.

    Jupyter Notebook · 269 stars · 13 forks

  3. Q-Instruct

    ② [CVPR 2024] Low-level visual instruction tuning, with a 200K dataset and a model zoo of fine-tuned checkpoints.

    Python · 224 stars · 10 forks

  4. Co-Instruct

    ④ [ECCV 2024, Oral; comparison among multiple images!] A study on open-ended multi-image quality comparison: a dataset, a model, and a benchmark.

    81 stars · 5 forks

  5. Visual-Question-Answering-for-Video-Quality-Assessment

    Officially released code for the VQA² series of models.

    Python · 42 stars · 1 fork

  6. A-Bench

    [ICLR 2025] What do we expect from LMMs as AIGI evaluators and how do they perform?

    143 stars · 3 forks
