This repository contains the official implementation of our CVPR 2026 paper "Rethinking Token Reduction for Large Vision-Language Models". We propose a learning-based, prompt-agnostic token compression method tailored to Large Vision-Language Models (LVLMs) in multi-turn Visual Question Answering (MT-VQA) scenarios.
⌛️ Code Release Update: The code is currently being organized and will be released as soon as possible.