Skip to content

SkalskiP/top-cvpr-2026-papers

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

visitor badge

top CVPR 2026 papers

2023 | 2024 | 2025 | 2026

👋 hello

Computer Vision and Pattern Recognition is a massive conference. In 2026 alone, 16,092 papers were submitted, and 4,090 were accepted. I created this repository to help you search for crème de la crème of CVPR publications. If the paper you are looking for is not on my short list, take a peek at the full list of accepted papers.

🗞️ papers and posters

📢 - oral | 🔥 - highlight | 🏆 - best paper

3d vision

SAM 3D: 3Dfy Anything in Images 🏆 SAM 3D: 3Dfy Anything in Images
Jianing Yang, Georgia Gkioxari, Anushka Sagar, Aohan Lin, Bowen Song, Bowen Zhang, Fu-Jen Chu, Hao Tang, ...
[paper] [code] [video]
Topic: 3D Vision
Session: Fri 5 Jun 13:00-14:15 Oral Session 2A #5 | Fri 5 Jun 16:00-18:00 Poster Session 2 #5



B³-Seg: Camera-Free, Training-Free 3DGS Segmentation via Analytic EIG and Beta-Bernoulli Bayesian Updates 🏆 B³-Seg: Camera-Free, Training-Free 3DGS Segmentation via Analytic EIG and Beta-Bernoulli Bayesian Updates
Hiromichi Kamata, Samuel Arthur Munro, Fuminori Homma
[paper] [video]
Topic: 3D Vision
Session: Sat 6 Jun 16:45-18:45 Poster Session 4 #507



Efficiently Reconstructing Dynamic Scenes One D4RT at a Time 🏆 Efficiently Reconstructing Dynamic Scenes One D4RT at a Time
Chuhan Zhang, Guillaume Le Moing, Skanda Koppula, Ignacio Rocco, Liliane Momeni, Junyu Xie, Shuyang Sun, Rahul Sukthankar, ...
[paper] [video]
Topic: 3D Vision
Session: Fri 5 Jun 13:00-14:15 Oral Session 2D #2 | Fri 5 Jun 16:00-18:00 Poster Session 2 #20



4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation 🏆 4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation
Chiao-An Yang, Ryo Hachiuma, Sifei Liu, Subhashree Radhakrishnan, Raymond A. Yeh, Yu-Chiang Frank Wang, Min-Hung Chen
[paper] [code] [video] [demo]
Topic: 3D Vision
Session: Sun 7 Jun 11:45-13:45 Poster Session 5 #225



Featurising Pixels from Dynamic 3D Scenes with Linear In-Context Learners 📢 Featurising Pixels from Dynamic 3D Scenes with Linear In-Context Learners
Nikita Araslanov, Martin Sundermeyer, Hidenobu Matsuki, David Joseph Tan, Federico Tombari
[paper] [video]
Topic: 3D Vision
Session: Sat 6 Jun 14:00-15:15 Oral Session 4A: Geometric Understanding #2 | Sat 6 Jun 16:45-18:45 Poster Session 4 #2



MuM: Multi-View Masked Image Modeling for 3D Vision MuM: Multi-View Masked Image Modeling for 3D Vision
David Nordström, Johan Edstedt, Fredrik Kahl, Georg Bökman
[paper] [code] [video]
Topic: 3D Vision
Session: Sat 6 Jun 16:45-18:45 Poster Session 4 #28



Emergent Outlier View Rejection in Visual Geometry Grounded Transformers Emergent Outlier View Rejection in Visual Geometry Grounded Transformers
Jisang Han, Sunghwan Hong, Jaewoo Jung, Wooseok Jang, Honggyu An, Qianqian Wang, Seungryong Kim, Chen Feng
[paper] [code] [video]
Topic: 3D Vision
Session: Fri 5 Jun 10:45-12:45 Poster Session 1 #41



AsymLoc: Towards Asymmetric Feature Matching for Efficient Visual Localization 🏆 AsymLoc: Towards Asymmetric Feature Matching for Efficient Visual Localization
Mohammad Omama, Gabriele Berton, Eric Foxlin, Yelin Kim
[paper]
Topic: 3D Vision
Session: Sat 6 Jun 16:45-18:45 Poster Session 4 #467



tttLRM: Test-Time Training for Long Context and Autoregressive 3D Reconstruction 🏆 tttLRM: Test-Time Training for Long Context and Autoregressive 3D Reconstruction
Chen Wang, Hao Tan, Wang Yifan, Zhiqin Chen, Yuheng Liu, Kalyan Sunkavalli, Sai Bi, Lingjie Liu, ...
[paper] [code] [video]
Topic: 3D Vision
Session: Sun 7 Jun 15:30-17:30 Poster Session 6 #39



ActionMesh: Animated 3D Mesh Generation with Temporal 3D Diffusion ActionMesh: Animated 3D Mesh Generation with Temporal 3D Diffusion
Remy Sabathier, David Novotny, Niloy J. Mitra, Tom Monnier
[paper] [code] [video] [demo] [colab]
Topic: 3D Vision
Session: Sun 7 Jun 11:45-13:45 Poster Session 5 #530



agents

NitroGen: An Open Foundation Model for Generalist Gaming Agents 🏆 NitroGen: An Open Foundation Model for Generalist Gaming Agents
Loïc Magne, Anas Awadalla, Guanzhi Wang, Yinzhen Xu, Joshua Belofsky, Fengyuan Hu, Joohwan Kim, Ludwig Schmidt, ...
[paper] [code] [video] [demo]
Topic: Agents
Session: Sat 6 Jun 14:00-15:15 Oral Session 4B #2 | Sat 6 Jun 16:45-18:45 Poster Session 4 #4



depth estimation

Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation
Xin Lin, Meixi Song, Dizhe Zhang, Wenxuan Lu, Haodong Li, Bo Du, Ming-Hsuan Yang, Truong Nguyen, ...
[paper] [code] [demo]
Topic: Depth Estimation
Session: Sat 6 Jun 16:45-18:45 Poster Session 4 #504


generative models

EgoX: Egocentric Video Generation from a Single Exocentric Video
Taewoong Kang, Kinam Kim, Keunwoo Park, Seonghyeon Park, Youngjoon Yu, Seunghoon Hong
[paper] [code]
Topic: Generative Models
Session: Fri 5 Jun 16:00-18:00 Poster Session 2 #366


Back to Basics: Let Denoising Generative Models Denoise Back to Basics: Let Denoising Generative Models Denoise
Tianhong Li, Kaiming He
[paper] [code]
Topic: Generative Models
Session: Sun 7 Jun 11:45-13:45 Poster Session 5 #700



MacTok: Robust Continuous Tokenization for Image Generation 🏆 MacTok: Robust Continuous Tokenization for Image Generation
Hengyu Zeng, Xin Gao, Guanghao Li, Yuxiang Yan, Jiaoyang Ruan, Junpeng Ma, Haoyu Albert Wang, Jian Pu
[paper] [video]
Topic: Generative Models
Session: Sun 7 Jun 15:30-17:30 Poster Session 6 #672



A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens 🏆 A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens
Tommie Kerssies, Gabriele Berton, Ju He, Qihang Yu, Wufei Ma, Daan de Geus, Gijs Dubbelman, Liang-Chieh Chen
[paper] [code] [video] [demo]
Topic: Generative Models
Session: Sat 6 Jun 16:45-18:45 Poster Session 4 #611



image-to-image

ARC Is a Vision Problem! ARC Is a Vision Problem!
Keya Hu, Ali Cy, Linlu Qiu, Xiaoman Delores Ding, Runqian Wang, Yeyin Eva Zhu, Jacob Andreas, Kaiming He
[paper] [code]
Topic: Image-to-Image
Session: Fri 5 Jun 10:45-12:45 Poster Session 1 #234



motion prediction

Envisioning the Future, One Step at a Time Envisioning the Future, One Step at a Time
Stefan Andreas Baumann, Jannik Wiese, Tommaso Martorella, Mahdi M. Kalayeh, Björn Ommer
[paper] [code] [video] [demo]
Topic: Motion Prediction
Session: Fri 5 Jun 10:45-12:45 Poster Session 1 #634



object tracking

V²-SAM: Marrying SAM2 with Multi-Prompt Experts for Cross-View Object Correspondence 🏆 V²-SAM: Marrying SAM2 with Multi-Prompt Experts for Cross-View Object Correspondence
Jiancheng Pan, Runze Wang, Tianwen Qian, Mohammad Mahdi, Yanwei Fu, Xiangyang Xue, Xiaomeng Huang, Luc Van Gool, ...
[paper] [code] [video]
Topic: Object Tracking
Session: Sat 6 Jun 11:45-13:45 Poster Session 3 #248



Real-World Point Tracking with Verifier-Guided Pseudo-Labeling 🏆 Real-World Point Tracking with Verifier-Guided Pseudo-Labeling
Görkay Aydemir, Fatma Güney, Weidi Xie
[paper] [code] [video]
Topic: Object Tracking
Session: Fri 5 Jun 16:00-18:00 Poster Session 2 #593



physical modeling

MSPT: Efficient Large-Scale Physical Modeling via Parallelized Multi-Scale Attention 🏆 MSPT: Efficient Large-Scale Physical Modeling via Parallelized Multi-Scale Attention
Pedro M. P. Curvo, Jan-Willem van de Meent, Maksim Zhdanov
[paper] [code] [video]
Topic: Physical Modeling
Session: Fri 5 Jun 16:00-18:00 Poster Session 2 #534



pose estimation

SAM 3D Body: Robust Full-Body Human Mesh Recovery 🏆 SAM 3D Body: Robust Full-Body Human Mesh Recovery
Xitong Yang, Devansh Kukreja, Don Pinkus, Anushka Sagar, Taosha Fan, Jinhyung Park, Soyong Shin, Jinkun Cao, ...
[paper] [code] [video] [demo]
Topic: Pose Estimation
Session: Fri 5 Jun 13:00-14:15 Oral Session 2A #4 | Fri 5 Jun 16:00-18:00 Poster Session 2 #4



FMPose3D: monocular 3D pose estimation via flow matching FMPose3D: monocular 3D pose estimation via flow matching
Ti Wang, Xiaohang Yu, Mackenzie Weygandt Mathis
[paper] [code] [video]
Topic: Pose Estimation
Session: Sat 6 Jun 11:45-13:45 Poster Session 3 #40



MAMMA: Markerless Accurate Multi-person Motion Acquisition 🏆 MAMMA: Markerless Accurate Multi-person Motion Acquisition
Hanz Cuevas Velasquez, Anastasios Yiannakidis, Soyong Shin, Giorgio Becherini, Markus Höschle, Joachim Tesch, Taylor Obersat, Tsvetelina Alexiadis, ...
[paper] [code] [video]
Topic: Pose Estimation
Session: Fri 5 Jun 13:00-14:15 Oral Session 2A #1 | Fri 5 Jun 16:00-18:00 Poster Session 2 #1



segmentation

VidEoMT: Your ViT is Secretly Also a Video Segmentation Model
Narges Norouzi, Idil Esen Zulfikar, Niccolò Cavagnero, Tommie Kerssies, Bastian Leibe, Gijs Dubbelman, Daan de Geus
[paper] [code] [video]
Topic: Segmentation
Session: Sun 7 Jun 10:15-11:30 Poster Session 5 #611


🏆 MatAnyone 2: Scaling Video Matting via a Learned Quality Evaluator
Peiqing Yang, Shangchen Zhou, Kai Hao, Qingyi Tao
[paper] [code] [video] [demo]
Topic: Segmentation
Session: Sun 7 Jun 15:30-17:30 Poster Session 6


INSID3: Training-Free In-Context Segmentation with DINOv3 📢 INSID3: Training-Free In-Context Segmentation with DINOv3
Claudia Cuttano, Gabriele Trivigno, Christoph Reich, Daniel Cremers, Carlo Masone, Stefan Roth
[paper] [code] [video]
Topic: Segmentation
Session: Sat 6 Jun 14:00-15:15 Oral Session 4D: Visual Segmentation #1 | Sat 6 Jun 16:45-18:45 Poster Session 4 #19



🏆 The SA-FARI Dataset: Segment Anything in Footage of Animals for Recognition and Identification
Dante Francisco Wasmuht, Otto Brookes, Maximillian Schall, Pablo Palencia, Chris Beirne, Tilo Burghardt, Majid Mirmehdi, Hjalmar Kühl, ...
[paper] [demo]
Topic: Segmentation
Session: Sat 6 Jun 14:00-15:15 Oral Session 4D: Visual Segmentation #5 | Sat 6 Jun 16:45-18:45 Poster Session 4 #23


MARCO: Navigating the Unseen Space of Semantic Correspondence 📢 MARCO: Navigating the Unseen Space of Semantic Correspondence
Claudia Cuttano, Gabriele Trivigno, Carlo Masone, Stefan Roth
[paper] [code] [video]
Topic: Segmentation
Session: Sat 6 Jun 14:00-15:15 Oral Session 4D: Visual Segmentation #2 | Sat 6 Jun 16:45-18:45 Poster Session 4 #20



VGGT-Segmentor: Geometry-Enhanced Cross-View Segmentation 🏆 VGGT-Segmentor: Geometry-Enhanced Cross-View Segmentation
Yulu Gao, Bohao Zhang, Zongheng Tang, Jitong Liao, Wenjun Wu, Si Liu
[paper] [video]
Topic: Segmentation
Session: Sat 6 Jun 14:00-15:15 Oral Session 4D: Visual Segmentation #6 | Sat 6 Jun 16:45-18:45 Poster Session 4 #24



Retrieve and Segment: Are a Few Examples Enough to Bridge the Supervision Gap in Open-Vocabulary Segmentation? 🏆 Retrieve and Segment: Are a Few Examples Enough to Bridge the Supervision Gap in Open-Vocabulary Segmentation?
Tilemachos Aravanis, Vladan Stojnic, Bill Psomas, Nikos Komodakis, Giorgos Tolias
[paper] [code]
Topic: Segmentation
Session: Sat 6 Jun 16:45-18:45 Poster Session 4 #578



video understanding

VideoNet: A Large-Scale Dataset for Domain-Specific Action Recognition 🔥 VideoNet: A Large-Scale Dataset for Domain-Specific Action Recognition
Tanush Yadav, Mohammadreza Salehi, Jae Sung Park, Vivek Ramanujan, Hannaneh Hajishirzi, Yejin Choi, Ali Farhadi, Rohun Tripathi, ...
[paper] [code] [video] [demo]
Topic: Video Understanding
Session: Fri 5 Jun 16:00-18:00 Poster Session 2 #530



vision-language models

TIPSv2: Advancing Vision-Language Pretraining with Enhanced Patch-Text Alignment TIPSv2: Advancing Vision-Language Pretraining with Enhanced Patch-Text Alignment
Bingyi Cao, Koert Chen, Kevis-Kokitsi Maninis, Kaifeng Chen, Arjun Karpur, Ye Xia, Sahil Dua, Tanmaya Dabral, ...
[paper] [code] [video] [demo]
Topic: Vision-Language Models
Session: Sun 7 Jun 11:45-13:45 Poster Session 5 #65



Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding 🏆 Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding
Christopher Clark, Jieyu Zhang, Zixian Ma, Jae Sung Park, Mohammadreza Salehi, Rohun Tripathi, Sangho Lee, Zhongzheng Ren, ...
[paper] [code] [video] [demo]
Topic: Vision-Language Models
Session: Sun 7 Jun 10:15-11:30 Oral Session 5A #3 | Sun 7 Jun 11:45-13:45 Poster Session 5 #3



🦸 contribution

We would love your help in making this repository even better! If you know of an amazing paper that isn't listed here, or if you have any suggestions for improvement, feel free to open an issue or submit a pull request.

About

About This repository is a curated collection of the most exciting and influential CVPR 2026 papers. 🔥 [Paper + Code + Demo]

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages