Skip to content

Latest commit

 

History

History
178 lines (173 loc) · 84.6 KB

recognition-categorization-detection-retrieval.md

File metadata and controls

178 lines (173 loc) · 84.6 KB

CVPR-2023-Papers

Application App
New collections Conference

Recognition: Categorization, Detection, Retrieval

Section Papers Preprint Papers Papers with Open Code Papers with Video

Title Repo Paper Video
R2Former: Unified Retrieval and Reranking Transformer for Place Recognition GitHub thecvf
arXiv
YouTube
Mask-Free OVIS: Open-Vocabulary Instance Segmentation Without Manual Mask Annotations GitHub thecvf
arXiv
YouTube
StructVPR: Distill Structural Knowledge With Weighting Samples for Visual Place Recognition thecvf
arXiv
YouTube
MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining GitHub thecvf
arXiv
YouTube
One-to-Few Label Assignment for End-to-End Dense Detection GitHub thecvf
arXiv
YouTube
Where Is My Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization GitHub thecvf
arXiv
YouTube
Semi-DETR: Semi-Supervised Object Detection With Detection Transformers GitHub thecvf
arXiv
Universal Instance Perception As Object Discovery and Retrieval GitHub thecvf
arXiv
CAT: LoCalization and IdentificAtion Cascade Detection Transformer for Open-World Object Detection thecvf
arXiv
YouTube
Phase-Shifting Coder: Predicting Accurate Orientation in Oriented Object Detection GitHub thecvf
arXiv
FrustumFormer: Adaptive Instance-Aware Resampling for Multi-View 3D Detection GitHub thecvf
arXiv
YouTube
Box-Level Active Detection
CVPR - Highlight
GitHub thecvf
arXiv
YouTube
Learning With Noisy Labels via Self-Supervised Adversarial Noisy Masking GitHub thecvf
arXiv
Ambiguity-Resistant Semi-Supervised Learning for Dense Object Detection GitHub thecvf
arXiv
YouTube
Aligning Bag of Regions for Open-Vocabulary Object Detection GitHub thecvf
arXiv
YouTube
Asymmetric Feature Fusion for Image Retrieval thecvf
arXiv
3D Video Object Detection With Learnable Object-Centric Global Optimization GitHub thecvf
arXiv
YouTube
Enhanced Training of Query-Based Object Detection via Selective Query Recollection GitHub thecvf
arXiv
YouTube
Dense Distinct Query for End-to-End Object Detection GitHub thecvf
arXiv
YouTube
On-the-Fly Category Discovery GitHub thecvf YouTube
ProD: Prompting-To-Disentangle Domain Knowledge for Cross-Domain Few-Shot Image Classification thecvf
Q-DETR: An Efficient Low-Bit Quantized Detection Transformer
CVPR - Highlight
GitHub thecvf
arXiv
SAP-DETR: Bridging the Gap Between Salient Points and Queries-Based Transformer Detector for Fast Model Convergency GitHub thecvf
arXiv
YouTube
An Erudite Fine-Grained Visual Classification Model GitHub thecvf YouTube
Self-Supervised Implicit Glyph Attention for Text Recognition GitHub thecvf
arXiv
Multi-View Adversarial Discriminator: Mine the Non-Causal Factors for Object Detection in Unseen Domains
CVPR - Highlight
GitHub thecvf
arXiv
YouTube
HIER: Metric Learning Beyond Class Labels via Hierarchical Regularization GitHub thecvf
arXiv
YouTube
DSVT: Dynamic Sparse Voxel Transformer With Rotated Sets GitHub thecvf
arXiv
YouTube
Progressive Semantic-Visual Mutual Adaption for Generalized Zero-Shot Learning
CVPR - Highlight
GitHub thecvf
arXiv
Fake It Till You Make It: Learning Transferable Representations From Synthetic ImageNet Clones WEB Page thecvf
arXiv
YouTube
FFF: Fragment-Guided Flexible Fitting for Building Complete Protein Structures thecvf
arXiv
Revisiting Self-Similarity: Structural Embedding for Image Retrieval GitHub thecvf YouTube
Neural Koopman Pooling: Control-Inspired Temporal Dynamics Encoding for Skeleton-Based Action Recognition GitHub thecvf
MixTeacher: Mining Promising Labels With Mixed Scale Teacher for Semi-Supervised Object Detection GitHub thecvf
arXiv
YouTube
Learning Attention As Disentangler for Compositional Zero-Shot Learning GitHub thecvf
arXiv
YouTube
Towards Building Self-Aware Object Detectors via Reliable Uncertainty Quantification and Calibration GitHub thecvf
arXiv
Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection GitHub thecvf
arXiv
YouTube
SOOD: Towards Semi-Supervised Oriented Object Detection GitHub thecvf
arXiv
YouTube
Bias-Eliminating Augmentation Learning for Debiased Federated Learning thecvf
Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors GitHub thecvf
arXiv
YouTube
AsyFOD: An Asymmetric Adaptation Paradigm for Few-Shot Domain Adaptive Object Detection GitHub thecvf YouTube
CORA: Adapting CLIP for Open-Vocabulary Detection With Region Prompting and Anchor Pre-Matching GitHub thecvf
arXiv
YouTube
Explicit Boundary Guided Semi-Push-Pull Contrastive Learning for Supervised Anomaly Detection GitHub thecvf
arXiv
YouTube
Disentangled Representation Learning for Unsupervised Neural Quantization thecvf
YOLOv7: Trainable Bag-of-Freebies Sets New State-of-the-Art for Real-Time Object Detectors GitHub thecvf
arXiv
Virtual Sparse Convolution for Multimodal 3D Object Detection GitHub thecvf
arXiv
YouTube
TranSG: Transformer-Based Skeleton Graph Prototype Contrastive Learning With Structure-Trajectory Prompted Reconstruction for Person Re-Identification GitHub thecvf
arXiv
YouTube
Adaptive Sparse Pairwise Loss for Object Re-Identification GitHub thecvf
arXiv
Multi-Granularity Archaeological Dating of Chinese Bronze Dings Based on a Knowledge-Guided Relation Graph GitHub thecvf
arXiv
YouTube
Event-Guided Person Re-Identification via Sparse-Dense Complementary Learning GitHub thecvf
Vector Quantization With Self-Attention for Quality-Independent Representation Learning GitHub thecvf YouTube
Siamese Image Modeling for Self-Supervised Vision Representation Learning GitHub thecvf
arXiv
YouTube
FCC: Feature Clusters Compression for Long-Tailed Visual Recognition GitHub thecvf YouTube
Towards All-in-One Pre-Training via Maximizing Multi-Modal Mutual Information GitHub thecvf
arXiv
YouTube
Soft Augmentation for Image Classification GitHub thecvf
arXiv
Correspondence Transformers With Asymmetric Feature Learning and Matching Flow Super-Resolution GitHub thecvf
Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning With Multimodal Models GitHub thecvf
arXiv
Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning GitHub thecvf
arXiv
YouTube
Glocal Energy-Based Learning for Few-Shot Open-Set Recognition GitHub thecvf
arXiv
YouTube
Improving Image Recognition by Retrieving From Web-Scale Image-Text Data thecvf
arXiv
Deep Factorized Metric Learning GitHub thecvf YouTube
Learning To Detect and Segment for Open Vocabulary Object Detection thecvf
arXiv
ConQueR: Query Contrast Voxel-DETR for 3D Object Detection
CVPR - Highlight
WEB Page thecvf
arXiv
YouTube
Photo Pre-Training, but for Sketch GitHub thecvf YouTube
InternImage: Exploring Large-Scale Vision Foundation Models With Deformable Convolutions GitHub thecvf
arXiv
YouTube
Detecting Everything in the Open World: Towards Universal Object Detection GitHub thecvf
arXiv
YouTube
Twin Contrastive Learning With Noisy Labels GitHub thecvf
arXiv
Feature Aggregated Queries for Transformer-Based Video Object Detectors GitHub thecvf
arXiv
YouTube
Learning on Gradients: Generalized Artifacts Representation for GAN-Generated Images Detection GitHub thecvf YouTube
Deep Hashing With Minimal-Distance-Separated Hash Centers thecvf YouTube
Knowledge Combination To Learn Rotated Detection Without Rotated Annotation GitHub thecvf
arXiv
YouTube
Good Is Bad: Causality Inspired Cloth-Debiasing for Cloth-Changing Person Re-Identification GitHub thecvf YouTube
Discriminating Known From Unknown Objects via Structure-Enhanced Recurrent Variational AutoEncoder thecvf YouTube
2PCNet: Two-Phase Consistency Training for Day-to-Night Unsupervised Domain Adaptive Object Detection GitHub thecvf
arXiv
LINe: Out-of-Distribution Detection by Leveraging Important Neurons GitHub thecvf
arXiv
YouTube
Progressive Transformation Learning for Leveraging Virtual Images in Training
CVPR - Highlight
WEB Page thecvf
arXiv
YouTube
Instance Relation Graph Guided Source-Free Domain Adaptive Object Detection GitHub thecvf
arXiv
YouTube
Decoupling MaxLogit for Out-of-Distribution Detection GitHub thecvf YouTube
Pixels, Regions, and Objects: Multiple Enhancement for Salient Object Detection GitHub thecvf
Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding thecvf
arXiv
YouTube
BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision
CVPR - Highlight
thecvf
arXiv
YouTube
D2Former: Jointly Learning Hierarchical Detectors and Contextual Descriptors via Agent-Based Transformers thecvf
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining thecvf
arXiv
YouTube
Mapping Degeneration Meets Label Evolution: Learning Infrared Small Target Detection With Single Point Supervision GitHub thecvf
arXiv
YouTube
Generalized UAV Object Detection via Frequency Domain Disentanglement thecvf YouTube
Deep Frequency Filtering for Domain Generalization thecvf
arXiv
YouTube
Adaptive Sparse Convolutional Networks With Global Context Enhancement for Faster Object Detection on Drone Images GitHub thecvf
arXiv
YouTube
Improved Test-Time Adaptation for Domain Generalization GitHub thecvf
arXiv
Matching Is Not Enough: A Two-Stage Framework for Category-Agnostic Pose Estimation
CVPR - Highlight
GitHub thecvf YouTube
Recurrence Without Recurrence: Stable Video Landmark Detection With Deep Equilibrium Models GitHub thecvf
arXiv
YouTube
VLPD: Context-Aware Pedestrian Detection via Vision-Language Semantic Self-Supervision GitHub thecvf
arXiv
YouTube
DETRs With Hybrid Matching GitHub thecvf
arXiv
YouTube
Query-Dependent Video Representation for Moment Retrieval and Highlight Detection GitHub thecvf
arXiv
YouTube
Clothing-Change Feature Augmentation for Person Re-Identification GitHub thecvf YouTube
Learning Attribute and Class-Specific Representation Duet for Fine-Grained Fashion Analysis GitHub thecvf YouTube
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks
CVPR - Highlight
GitHub thecvf
arXiv
YouTube
Optimal Proposal Learning for Deployable End-to-End Pedestrian Detection thecvf
DynamicDet: A Unified Dynamic Architecture for Object Detection GitHub thecvf
arXiv
YouTube
Switchable Representation Learning Framework With Self-Compatibility thecvf
arXiv
YouTube
DATE: Domain Adaptive Product Seeker for E-Commerce GitHub thecvf
arXiv
PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery GitHub thecvf
arXiv
YouTube
Dynamic Neural Network for Multi-Task Learning Searching Across Diverse Network Topologies thecvf
arXiv
OvarNet: Towards Open-Vocabulary Object Attribute Recognition GitHub Page
GitHub
thecvf
arXiv
YouTube
HOICLIP: Efficient Knowledge Transfer for HOI Detection With Vision-Language Models GitHub thecvf
arXiv
YouTube
Learning From Noisy Labels With Decoupled Meta Label Purifier GitHub thecvf
arXiv
A Light Touch Approach to Teaching Transformers Multi-View Geometry thecvf
arXiv
YouTube
OpenMix: Exploring Outlier Samples for Misclassification Detection
CVPR - Highlight
GitHub thecvf
arXiv
YouTube
Revisiting Reverse Distillation for Anomaly Detection GitHub thecvf YouTube
PROB: Probabilistic Objectness for Open World Object Detection GitHub thecvf
arXiv
YouTube
Equiangular Basis Vectors GitHub thecvf
arXiv
Weakly Supervised Posture Mining for Fine-Grained Classification GitHub thecvf
An Actor-Centric Causality Graph for Asynchronous Temporal Inference in Group Activity thecvf
Weak-Shot Object Detection Through Mutual Knowledge Transfer thecvf
Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style
CVPR - Highlight
GitHub thecvf
arXiv
YouTube
Exploring Structured Semantic Prior for Multi Label Recognition With Incomplete Labels GitHub thecvf
arXiv
YouTube
Learning Partial Correlation Based Deep Visual Representation for Image Classification GitHub Page
GitHub
thecvf
arXiv
YouTube
Boundary-Aware Backward-Compatible Representation via Adversarial Learning in Image Retrieval GitHub thecvf
arXiv
YouTube
PHA: Patch-Wise High-Frequency Augmentation for Transformer-Based Person Re-Identification
CVPR - Highlight
GitHub thecvf
Unknown Sniffer for Object Detection: Don't Turn a Blind Eye to Unknown Objects GitHub thecvf
arXiv
YouTube
BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised Instance Segmentation GitHub thecvf
arXiv
YouTube
Annealing-Based Label-Transfer Learning for Open World Object Detection GitHub thecvf YouTube
Diversity-Measurable Anomaly Detection GitHub thecvf
arXiv
YouTube
Recurrent Vision Transformers for Object Detection With Event Cameras GitHub thecvf
arXiv
YouTube
AShapeFormer: Semantics-Guided Object-Level Active Shape Encoding for 3D Object Detection via Transformers GitHub thecvf YouTube
Ranking Regularization for Critical Rare Classes: Minimizing False Positives at a High True Positive Rate thecvf
arXiv
YouTube
Contrastive Mean Teacher for Domain Adaptive Object Detectors GitHub thecvf
arXiv
YouTube
Bridging the Gap Between Model Explanations in Partially Annotated Multi-Label Classification GitHub thecvf
arXiv
YouTube
PartMix: Regularization Strategy To Learn Part Discovery for Visible-Infrared Person Re-Identification thecvf
arXiv
BiasAdv: Bias-Adversarial Augmentation for Model Debiasing thecvf YouTube
ViPLO: Vision Transformer Based Pose-Conditioned Self-Loop Graph for Human-Object Interaction Detection GitHub thecvf
arXiv
Robust 3D Shape Classification via Non-Local Graph Attention Network thecvf YouTube
Two-Way Multi-Label Loss
CVPR - Highlight
GitHub thecvf YouTube
Normalizing Flow Based Feature Synthesis for Outlier-Aware Object Detection
CVPR - Highlight
GitHub thecvf
arXiv
YouTube
Object Detection With Self-Supervised Scene Adaptation GitHub thecvf YouTube
Data-Efficient Large Scale Place Recognition With Graded Similarity Supervision GitHub thecvf
arXiv
YouTube
Generating Features With Increased Crop-Related Diversity for Few-Shot Object Detection thecvf
arXiv
Recognizing Rigid Patterns of Unlabeled Point Clouds by Complete and Continuous Isometry Invariants With No False Negatives and No False Positives thecvf
arXiv
Deep Semi-Supervised Metric Learning With Mixed Label Propagation thecvf
Fine-Grained Classification With Noisy Labels thecvf
arXiv
YouTube