Skip to content

Latest commit

 

History

History
93 lines (88 loc) · 37.5 KB

scene-analysis-and-understanding.md

File metadata and controls

93 lines (88 loc) · 37.5 KB

CVPR-2023-Papers

Application App
New collections Conference

Scene Analysis and Understanding

Section Papers Preprint Papers Papers with Open Code Papers with Video

Title Repo Paper Video
You Only Segment Once: Towards Real-Time Panoptic Segmentation GitHub thecvf
arXiv
YouTube
IS-GGT: Iterative Scene Graph Generation with Generative Transformers GitHub Page thecvf YouTube
Disentangling Orthogonal Planes for Indoor Panoramic Room Layout Estimation with Cross-Scale Distortion Awareness GitHub thecvf
arXiv
Panoptic Video Scene Graph Generation GitHub Page
GitHub
thecvf
arXiv
YouTube
3D Spatial Multimodal Knowledge Accumulation for Scene Graph Prediction in Point Cloud thecvf YouTube
JacobiNeRF: NeRF Shaping with Mutual Information Gradients GitHub Page
GitHub
thecvf
arXiv
YouTube
Learning Geometric-Aware Properties in 2D Representation using Lightweight CAD Models, or Zero Real 3D Pairs GitHub Page thecvf YouTube
Learning and Aggregating Lane Graphs for Urban Automated Driving WEB Page
GitHub
thecvf
arXiv
YouTube
MIME: Human-Aware 3D Scene Generation WEB Page
GitHub
thecvf
arXiv
Connecting the Dots: Floorplan Reconstruction using Two-Level Queries GitHub Page
GitHub
thecvf
arXiv
YouTube
NeRF-RPN: A General Framework for Object Detection in NeRFs GitHub thecvf
arXiv
YouTube
Relational Context Learning for Human-Object Interaction Detection WEB Page
GitHub
thecvf
arXiv
YouTube
Symmetric Shape-Preserving Autoencoder for Unsupervised Real Scene Point Cloud Completion GitHub thecvf YouTube
Token Contrast for Weakly-Supervised Semantic Segmentation GitHub thecvf
arXiv
MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled Consistency GitHub Page
GitHub
thecvf
arXiv
YouTube
Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation GitHub Page
GitHub
thecvf
arXiv
YouTube
CLIP2Scene: Towards Label-Efficient 3D Scene Understanding by CLIP GitHub thecvf
arXiv
YouTube
Multispectral Video Semantic Segmentation: A Benchmark Dataset and Baseline GitHub Page
GitHub
thecvf
Optimal Transport Minimization: Crowd Localization on Density Maps for Semi-Supervised Counting
CVPR - Highlight
GitHub thecvf YouTube
Indiscernible Object Counting in Underwater Scenes GitHub thecvf
arXiv
YouTube
Long Range Pooling for 3D Large-Scale Scene Understanding GitHub thecvf
arXiv
Delivering Arbitrary-Modal Semantic Segmentation GitHub Page
GitHub
thecvf
arXiv
YouTube
Images Speak in Images: A Generalist Painter for In-Context Visual Learning GitHub Page
GitHub
thecvf
arXiv
SCPNet: Semantic Scene Completion on Point Cloud
CVPR - Highlight
GitHub thecvf
arXiv
YouTube
Content-Aware Token Sharing for Efficient Semantic Segmentation with Vision Transformers GitHub Page
GitHub
thecvf
arXiv
YouTube
OpenScene: 3D Scene Understanding with Open Vocabularies GitHub Page
GitHub
thecvf
arXiv
YouTube
Devil's on the Edges: Selective Quad Attention for Scene Graph Generation WEB Page
GitHub
thecvf
arXiv
YouTube
Delving into Shape-Aware Zero-Shot Semantic Segmentation GitHub thecvf
arXiv
YouTube
Category Query Learning for Human-Object Interaction Classification GitHub thecvf
arXiv
YouTube
Nerflets: Local Radiance Fields for Efficient Structure-Aware 3D Scene Representation from 2D Supervision GitHub Page thecvf
arXiv
YouTube
DejaVu: Conditional Regenerative Learning to Enhance Dense Prediction thecvf
arXiv
SCOOP: Self-Supervised Correspondence and Optimization-based Scene Flow GitHub Page
GitHub
thecvf
arXiv
YouTube
Incremental 3D Semantic Scene Graph Prediction from RGB Sequences GitHub Page
GitHub
thecvf
arXiv
YouTube
PanelNet: Understanding 360 Indoor Environment via Panel Representation thecvf
arXiv
YouTube
Perspective Fields for Single Image Camera Calibration
CVPR - Highlight
GitHub Page
GitHub
thecvf
arXiv
YouTube
Open-Category Human-Object Interaction Pre-Training via Language Modeling Framework thecvf
Fast Contextual Scene Graph Generation with Unbiased Context Augmentation GitHub thecvf
Diffusion-based Generation, Optimization, and Planning in 3D Scenes GitHub Page
GitHub
thecvf
arXiv
YouTube
TopNet: Transformer-based Object Placement Network for Image Compositing thecvf
arXiv
YouTube
Computational Flash Photography through Intrinsics GitHub Page
GitHub
thecvf
arXiv
YouTube
Probing Neural Representations of Scene Perception in a Hippocampally Dependent Task using Artificial Neural Networks thecvf
arXiv
DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting GitHub thecvf
arXiv
LEGO-Net: Learning Regular Rearrangements of Objects in Rooms WEB Page
GitHub
thecvf
arXiv
YouTube
Open-Vocabulary Point-Cloud Object Detection without 3D Annotation GitHub thecvf
arXiv
YouTube
Weakly-Supervised Domain Adaptive Semantic Segmentation with Prototypical Contrastive Learning GitHub thecvf YouTube
ScanDMM: A Deep Markov Model of Scanpath Prediction for 360° Images GitHub thecvf YouTube
Canonical Fields: Self-Supervised Learning of Pose-Canonicalized Neural Fields
CVPR - Highlight
WEB Page
GitHub
thecvf
arXiv
YouTube
TempSAL - Uncovering Temporal Information for Deep Saliency Prediction GitHub Page
GitHub
thecvf
arXiv
YouTube
Probabilistic Debiasing of Scene Graphs GitHub thecvf
arXiv
YouTube
Towards Unified Scene Text Spotting based on Sequence Generation GitHub thecvf
arXiv
Learning to Generate Language-Supervised and Open-Vocabulary Scene Graph using Pre-trained Visual-Semantic Space GitHub thecvf YouTube
Modular Memorability: Tiered Representations for Video Memorability Prediction GitHub thecvf YouTube
Where we are and what we're Looking at: Query based Worldwide Image Geo-Localization using Hierarchies and Scenes GitHub thecvf
arXiv
YouTube
HRDFuse: Monocular 360° Depth Estimation by Collaboratively Learning Holistic-with-Regional Depth Distributions GitHub Page
GitHub
thecvf
arXiv
YouTube