ICCV-2023-Papers

Scene Analysis and Understanding

| Title | Repo | Paper | Video |
|-------|------|-------|-------|
| Generalized Few-Shot Point Cloud Segmentation via Geometric Words | GitHub | thecvf, arXiv | |
| Boosting 3-DoF Ground-to-Satellite Camera Localization Accuracy via Geometry-Guided Cross-View Transformer | GitHub | thecvf, arXiv | |
| EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization | GitHub | thecvf, arXiv | |
| Multi-Task View Synthesis with Neural Radiance Fields | GitHub Page, GitHub | thecvf, arXiv | |
| Multi-Task Learning with Knowledge Distillation for Dense Prediction | | thecvf | |
| Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World | GitHub | thecvf, arXiv | |
| CMDA: Cross-Modality Domain Adaptation for Nighttime Semantic Segmentation | GitHub | thecvf, arXiv | |
| VQA-GNN: Reasoning with Multimodal Knowledge via Graph Neural Networks for Visual Question Answering | | thecvf, arXiv | |
| Disentangle then Parse: Night-Time Semantic Segmentation with Illumination Disentanglement | GitHub | thecvf, arXiv | |
| Visual Traffic Knowledge Graph Generation from Scene Images | WEB Page | thecvf | |
| Agglomerative Transformer for Human-Object Interaction Detection | GitHub | thecvf, arXiv | |
| 3D Neural Embedding Likelihood: Probabilistic Inverse Graphics for Robust 6D Pose Estimation | GitHub Page, GitHub | thecvf, arXiv | |
| HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation | GitHub | thecvf, arXiv | |
| RLIPv2: Fast Scaling of Relational Language-Image Pre-Training | GitHub | thecvf, arXiv | |
| UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase | GitHub | thecvf, arXiv | |
| See More and Know More: Zero-Shot Point Cloud Segmentation via Multi-Modal Visual Data | GitHub | thecvf, arXiv | |
| Compositional Feature Augmentation for Unbiased Scene Graph Generation | GitHub | thecvf, arXiv | |
| Multi-Weather Image Restoration via Domain Translation | GitHub | thecvf | |
| CLIPTER: Looking at the Bigger Picture in Scene Text Recognition | | thecvf, arXiv | |
| Towards Models that Can See and Read | | thecvf, arXiv | |
| SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving | GitHub Page, GitHub | thecvf, arXiv | |
| DDP: Diffusion Model for Dense Visual Prediction | GitHub | thecvf, arXiv | |
| Understanding 3D Object Interaction from a Single Image | GitHub Page, GitHub | thecvf, arXiv | YouTube |
| ObjectSDF++: Improved Object-Compositional Neural Implicit Surfaces | WEB Page, GitHub | thecvf, arXiv | YouTube |
| Improving Equivariance in State-of-the-Art Supervised Depth and Normal Predictors | GitHub | thecvf, arXiv | |
| CrossMatch: Source-Free Domain Adaptive Semantic Segmentation via Cross-Modal Consistency Training | | thecvf | |
| Semantic Attention Flow Fields for Monocular Dynamic Scene Decomposition | WEB Page | thecvf, arXiv | |
| Holistic Geometric Feature Learning for Structured Reconstruction | GitHub | thecvf, arXiv | |
| Scalable Multi-Temporal Remote Sensing Change Data Generation via Simulating Stochastic Change Process | GitHub | thecvf, arXiv | |
| TaskExpert: Dynamically Assembling Multi-Task Representations with Memorial Mixture-of-Experts | | thecvf, arXiv | |
| Thinking Image Color Aesthetics Assessment: Models, Datasets and Benchmarks | GitHub | thecvf | |
| STEERER: Resolving Scale Variations for Counting and Localization via Selective Inheritance Learning | GitHub | thecvf, arXiv | |
| Object-Aware Gaze Target Detection | GitHub | thecvf, arXiv | |
| Weakly Supervised Referring Image Segmentation with Intra-Chunk and Inter-Chunk Consistency | | thecvf | |
| Vision Relation Transformer for Unbiased Scene Graph Generation | GitHub | thecvf, arXiv | YouTube |
| DDIT: Semantic Scene Completion via Deformable Deep Implicit Templates | | thecvf | |
| DQS3D: Densely-Matched Quantization-Aware Semi-Supervised 3D Detection | GitHub | thecvf, arXiv | |
| Shape Anchor Guided Holistic Indoor Scene Understanding | GitHub | thecvf, arXiv | |
| SGAligner: 3D Scene Alignment with Scene Graphs | WEB Page, GitHub | thecvf, arXiv | YouTube |
| Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation | WEB Page, GitHub | thecvf, arXiv | YouTube |