Publications in CVPR 2024

Transformers

Zero-TPrune: Zero-Shot Token Pruning through Leveraging of the Attention Graph in Pre-Trained Transformers
FlowerFormer: Empowering Neural Architecture Encoding using a Flow-aware Graph Transformer

Contrastive Learning

Improving Graph Contrastive Learning via Adaptive Positive Sampling
CLIP-Driven Open-Vocabulary 3D Scene Graph Generation via Cross-Modality Contrastive Learning

Scene Graph Generation

DSGG: Dense Relation Transformer for an End-to-end Scene Graph Generation
EGTR: Extracting Graph from Transformer for Scene Graph Generation
HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation
HIG: Hierarchical Interlacement Graph Approach to Scene Graph Generation in Video Understanding
OED: Towards One-stage End-to-End Dynamic Scene Graph Generation
LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation
From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models
Leveraging Predicate and Triplet Learning for Scene Graph Generation

Scene Graphs

SG-PGM: Partial Graph Matching Network with Semantic Geometric Fusion for 3D Scene Graph Alignment and Its Downstream Tasks
Action Scene Graphs for Long-Form Understanding of Egocentric Videos
GraphDreamer: Compositional 3D Scene Synthesis from Scene Graphs
Open3DSG: Open-Vocabulary 3D Scene Graphs from Point Clouds with Queryable Objects and Open-Set Relationships
Multi-Level Neural Scene Graphs for Dynamic Urban Environments

Point Clouds

GLiDR: Topologically Regularized Graph Generative Network for Sparse LiDAR Point Clouds
DGC-GNN: Leveraging Geometry and Color Cues for Visual Descriptor-Free 2D-3D Matching
Denoising Point Cloud in Latent Space via Graph Convolution and Invertible Neural Network

Dynamic Graphs

Dynamic Graph Representation with Knowledge-aware Attention for Histopathology Whole Slide Image Analysis
GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs

Diffusion

HHMR: Holistic Hand Mesh Recovery by Enhancing the Multimodal Controllability of Graph Diffusion Models
Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs
DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly

3D Data

MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation
CAGE: Controllable Articulation GEneration

Miscellaneous

FC-GNN: Recovering Reliable and Accurate Correspondences from Interferences
Generating Handwritten Mathematical Expressions From Symbol Graphs: An End-to-End Pipeline
BlockGCN: Redefine Topology Awareness for Skeleton-Based Action Recognition
Constrained Layout Generation with Factor Graphs
Domain Separation Graph Neural Networks for Saliency Object Ranking
Improving Out-of-Distribution Generalization in Graphs via Hierarchical Semantic Environments
SignGraph: A Sign Sequence is Worth Graphs of Nodes
Image Processing GNN: Breaking Rigidity in Super-Resolution
Tumor Micro-environment Interactions Guided Graph Learning for Survival Analysis of Human Cancers from Whole-slide Pathological Images
Bezier Everywhere All at Once: Learning Drivable Lanes as Bezier Graphs
Learning Structure-from-Motion with Graph Attention Networks
The Audio-Visual Conversational Graph: From an Egocentric-Exocentric Perspective
MemoNav: Working Memory Model for Visual Navigation
Adaptive Hyper-graph Aggregation for Modality-Agnostic Federated Learning
Neural Markov Random Field for Stereo Matching

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Publications in CVPR 2024

Transformers

Contrastive Learning

Scene Graph Generation

Scene Graphs

Point Clouds

Dynamic Graphs

Diffusion

3D Data

Miscellaneous

Files

README.md

Latest commit

History

README.md

File metadata and controls

Publications in CVPR 2024

Transformers

Contrastive Learning

Scene Graph Generation

Scene Graphs

Point Clouds

Dynamic Graphs

Diffusion

3D Data

Miscellaneous