- Zero-TPrune: Zero-Shot Token Pruning through Leveraging of the Attention Graph in Pre-Trained Transformers
- FlowerFormer: Empowering Neural Architecture Encoding using a Flow-aware Graph Transformer
- Improving Graph Contrastive Learning via Adaptive Positive Sampling
- CLIP-Driven Open-Vocabulary 3D Scene Graph Generation via Cross-Modality Contrastive Learning
- DSGG: Dense Relation Transformer for an End-to-end Scene Graph Generation
- EGTR: Extracting Graph from Transformer for Scene Graph Generation
- HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation
- HIG: Hierarchical Interlacement Graph Approach to Scene Graph Generation in Video Understanding
- OED: Towards One-stage End-to-End Dynamic Scene Graph Generation
- LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation
- From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models
- Leveraging Predicate and Triplet Learning for Scene Graph Generation
- SG-PGM: Partial Graph Matching Network with Semantic Geometric Fusion for 3D Scene Graph Alignment and Its Downstream Tasks
- Action Scene Graphs for Long-Form Understanding of Egocentric Videos
- GraphDreamer: Compositional 3D Scene Synthesis from Scene Graphs
- Open3DSG: Open-Vocabulary 3D Scene Graphs from Point Clouds with Queryable Objects and Open-Set Relationships
- Multi-Level Neural Scene Graphs for Dynamic Urban Environments
- GLiDR: Topologically Regularized Graph Generative Network for Sparse LiDAR Point Clouds
- DGC-GNN: Leveraging Geometry and Color Cues for Visual Descriptor-Free 2D-3D Matching
- Denoising Point Cloud in Latent Space via Graph Convolution and Invertible Neural Network
- Dynamic Graph Representation with Knowledge-aware Attention for Histopathology Whole Slide Image Analysis
- GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs
- HHMR: Holistic Hand Mesh Recovery by Enhancing the Multimodal Controllability of Graph Diffusion Models
- Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs
- DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly
- MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation
- CAGE: Controllable Articulation GEneration
- FC-GNN: Recovering Reliable and Accurate Correspondences from Interferences
- Generating Handwritten Mathematical Expressions From Symbol Graphs: An End-to-End Pipeline
- BlockGCN: Redefine Topology Awareness for Skeleton-Based Action Recognition
- Constrained Layout Generation with Factor Graphs
- Domain Separation Graph Neural Networks for Saliency Object Ranking
- Improving Out-of-Distribution Generalization in Graphs via Hierarchical Semantic Environments
- SignGraph: A Sign Sequence is Worth Graphs of Nodes
- Image Processing GNN: Breaking Rigidity in Super-Resolution
- Tumor Micro-environment Interactions Guided Graph Learning for Survival Analysis of Human Cancers from Whole-slide Pathological Images
- Bezier Everywhere All at Once: Learning Drivable Lanes as Bezier Graphs
- Learning Structure-from-Motion with Graph Attention Networks
- The Audio-Visual Conversational Graph: From an Egocentric-Exocentric Perspective
- MemoNav: Working Memory Model for Visual Navigation
- Adaptive Hyper-graph Aggregation for Modality-Agnostic Federated Learning
- Neural Markov Random Field for Stereo Matching