Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance (CVPR 2024)
-
Updated
Jul 25, 2024 - Python
Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance (CVPR 2024)
[CVPR 2024 Oral, Best Paper Award Candidate] Official repository of "PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness"
[CVPR 2024 Award Candidate] Producing and Leveraging Online Map Uncertainty in Trajectory Prediction
This is the official implementation of our CVPR 2024 paper "BlockGCN: Redefine Topology Awareness for Skeleton-Based Action Recognition"
Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)
[CVPR 2024] Adaptive Multi-Modal Cross-Entropy Loss for Stereo Matching
Segment Anything Model for large-scale, vectorized road network extraction from aerial imagery. CVPRW 2024
The official code of "CSTA: CNN-based Spatiotemporal Attention for Video Summarization"
Code for the CVPR 2024 paper highlight and demo "PIGEON: Predicting Image Geolocations".
Official PyTorch code of "Grounded Question-Answering in Long Egocentric Videos", accepted by CVPR 2024.
RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)
The official PyTorch implementation of the IEEE/CVF Computer Vision and Pattern Recognition (CVPR) '24 paper PREGO: online mistake detection in PRocedural EGOcentric videos.
[CVPR2024] The official implementation of "MoCha-Stereo: Motif Channel Attention Network for Stereo Matching”.
[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.
[CVPR 2024] Official implementation of the paper "ReGenNet: Towards Human Action-Reaction Synthesis"
[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
Implementation of "Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring". (Zhang et al., CVPR 2024)
[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"
CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included. ⭐ support visual intelligence development!
Add a description, image, and links to the cvpr2024 topic page so that developers can more easily learn about it.
To associate your repository with the cvpr2024 topic, visit your repo's landing page and select "manage topics."