cvpr2024

Here are 101 public repositories matching this topic...

VinAIResearch / Open3DIS

Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance (CVPR 2024)

3d-point-clouds 3d-instance-segmentation 3d-scene-understanding open-vocabulary cvpr2024

Updated Jul 25, 2024
Python

astra-vision / PaSCo

[CVPR 2024 Oral, Best Paper Award Candidate] Official repository of "PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness"

ensemble uncertainty-estimation lidar-point-cloud semantic-scene-understanding mimo semantic-scene-completion cvpr-oral scene-completion cvpr2024 panoptic-scene-completion

Updated Jul 25, 2024
Python

alfredgu001324 / MapUncertaintyPrediction

[CVPR 2024 Award Candidate] Producing and Leveraging Online Map Uncertainty in Trajectory Prediction

autonomous-driving uncertainty-estimation trajectory-prediction map-estimation motion-prediction cvpr2024

Updated Jul 25, 2024
Python

ZhouYuxuanYX / BlockGCN

This is the official implementation of our CVPR 2024 paper "BlockGCN: Redefine Topology Awareness for Skeleton-Based Action Recognition"

gcn skeleton-based-action-recognition cvpr2024

Updated Jul 25, 2024
Python

navervision / lincir

Official PyTorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)

image-retrieval composed-image-retrieval cvpr2024

Updated Jul 25, 2024
Python

xxxupeng / ADL

[CVPR 2024] Adaptive Multi-Modal Cross-Entropy Loss for Stereo Matching

stereo-matching loss-function cvpr2024

Updated Jul 24, 2024
Python

htcr / sam_road

Segment Anything Model for large-scale, vectorized road network extraction from aerial imagery. CVPRW 2024

computer-vision graph navigation mapping transformers remote-sensing autonomous-driving scene-graph graph-neural-networks graph-representation-learning segmentation-models scene-graph-generation segment-anything cvpr2024

Updated Jul 23, 2024
Python

thswodnjs3 / CSTA

The official code of "CSTA: CNN-based Spatiotemporal Attention for Video Summarization"

deep-neural-networks computer-vision deep-learning video-summarization cnn pytorch video-processing supervised-learning pretrained-models attention-mechanism cvpr video-understanding pytorch-implementation videosummarization cvpr2024

Updated Jul 23, 2024
Python

LukasHaas / PIGEON

Code and demo for the CVPR 2024 highlight paper "PIGEON: Predicting Image Geolocations".

location coordinates clip place geolocalization multimodal geoguessr cvpr2024

Updated Jul 21, 2024
Python

ICTMCG / U-VAP

[CVPR 2024] U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation

cvpr cvpr2024

Updated Jul 21, 2024
Python

Becomebright / GroundVQA

Official PyTorch code of "Grounded Question-Answering in Long Egocentric Videos", accepted by CVPR 2024.

cvpr2024

Updated Jul 18, 2024
Python

robustsam / RobustSAM

RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)

computer-vision deep-learning image-processing sam artificial-intelligence segmentation cvpr iccv eccv segment-anything segment-anything-meta segment-anything-model zero-shot-segmentation cvpr2024

Updated Jul 18, 2024
Python

aleflabo / PREGO

The official PyTorch implementation of the IEEE/CVF Computer Vision and Pattern Recognition (CVPR) '24 paper PREGO: online mistake detection in PRocedural EGOcentric videos.

procedural-learning egocentric-vision mistake-detection cvpr2024

Updated Jul 18, 2024
Python

ZYangChen / MoCha-Stereo

[CVPR 2024] The official implementation of "MoCha-Stereo: Motif Channel Attention Network for Stereo Matching".

stereo-matching cvpr2024

Updated Jul 18, 2024
Python

Open3DA / LL3DA

[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.

gpt language-model multi-modal 3d 3d-models scene-understanding llm instruction-tuning cvpr2024 3d-to-text

Updated Jul 17, 2024
Python

liangxuy / ReGenNet

[CVPR 2024] Official implementation of the paper "ReGenNet: Towards Human Action-Reaction Synthesis"

diffusion-models human-motion-generation cvpr2024 human-human-interaction human-reaction-generation interaction-order

Updated Jul 17, 2024
Python

WisconsinAIVision / ViP-LLaVA

[CVPR 2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

chatbot llama multi-modal clip vision-language gpt-4 foundation-models visual-prompting llava llama2 cvpr2024 gpt-4-vision

Updated Jul 17, 2024
Python

huicongzhang / BSSTNet

Implementation of "Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring". (Zhang et al., CVPR 2024)

video-deblurring pytorch-implementation cvpr2024

Updated Jul 15, 2024
Python

zhengli97 / PromptKD

[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"

clip knowledge-distillation multi-modal-learning prompt-learning vision-language-model cvpr2024

Updated Jul 15, 2024
Python

DmitryRyumin / CVPR-2023-24-Papers

CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included. ⭐ support visual intelligence development!

Updated Jul 15, 2024
Python
