Lists (8)
Sort Name ascending (A-Z)
Starred repositories
[CVPR 2023] CLIP is Also an Efficient Segmenter: A Text-Driven Approach for Weakly Supervised Semantic Segmentation
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Recent weakly supervised semantic segmentation paper
Segment Anything in High Quality [NeurIPS 2023]
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
Official Implementation of "Towards Open-Vocabulary Semantic Segmentation without Semantic Labels" (NeurIPS 2024)
Tracking and collecting papers/projects/others related to Segment Anything.
This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).
The Codes and Data of A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection [ICLR'25]
[CVPR 2022] "MonoScene: Monocular 3D Semantic Scene Completion": 3D Semantic Occupancy Prediction from a single image
LiDAR-NeRF: Novel LiDAR View Synthesis via Neural Radiance Fields
A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.
A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..
ImOV3D: Learning Open Vocabulary Point Clouds 3D Object Detection from Only 2D Images (NeurIPS2024)
KITTI Object Visualization (Birdview, Volumetric LiDar point cloud )
[ECCV 2024] Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting
Unsupervised Scale-consistent Depth Learning from Video (IJCV2021 & NeurIPS 2019)
TensorFlow Implementation for Computing a Semantically Segmented Bird's Eye View (BEV) Image Given the Images of Multiple Vehicle-Mounted Cameras.
Official PyTorch implementation of Superpoint Transformer introduced in [ICCV'23] "Efficient 3D Semantic Segmentation with Superpoint Transformer" and SuperCluster introduced in [3DV'24 Oral] "Scal…
[CVPR 2024] Official PyTorch Code of SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects
This repository contains utility scripts for the KITTI-360 dataset.
An official code release of our CVPR'23 paper, BEVHeight
使用open3d显示kitti数据的3D视角和BEV视角
Codes for RoadBEV: road surface reconstruction in Bird's Eye View
PyTorch code and models for the DINOv2 self-supervised learning method.
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
[CVPR 2024] Real-Time Open-Vocabulary Object Detection