OV-MAP : Open-Vocabulary Zero-Shot 3D Instance Segmentation Map for Robots

Juno Kim^*, Yesol Park^*, Hye-Jung Yoon^* and Byoung-Tak Zhang^‡
Seoul National University

^*Equal contribution. ^‡ Corresponding author.

Abstract

This paper introduces an approach to open-world 3D navigation for mobile robots that integrates open-vocabulary features into the 3D map to improve navigation. We address the challenges of creating zero-shot 3D scene maps in open-world environments, which are hindered by the scarcity of large open-world 3D datasets and by the limitations of current methods that result in 'feature flooding'. Our strategy maps per-instance features to 3D instances, using a class-agnostic segmentation model to project 2D masks into 3D space. Combined with a 3D mask voting mechanism, this process generates zero-shot 3D instance-segmented maps without relying on supervised learning models, enabling more accurate and adaptable open-vocabulary 3D mapping. We validate our approach through extensive experiments on the publicly available ScanNet200 and Replica datasets, demonstrating superior zero-shot performance and effectiveness in diverse environments. We also extend our evaluation to navigation tasks, showing significant improvements in navigation success rates through per-instance querying. Our real-world experiments further attest to the method's adaptability and robustness, indicating its potential for application in a wide range of environments.

Method Overview
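
The abstract describes projecting 2D masks into 3D and resolving them with a 3D mask voting mechanism. The block below is a minimal sketch of that idea, not the code in this repository. It assumes a pinhole camera, depth in metres, points already expressed in a common frame (camera poses are omitted), and mask IDs that have already been associated across frames. The names `backproject`, `voxel_of`, and `vote` are hypothetical and chosen for illustration only.

```python
import math
from collections import Counter, defaultdict


def backproject(pixels, depth, fx, fy, cx, cy):
    """Lift (u, v) pixels to 3D points using a pinhole camera model."""
    points = []
    for u, v in pixels:
        z = depth[v][u]  # depth at row v, column u
        if z <= 0.0:     # zero or negative depth marks an invalid reading
            continue
        points.append(((u - cx) * z / fx, (v - cy) * z / fy, z))
    return points


def voxel_of(point, voxel_size):
    """Quantise a 3D point to an integer voxel index."""
    return tuple(math.floor(c / voxel_size) for c in point)


def vote(frames, intrinsics, voxel_size=0.05):
    """Assign each voxel the instance ID that most frames agree on.

    frames: list of (depth, {instance_id: [(u, v), ...]}) pairs.
    Assumption: instance IDs are already consistent across frames.
    """
    fx, fy, cx, cy = intrinsics
    ballots = defaultdict(Counter)
    for depth, masks in frames:
        for instance_id, pixels in masks.items():
            points = backproject(pixels, depth, fx, fy, cx, cy)
            # Count each voxel once per mask so dense masks do not dominate.
            for voxel in {voxel_of(p, voxel_size) for p in points}:
                ballots[voxel][instance_id] += 1
    return {voxel: counts.most_common(1)[0][0] for voxel, counts in ballots.items()}
```

A real pipeline would also transform each frame's points into the world frame and merge masks whose IDs differ between frames; both steps are left out here to keep the voting logic visible.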

News

[2024-12-25] Our paper has been published by IEEE: OV-MAP.
[2024-6-30] Our paper has been accepted to IROS 2024.

TODO List

Release the main code.
Release the evaluation script.
Code cleaning.
Release the real-world test data.

Installation

Step 1: Create Conda Environment

conda create -n ovmap python=3.8 -y
conda activate ovmap

Step 2: Install Required Libraries

Install Required Libraries

conda install pytorch==1.11.0 torchvision==0.12.0 torchaudio==0.11.0 cudatoolkit=11.3 -c pytorch -y
conda install plyfile -c conda-forge -y
pip install -r requirements.txt
pip install git+https://github.com/facebookresearch/segment-anything.git

Install CUDA (if a local CUDA toolkit is not available)

conda install nvidia/label/cuda-11.3.1::cuda -y

Step 3: Compile PointOps

Usual

cd libs/pointops
python setup.py install
cd ..

Multi GPU Arch

Example compute capabilities: RTX 3090 Ti = 8.6, RTX 8000 = 7.5, A100 = 8.0, H100 = 9.0.
More are listed at: https://developer.nvidia.com/cuda-gpus

cd libs/pointops
# TORCH_CUDA_ARCH_LIST="ARCH LIST" python setup.py install
TORCH_CUDA_ARCH_LIST="8.6" python setup.py install
cd ..
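
When a machine has several GPU types, the value passed to TORCH_CUDA_ARCH_LIST has to list each compute capability. The following is a small sketch of how that string could be assembled; it is not part of this repository. The helper names `arch_list` and `detected_capabilities` are hypothetical, and the lookup table only repeats the values quoted above.

```python
# Compute capabilities quoted in the note above; extend as needed.
KNOWN_CAPABILITIES = {
    "RTX 3090 Ti": (8, 6),
    "RTX 8000": (7, 5),
    "A100": (8, 0),
    "H100": (9, 0),
}


def arch_list(capabilities):
    """Format (major, minor) pairs as a TORCH_CUDA_ARCH_LIST value."""
    unique = sorted(set(capabilities))
    return ";".join(f"{major}.{minor}" for major, minor in unique)


def detected_capabilities():
    """Return capabilities of visible GPUs, or [] if PyTorch or CUDA is unavailable."""
    try:
        import torch
    except ImportError:
        return []
    if not torch.cuda.is_available():
        return []
    return [
        torch.cuda.get_device_capability(i)
        for i in range(torch.cuda.device_count())
    ]


if __name__ == "__main__":
    # Fall back to the README's example GPU when nothing can be detected.
    caps = detected_capabilities() or [KNOWN_CAPABILITIES["RTX 3090 Ti"]]
    print(f'TORCH_CUDA_ARCH_LIST="{arch_list(caps)}" python setup.py install')
```

Duplicates are removed and the entries are sorted, so a machine with two A100s and one RTX 8000 would yield a single value covering both architectures.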

Step 4: Install Detectron2

git clone https://github.com/facebookresearch/detectron2.git
python -m pip install -e detectron2

Step 5: Install Open Query

pip install git+https://github.com/openai/CLIP.git@a9b1bf5920416aaeaec965c25dd9e8f98c864f16 --no-deps
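
The per-instance querying mentioned in the abstract can be pictured as ranking 3D instances by how similar their features are to a text embedding. The sketch below illustrates that ranking with plain Python lists. It is not the repository's open_query code: the function names are hypothetical, the vectors are toy values, and in practice both the instance features and the text feature would be assumed to come from a CLIP-style encoder.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def query_instances(instance_features, text_feature, top_k=1):
    """Rank instance IDs by cosine similarity to a text embedding.

    instance_features: {instance_id: feature_vector}, one vector per 3D instance.
    Returns up to top_k (instance_id, score) pairs, best match first.
    """
    scored = [
        (cosine(feature, text_feature), instance_id)
        for instance_id, feature in instance_features.items()
    ]
    scored.sort(key=lambda item: item[0], reverse=True)
    return [(instance_id, score) for score, instance_id in scored[:top_k]]
```

Because each instance carries a single feature vector, a query returns whole objects rather than scattered points, which is the behaviour the abstract contrasts with 'feature flooding'.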

Step 6: Additional Setup

cp -r CropFormer detectron2/projects
cd detectron2/projects/CropFormer/entity_api/PythonAPI
make
cd ../..
cd mask2former/modeling/pixel_decoder/ops
sh make.sh

Step 7: Install Other Packages

pip install numba==0.58.1 open_clip_torch==2.24.0 pillow==9.3.0
pip install -U openmim
mim install mmcv
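
Several steps above pin exact versions, so it can help to confirm what ended up installed. The following is a minimal sketch using the standard library's importlib.metadata (available from Python 3.8). It is not shipped with this repository, the helper names are hypothetical, and it assumes the distribution names match the names used in the commands above (with the conda package pytorch appearing as torch).

```python
from importlib import metadata

# Versions pinned in the installation steps above.
PINNED = {
    "torch": "1.11.0",
    "torchvision": "0.12.0",
    "torchaudio": "0.11.0",
    "numba": "0.58.1",
    "open_clip_torch": "2.24.0",
    "pillow": "9.3.0",
}


def installed_version(name):
    """Return the installed version of a distribution, or None if it is missing."""
    try:
        return metadata.version(name)
    except metadata.PackageNotFoundError:
        return None


def find_mismatches(pinned, lookup=installed_version):
    """Return {name: (wanted, found)} for packages that are missing or differ."""
    mismatches = {}
    for name, wanted in pinned.items():
        found = lookup(name)
        # Some builds append a local tag such as "+cu113"; compare the public part only.
        if found is None or found.split("+")[0] != wanted:
            mismatches[name] = (wanted, found)
    return mismatches
```

Calling `find_mismatches(PINNED)` inside the ovmap environment returns an empty dictionary when every pinned package matches.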

Acknowledgements

OV-MAP is based on the following repositories: Segment Anything, Pointcept, SAM3D, CropFormer, and OpenMask3D.

