
SpatialTracker: Tracking Any 2D Pixels in 3D Space

SpatialTracker: Tracking Any 2D Pixels in 3D Space,
Yuxi Xiao*, Qianqian Wang*, Shangzhan Zhang, Nan Xue, Sida Peng, Yujun Shen, Xiaowei Zhou,
CVPR 2024, Highlight Paper (arXiv)

News and ToDo

  • Release SpatialTracker-v2 (coming soon).
  • Release HuggingFace/Gradio demo.
  • Release SpatialTracker inference code and checkpoints (expected in late April).
  • 05.04.2024: SpatialTracker is selected as a Highlight Paper!
  • 26.02.2024: SpatialTracker is accepted at CVPR 2024!

Requirements

The inference code was tested on

  • Ubuntu 20.04
  • Python 3.10
  • PyTorch 2.1.1
  • PyTorch Lightning 2.2.1
  • 1 NVIDIA GPU (RTX A6000) with CUDA 11.8. (Other GPUs also work; 22 GB of GPU memory is sufficient for dense tracking of ~10k points with our code.)
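
A quick sanity check that your environment matches the tested setup (a minimal sketch, not part of the repo's scripts; nearby versions may also work):

import torch
import pytorch_lightning as pl

# Confirm the tested versions and that CUDA is visible.
print("PyTorch:", torch.__version__)            # tested with 2.1.1
print("Lightning:", pl.__version__)             # tested with 2.2.1
print("CUDA available:", torch.cuda.is_available())
print("CUDA version:", torch.version.cuda)      # tested with 11.8
if torch.cuda.is_available():
    print("GPU:", torch.cuda.get_device_name(0))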

Set up an environment

conda create -n SpaTrack python=3.10
conda activate SpaTrack

Install Flash-Attention

pip install flash-attn --no-build-isolation

or build flash-attention from source.
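
To confirm the build succeeded, a quick import check (not part of the original instructions):

python -c "import flash_attn; print(flash_attn.__version__)"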

Other Dependencies

pip install -r requirements.txt

Depth Estimator

In our default setting, a monocular depth estimator is needed to acquire metric depth from the video input. Several models are supported (ZoeDepth, Metric3D, UniDepth, and DepthAnything); we use ZoeDepth by default. Download dpt_beit_large_384.pt, ZoeD_M12_K.pt, and ZoeD_M12_NK.pt into models/monoD/zoeDepth/ckpts/.
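
For reference, this is roughly what ZoeDepth produces on a single frame (a minimal sketch using ZoeDepth's public torch.hub entry point; SpaTracker itself loads the checkpoints downloaded above from models/monoD/zoeDepth/ckpts, and the frame path here is hypothetical):

import torch
from PIL import Image

# Load the ZoeDepth NK model via its public torch.hub entry point.
zoe = torch.hub.load("isl-org/ZoeDepth", "ZoeD_NK", pretrained=True)
zoe = zoe.to("cuda" if torch.cuda.is_available() else "cpu").eval()

# Infer metric depth (in meters) for a single RGB frame.
frame = Image.open("frame_0000.png").convert("RGB")  # hypothetical path
depth = zoe.infer_pil(frame)  # numpy array of shape (H, W)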

Data

Our method supports RGB or RGBD video input.

RGB Videos
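
As an illustrative sketch only (imageio and the clip path are assumptions; refer to the repository's demo for the exact expected input format), an RGB clip can be read into a (T, H, W, 3) frame array:

import imageio.v3 as iio
import numpy as np

# Read all frames of an RGB clip into a (T, H, W, 3) uint8 array.
frames = iio.imread("examples/butterfly.mp4")  # hypothetical path
print(frames.shape, frames.dtype)

# Normalize to float32 in [0, 1], the usual range for model input.
video = frames.astype(np.float32) / 255.0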

RGBD Videos
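
For RGBD input, each frame comes with an aligned depth map, presumably replacing the estimated depth from the monocular model. A minimal sketch of pairing frames with depth (the file layout and 16-bit millimeter encoding are assumptions, a common RGBD convention, not the repo's documented format):

import glob
import numpy as np
from PIL import Image

# Hypothetical layout: rgb/0000.png ... paired with depth/0000.png (16-bit, millimeters).
rgb_paths = sorted(glob.glob("rgb/*.png"))
depth_paths = sorted(glob.glob("depth/*.png"))

frames, depths = [], []
for rgb_p, d_p in zip(rgb_paths, depth_paths):
    frames.append(np.asarray(Image.open(rgb_p).convert("RGB")))
    # Convert 16-bit millimeter depth to metric depth in meters.
    depths.append(np.asarray(Image.open(d_p), dtype=np.float32) / 1000.0)

video = np.stack(frames)   # (T, H, W, 3)
depth = np.stack(depths)   # (T, H, W)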

Visualizing 3D Trajectories

First, make sure Blender is installed. We provide visualization code for Blender:

/Applications/Blender.app/Contents/MacOS/Blender -P create.py -- --input ${OUTPUT}.npy
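
The path above is the macOS application bundle; on Linux the blender binary can be invoked directly. Note that Blender's -- separator forwards everything after it to the script rather than to Blender itself:

blender -P create.py -- --input ${OUTPUT}.npy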

For example, see the rendered butterfly sequence in the repository.
