In this repository we release (for now) the inference code for our work:
MS-RAFT+: High Resolution Multi-Scale RAFT
IJCV 2024
Azin Jahedi, Maximilian Luz, Marc Rivinius, Lukas Mehl and Andrés Bruhn
High Resolution Multi-Scale RAFT (Robus Vision Challenge 2022)
Robust Vision Challenge 2022
Azin Jahedi, Maximilian Luz, Lukas Mehl, Marc Rivinius and Andrés Bruhn
If you find our work useful please cite via BibTeX.
This work builds upon MS_RAFT.
The code has been tested with PyTorch 1.10.2+cu113. Install the required dependencies via
pip install -r requirements.txt
Alternatively you can also manually install the following packages in your virtual environment:
torch,torchvision, andtorchaudio(e.g., with--extra-index-url https://download.pytorch.org/whl/cu113for CUDA 11.3)matplotlibscipytensorboardopencv-pythontqdmparse
You can download our pre-trained model from the releases page.
Datasets are expected to be located under ./data in the following layout:
./data
├── kitti15 # KITTI 2015
│ └── dataset
│ ├── testing/...
│ └── training/...
├── middlebury # Middlebury
│ ├── test/...
│ │ └── img/...
│ └── training/...
│ ├── flow/...
│ └── img/...
├── sintel # Sintel
│ ├── test/...
│ └── training/...
└── viper # Viper
├── test/img/...
└── val
├── flow/...
└── img/...
For running MS_RAFT_plus on MPI Sintel images you need about 4 GB of GPU VRAM.
To compile the CUDA correlation module run the following once:
cd alt_cuda_corr && python setup.py install && cd ..And then you can evaluate the pre-trained model via:
python evaluate.py --model mixed.pth --dataset sintel --cuda_corrNote that the above-mentioned (with --cuda_corr) code performs on-demand cost computation and does not pre-compute the cost volume, because such computation is very memory intensive on high resolutions.
- Our code is licensed under the BSD 3-Clause No Military License. See LICENSE.
- The provided checkpoint is under the CC BY-NC-SA 3.0 license.
Parts of this repository are adapted from RAFT (license). We thank the authors for their excellent work.