Alexander Veicht · Felix Yang · Andri Horat · Deep Desai · Philipp Lindenberger
Our reimplemented and improved pipeline for MegaDepth generates high-quality depth maps
and camera poses for unstructured images of popular tourist landmarks.
MegaDepth, a dataset of unstructured images of popular tourist landmarks, was introduced in 2018. By leveraging structure-from-motion (SfM) and multi-view stereo (MVS) techniques along with data-cleaning methods, MegaDepth provides camera poses and depth maps for each image. Nonetheless, the results suffer from limitations such as degenerate camera poses, incomplete depth maps, and inaccuracies caused by unregistered images or noise in the pipeline. Despite these flaws, MegaDepth has become a standard dataset for training a wide range of computer vision models, including single-view depth estimation, local features, feature matching, and multi-view refinement. This is primarily due to the diversity of scenes, occlusions, and appearance changes captured in the dataset, which helps these models generalize well. Our project systematically addresses these problems to establish a refined MegaDepth ground-truth (GT) pipeline using recent methods such as hloc and Pixel-Perfect Structure-from-Motion.
Clone and install the repository by running the following commands:
git clone https://github.com/fyangch/RefinedMegaDepth.git
cd RefinedMegaDepth
pip install -e .
Download the South Building dataset and extract it to the data folder:
mkdir data
wget https://demuc.de/colmap/datasets/south-building.zip -O data/south-building.zip
unzip data/south-building.zip -d data
rm -rf data/south-building.zip data/south-building/sparse data/south-building/database.db
Run the following command to start the pipeline:
python -m megadepth.reconstruction scene=south-building
The images are expected to be grouped by scene and stored in the following layout:
data
├── scene_1
│ ├── images
│ │ ├── 00000000.jpg
│ │ ├── 00000001.jpg
│ │ ├── ...
├── scene_2
│ ├── images
│ │ ├── 00000000.jpg
│ │ ├── 00000001.jpg
│ │ ├── ...
├── ...
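Given this layout, a small helper can list the scenes that are ready for reconstruction. This is only a sketch under the layout assumptions above; find_scenes is a hypothetical helper, not part of the repository:

```python
from pathlib import Path

def find_scenes(data_root):
    """Return the names of scene folders under data_root that contain an images/ subfolder."""
    root = Path(data_root)
    if not root.is_dir():
        return []
    # A folder only counts as a scene if it follows the expected <scene>/images layout.
    return sorted(d.name for d in root.iterdir() if (d / "images").is_dir())
```

On the data folder sketched above, this would return the scene names (e.g. scene_1, scene_2) while skipping any folder without an images subdirectory.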
You can simply run the reconstruction pipeline by specifying the scene name:
python -m megadepth.reconstruction scene=scene_1
The pipeline will read the images from the images folder and create the following folders for the outputs:
data
├── scene_1
│ ├── images
│ ├── features
│ ├── matches
│ ├── sparse
│ ├── dense
│ ├── metrics
│ ├── results
├── scene_2
│ ├── ...
├── ...
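Based on the folder names above, one can quickly check which pipeline outputs already exist for a scene, e.g. to see whether a run completed. This is a sketch under the assumed output layout; output_status is a hypothetical helper, not part of the repository:

```python
from pathlib import Path

# Output folders created by the pipeline, as listed above.
OUTPUT_DIRS = ("features", "matches", "sparse", "dense", "metrics", "results")

def output_status(scene_dir):
    """Map each expected output folder of a scene to whether it exists yet."""
    scene = Path(scene_dir)
    return {name: (scene / name).is_dir() for name in OUTPUT_DIRS}
```

A scene with only sparse and dense present, for instance, would report those two as existing and the remaining four as missing.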
- Fix rotations
- Test MVS
- Remove dependencies (xarray, mit_semseg, ...)
- Check licenses (segmentation models etc.)