Mind The Edge: Refining Depth Edges in Sparsely-Supervised Monocular Depth Estimation

This is the official implementation of the CVPR'24 paper "Mind The Edge: Refining Depth Edges in Sparsely-Supervised Monocular Depth Estimation" by Lior Talker, Aviad Cohen, Erez Yosef, Alexandra Dana and Michael Dinerstein (Samsung Israel Research Center - SIRC).


Abstract: Monocular Depth Estimation (MDE) is a fundamental problem in computer vision with numerous applications. Recently, LIDAR-supervised methods have achieved remarkable per-pixel depth accuracy in outdoor scenes. However, significant errors are typically found in the proximity of depth discontinuities, i.e., depth edges, which often hinder the performance of depth-dependent applications that are sensitive to such inaccuracies, e.g., novel view synthesis and augmented reality. Since direct supervision for the location of depth edges is typically unavailable in sparse LIDAR-based scenes, encouraging the MDE model to produce correct depth edges is not straightforward. To the best of our knowledge, this paper is the first attempt to address the depth edges issue for LIDAR-supervised scenes. In this work we propose to learn to detect the location of depth edges from densely-supervised synthetic data, and use it to generate supervision for the depth edges in the MDE training. Despite the 'domain gap' between synthetic and real data, we show that depth edges that are estimated directly are significantly more accurate than the ones that emerge indirectly from the MDE training. To quantitatively evaluate our approach, and due to the lack of depth edges ground truth in LIDAR-based scenes, we manually annotated subsets of the KITTI and the DDAD datasets with depth edges ground truth. We demonstrate significant gains in the accuracy of the depth edges with comparable per-pixel depth accuracy on several challenging datasets.

**Code and weights of our method will be released soon!**

Datasets

The KITTI Depth Edges (KITTI-DE) and DDAD Depth Edges (DDAD-DE) validation datasets from the paper are provided as binary edge images in data/kitti_de/gt and data/ddad_de/gt. Text lists with one GT image filename per line are stored in data/kitti_de/kitti_de_annotated_edges.txt and data/ddad_de/ddad_de_annotated_edges.txt.

The RGB images for KITTI-DE are taken from KITTI's Semantic Instance Segmentation Evaluation benchmark; each RGB image has the same filename as its corresponding GT edge image in the KITTI-DE validation set.

The RGB images for DDAD-DE are taken from DDAD's instance segmentation images in the validation set (clips 150-199); in each clip, one image has instance segmentation GT, from which we annotated the depth edge GT.
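
For reference, the GT edge maps can be read with standard Python tooling. The following is a minimal sketch (not part of this repo) that loads the KITTI-DE GT edge images listed in the annotation file; the image format and the exact binary encoding of the edge maps are assumptions.

```python
# Minimal sketch: load the KITTI-DE GT depth-edge maps.
# Assumptions: the GT files open with PIL and non-zero pixels mark edge locations.
import os
import numpy as np
from PIL import Image

gt_list_path = "data/kitti_de/kitti_de_annotated_edges.txt"
gt_dir_path = "data/kitti_de/gt"

with open(gt_list_path) as f:
    gt_names = [line.strip() for line in f if line.strip()]

gt_edges = []
for name in gt_names:
    img = Image.open(os.path.join(gt_dir_path, name))
    # Binarize: any non-zero pixel is treated as a depth-edge pixel.
    gt_edges.append(np.array(img) > 0)

print(f"Loaded {len(gt_edges)} GT edge maps, first shape: {gt_edges[0].shape}")
```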

To evaluate depth maps (in .npy format) on the KITTI-DE dataset using the AUC (edges) metric (as in Tab.1 in the paper):

python eval_depth_edges.py \
  --depth_pred_list_path [path_to_pred_npy_name_list] \
  --depth_pred_dir_path [path_to_dir_with_pred_npy_files] \
  --depth_edge_gt_list_path data/kitti_de/kitti_de_annotated_edges.txt \
  --depth_edge_gt_dir_path data/kitti_de/gt
  • [path_to_pred_npy_name_list] is a path to a txt file containing only the filenames of the predicted depth .npy files (one per line).
  • [path_to_dir_with_pred_npy_files] is a path to the directory that contains the predicted depth .npy files.
  • Note: the predicted depth files and the depth edge GT files must be listed in the same order (see the sketch after this list).
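
As referenced in the note above, here is a minimal sketch (an assumption, not the official pipeline) of how predicted depth maps might be exported to .npy files and how the filename list could be written. The RGB directory, output paths, and the predict_depth placeholder are hypothetical.

```python
# Minimal sketch: export predicted depth maps to .npy and write the name list
# expected by eval_depth_edges.py. Paths and the model stub are placeholders.
import os
import numpy as np
from PIL import Image

rgb_dir = "data/kitti_de/rgb"      # hypothetical location of the KITTI-DE RGB images
pred_dir = "preds/kitti_de"
os.makedirs(pred_dir, exist_ok=True)

def predict_depth(rgb: np.ndarray) -> np.ndarray:
    """Placeholder for your MDE model; must return an HxW float depth map."""
    return np.zeros(rgb.shape[:2], dtype=np.float32)

names = []
for image_name in sorted(os.listdir(rgb_dir)):
    rgb = np.array(Image.open(os.path.join(rgb_dir, image_name)))
    depth = predict_depth(rgb)
    npy_name = os.path.splitext(image_name)[0] + ".npy"
    np.save(os.path.join(pred_dir, npy_name), depth)
    names.append(npy_name)

# The evaluation assumes this list and the GT edge list are in the same order.
with open("preds/kitti_de_pred_list.txt", "w") as f:
    f.write("\n".join(names) + "\n")
```

The resulting preds/kitti_de_pred_list.txt and preds/kitti_de would then be passed as --depth_pred_list_path and --depth_pred_dir_path in the command above.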

To evaluate depth maps (in .npy format) on the DDAD-DE dataset using the AUC (edges) metric:

python eval_depth_edges.py \
  --depth_pred_list_path [path_to_pred_npy_name_list] \
  --depth_pred_dir_path [path_to_dir_with_pred_npy_files] \
  --depth_edge_gt_list_path data/ddad_de/ddad_de_annotated_edges.txt \
  --depth_edge_gt_dir_path data/ddad_de/gt \
  --prec_recall_eval_range_min 0.14 \
  --prec_recall_eval_range_max 0.37
  • prec_recall_eval_range_min and prec_recall_eval_range_max define the (partial) range over which the edge AUC metric is computed (as in Tab. 2 in the paper); a rough illustration of the partial-range idea follows this list.
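
As a rough illustration of the partial-range idea (not necessarily how eval_depth_edges.py implements the AUC (edges) metric), the sketch below computes a trapezoidal area under a toy precision-recall curve restricted to the sub-range [0.14, 0.37]; the curve and the choice of integration variable are assumptions, and the script itself is the authoritative implementation.

```python
# Illustration only: partial area under a curve over a restricted sub-range.
import numpy as np

def partial_auc(x, y, x_min, x_max, num=200):
    """Trapezoidal area under y(x) restricted to x in [x_min, x_max]."""
    order = np.argsort(x)
    x, y = np.asarray(x)[order], np.asarray(y)[order]
    grid = np.linspace(x_min, x_max, num)
    y_interp = np.interp(grid, x, y)
    return float(np.sum(0.5 * (y_interp[1:] + y_interp[:-1]) * np.diff(grid)))

# Toy precision-recall curve: precision drops as recall grows.
recall = np.linspace(0.0, 1.0, 50)
precision = 1.0 - 0.6 * recall
print(partial_auc(recall, precision, 0.14, 0.37))  # partial AUC over [0.14, 0.37]
```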

Citation

If you find this work relevant, please consider citing:

@inproceedings{talker2022mind,
  title={Mind The Edge: Refining Depth Edges in Sparsely-Supervised Monocular Depth Estimation},
  author={Talker, Lior and Cohen, Aviad and Yosef, Erez and Dana, Alexandra and Dinerstein, Michael},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year={2024}
}

Acknowledgements

py-bsds500
