Depth Estimation for Colonoscopy Images with Self-supervised Learning from Videos

Abstract

Depth estimation in colonoscopy images provides geometric clues for downstream medical analysis tasks, such as polyp detection, 3D reconstruction, and diagnosis. Recently, deep learning technology has made significant progress in monocular depth estimation for nat- ural scenes. However, without sufficient ground truth of dense depth maps for colonoscopy images, it is signicantly challenging to train deep neural networks for colonoscopy depth estimation. In this paper, we pro- pose a novel approach that makes full use of both synthetic data and real colonoscopy videos.We use synthetic data with ground truth depth maps to train a depth estimation network with a generative adversarial network model. Despite the lack of ground truth depth, real colonoscopy videos are used to train the network in a self-supervision manner by exploiting temporal consistency between neighboring frames. Furthermore, we design a masked gradient warping loss in order to ensure temporal consis- tency with more reliable correspondences. We conducted both quantita- tive and qualitative analysis on an existing synthetic dataset and a set of real colonoscopy videos, demonstrating the superiority of our method on more accurate and consistent depth estimation for colonoscopy images.

[In MICCAI 2021].

Prerequisites

Linux
Python 3, pytorch1.4
NVIDIA GPU (11G memory or larger) + CUDA cuDNN

Plan

We will clean the code and write more detailed instruction soon.

Testing

Test the model (bash ./scripts/test.sh):

#!./scripts/test.sh
python syn_test.py --name colon2depth_512p --no_instance --label_nc 0

Dataset

For synthetic data, you can download from http://cmic.cs.ucl.ac.uk/ColonoscopyDepth/.
For real dataset, you can download from https://github.com/dashishi/LDPolypVideo-Benchmark. This is our raw data, the data used for training is the subset of it. We crop the raw image to remove the black corner.

Multi-GPU training

Train a model using multiple GPUs (bash ./scripts/train.sh):

#!./scripts/train.sh
python train.py --name colon2depth_512p --batchSize 8 --gpu_ids 1,2 --label_nc 0 --no_instance --tf_log --no_vgg_loss --continue_train```

Citation

If you find this useful for your research, please use the following.

Kai. Cheng, Yiting. Ma, Yang. Li, Bin. Sun and Xuejin. Chen. "Depth Estimation for Colonoscopy Images with
Self-supervised Learning from Videos", Medical Image Computing and Computer Assisted Intervention Society, 2021

Acknowledgments

This code borrows from NVIDIA/pix2pixHD.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
correlation		correlation
data		data
models		models
oflow		oflow
options		options
scripts		scripts
util		util
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
_config.yml		_config.yml
ablation_study.py		ablation_study.py
check_unit.py		check_unit.py
encode_features.py		encode_features.py
oc_inference.py		oc_inference.py
pix2pixHD_LICENSE.txt		pix2pixHD_LICENSE.txt
precompute_feature_maps.py		precompute_feature_maps.py
run_engine.py		run_engine.py
syn_test.py		syn_test.py
test.py		test.py
train.py		train.py
ustc_test.py		ustc_test.py
vis_mask.py		vis_mask.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Depth Estimation for Colonoscopy Images with Self-supervised Learning from Videos

Abstract

Prerequisites

Plan

Testing

Dataset

Multi-GPU training

Citation

Acknowledgments

About

Releases

Packages

Languages

License

ckLibra/Self-Supervised-Depth-Estimation-for-Colonoscopy

Folders and files

Latest commit

History

Repository files navigation

Depth Estimation for Colonoscopy Images with Self-supervised Learning from Videos

Abstract

Prerequisites

Plan

Testing

Dataset

Multi-GPU training

Citation

Acknowledgments

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages