CNN-based monocular disparity (inverse depth) estimation network for surgical videos collected in da Vinci surgery. The source code and data are associated with a short report presented at the Hamlyn Symposium on Medical Robotics 2017.
If you use the code or data, please cite following:
Ye, M., Johns, E., Handa, A., Zhang, L., Pratt, P. and Yang, G.Z.
Self-Supervised Siamese Learning on Stereo Image Pairs for Depth
Estimation in Robotic Surgery. Hamlyn Symposium on Medical Robotics. 2017.
You can download our data (9.3GB) and pretrained models and place them in the "data" and "trained" folders, respectively.
Torch
Torch-colormap (for visualisation only)
This code is distributed under BSD License.
-
The autoencoder model in this implementation is slightly different from the one in the report. Certain layers have been removed for memory consideration and skip layers and multiscale training have been added.
-
Please adjust the mini-batch size according to your specific GPU memory.
-
This implementation has been tested in Ubuntu.
-
Please see run_train.lua and run_inference.lua for example usage.