Thank you for your work! I have a few questions about fine-tuning that I'd like to ask regarding depth map estimation when camera parameters are known.
-
If the intrinsic and extrinsic parameters are known, and I want to obtain a depth map, during fine-tuning, should I use the default loss function, or only losses related to depth, scale, and mask?
-
Given that the intrinsic and extrinsic parameters are known, should I freeze the main encoder, and also freeze any encoder components related to processing these parameters?
-
After obtaining the depth map with known intrinsic and extrinsic parameters, would it be better to use the ground truth (GT) of intrinsic and extrinsic parameters or the predicted intrinsic and extrinsic to generate the point map for optimal results?