outputs["position_depth", scale, frame_id]的维度问题 #56

nanasylum · 2024-04-05T09:17:55Z

很感谢您所做的工作！

我在您网络的基础上试图改进时，想使用tgt_depth和ref_depth这两个变量。根据我对train_stage_two.py的理解，tgt_depth应该是指outputs["depth", 0, scale]，ref_depth应该是指outputs["position_depth", scale, frame_id]。但是我却发现outputs["position_depth", scale, frame_id]的shape是[12,2,256,320]。为什么第二个维度是2？不应该是3吗？

期待您的回复

nanasylum · 2024-04-06T08:51:37Z

还有一个问题，请问train_stage_two.py文件中，outputs["position_depth"]和outputs["depth"]有什么区别吗？

Line421 我的理解是用视差计算深度

            _, depth = disp_to_depth(disp, self.opt.min_depth, self.opt.max_depth)

            outputs[("depth", 0, scale)] = depth

Line457 但是这里就不太理解，输入相机的内参可以怎么样？

                outputs[("position_depth", scale, frame_id)] = self.position_depth[source_scale](
                        cam_points, inputs[("K", source_scale)], T)

期待您的回复

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

outputs["position_depth", scale, frame_id]的维度问题 #56

outputs["position_depth", scale, frame_id]的维度问题 #56

nanasylum commented Apr 5, 2024

nanasylum commented Apr 6, 2024

outputs["position_depth", scale, frame_id]的维度问题 #56

outputs["position_depth", scale, frame_id]的维度问题 #56

Comments

nanasylum commented Apr 5, 2024

nanasylum commented Apr 6, 2024