Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

outputs["position_depth", scale, frame_id]的维度问题 #56

Open
nanasylum opened this issue Apr 5, 2024 · 1 comment
Open

outputs["position_depth", scale, frame_id]的维度问题 #56

nanasylum opened this issue Apr 5, 2024 · 1 comment

Comments

@nanasylum
Copy link

很感谢您所做的工作!

我在您网络的基础上试图改进时,想使用tgt_depth和ref_depth这两个变量。根据我对train_stage_two.py的理解,tgt_depth应该是指outputs["depth", 0, scale],ref_depth应该是指outputs["position_depth", scale, frame_id]。但是我却发现outputs["position_depth", scale, frame_id]的shape是[12,2,256,320]。为什么第二个维度是2?不应该是3吗?

期待您的回复

@nanasylum
Copy link
Author

还有一个问题,请问train_stage_two.py文件中,outputs["position_depth"]和outputs["depth"]有什么区别吗?

Line421 我的理解是用视差计算深度

            _, depth = disp_to_depth(disp, self.opt.min_depth, self.opt.max_depth)

            outputs[("depth", 0, scale)] = depth

Line457 但是这里就不太理解,输入相机的内参可以怎么样?

                outputs[("position_depth", scale, frame_id)] = self.position_depth[source_scale](
                        cam_points, inputs[("K", source_scale)], T)

期待您的回复

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant