How about the result on real world datasets? #5
Comments
I think you'll have to be a bit more specific about the problem. Do you have a problem with COLMAP, or does my method give you bad results when you use it with the COLMAP poses? If it's the latter, make sure the cameras are in the correct coordinate system and that you are correctly loading the depth maps. You will have to either save the data in the described format or write your own dataset loader. Since you have depth, you can also try aligning the images with BundleFusion.
The poses are estimated by COLMAP. The depth images are normalized to 0–1 for training. I wondered whether I need to reconstruct my scene with an RGB-D reconstruction and get the sc_factor and translation correct for the network?
The depth images need to be in metric space.
@endlesswho Is the problem solved? Could you please post your result here?
Sadly, the problem still remains. The depth images are in metric space, but the poses output by COLMAP are only defined up to scale. I think an RGB-D reconstruction method would work!
You can use some flavor of KinectFusion to obtain camera poses. If you want to use the COLMAP poses with your depth sensor's measurements, you will need to scale the translation vectors of your camera poses.
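The translation scaling mentioned above could be sketched roughly as follows. The function name `scale_colmap_poses` and the way the scale factor is obtained are assumptions; the comment only says the translations must be scaled so that the poses match the sensor's metric depths:

```python
import numpy as np

def scale_colmap_poses(poses, scale):
    """Scale the translation part of 4x4 camera-to-world poses.

    COLMAP reconstructions are only defined up to a global scale, so the
    translations must be multiplied by a factor that brings them into the
    metric space of the depth sensor. `scale` is assumed to be estimated
    externally, e.g. by comparing COLMAP's sparse point depths against the
    sensor's metric depth measurements.
    """
    poses = np.asarray(poses, dtype=np.float64).copy()
    poses[:, :3, 3] *= scale  # scale only the translation column
    return poses
```

The rotations are left untouched; only the translation column of each pose changes.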
@rancheng My problem was solved with an RGB-D reconstruction using ICP matching to obtain the trajectory. However, the reconstruction results with @dazinovic's method seem not so good. I also ran it on breakfast_room. With a perturbation of the trajectory, the result is shown below:
Reasonable! I'll have a try and post my new results.
My camera extrinsics were in the wrong coordinate system. After transforming them to the OpenGL convention, the results are better. However, what if I don't know the bounds of the scene? Any suggestions?
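The coordinate-system fix described above is typically a flip of the camera's y and z axes, since COLMAP uses the OpenCV convention (x right, y down, z forward) while many NeRF-style codebases expect OpenGL (x right, y up, z backward). A minimal sketch, with `opencv_to_opengl` as a hypothetical helper name:

```python
import numpy as np

# Flipping the y and z camera axes converts between the OpenCV and
# OpenGL camera conventions (the transform is its own inverse).
FLIP_YZ = np.diag([1.0, -1.0, -1.0, 1.0])

def opencv_to_opengl(c2w):
    """Convert a 4x4 camera-to-world matrix from OpenCV to OpenGL convention."""
    return np.asarray(c2w, dtype=np.float64) @ FLIP_YZ
```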
You can approximate it with your camera positions. |
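Approximating the scene bounds from camera positions, as suggested above, could look like the following sketch. The function name `bounds_from_cameras` and the fixed `margin` are assumptions; how far the scene extends beyond the trajectory has to be tuned per scene:

```python
import numpy as np

def bounds_from_cameras(poses, margin=1.0):
    """Approximate an axis-aligned scene bounding box from camera positions.

    `poses` are 4x4 camera-to-world matrices; the camera centers are their
    translation columns. `margin` (in meters) is a guess for how far the
    scene extends beyond the camera trajectory.
    """
    centers = np.asarray(poses)[:, :3, 3]
    lo = centers.min(axis=0) - margin
    hi = centers.max(axis=0) + margin
    return lo, hi
```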
My result is all right with the help of your advice. Thanks for your kind reply.
Hello @endlesswho @dazinovic, can you briefly describe what changes you made to get it to work with real-world datasets? Is it something like the following?:
Also a few more questions @endlesswho:
I generated poses using BundleFusion (although you can also use COLMAP for this) and then applied the transformation as described in the linked issue. The depth maps are not normalized; the values need to be in meters. ScanNet depth maps are in millimeters, so you simply need to divide by 1000. The method will work with other scales too, but the depth maps need to be consistent with the camera poses.
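The millimeter-to-meter conversion described above, applied to an already-loaded 16-bit depth array, could be sketched as follows. The function name `depth_mm_to_m` and the `max_depth` cutoff are assumptions not stated in the comment:

```python
import numpy as np

def depth_mm_to_m(depth_mm, max_depth=10.0):
    """Convert a ScanNet-style 16-bit depth map (millimeters) to meters.

    Zero pixels mark missing measurements and stay zero. Values beyond
    `max_depth` meters are treated as invalid and zeroed out; this cutoff
    is an assumed choice, tune it for your sensor.
    """
    depth_m = depth_mm.astype(np.float32) / 1000.0
    depth_m[depth_m > max_depth] = 0.0
    return depth_m
```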
Hello, I am trying to reproduce Neural-RGBD with the data used by Manhattan-SDF and NICE-SLAM (e.g., Replica). Do you have any ideas? Thanks! @dazinovic @endlesswho
I have encountered the same issue as @junshengzhou. Could you reply to us? @dazinovic @endlesswho
@dazinovic How about the results on real-world datasets? I collected a dataset with my own RGB-D camera and estimated the poses with COLMAP, but the results are a mess. Any advice?