Depth map and intrinsic #25

jzhzhang · 2021-12-07T14:09:34Z

Thanks for your amazing dataset!

I encountered some weird results (as shown below) when planning to back project the depth map to generate the point cloud. The intrinsic matrix is obtained by @liuyuan in issue#4, and the depth map is directly from the car/106_12650_23736/depths/frame000001.jpg.geometric.png. It seems the intrinsic matrix is not related to the depth map.

Can you give me some quick advice or references?

The text was updated successfully, but these errors were encountered:

jzhzhang · 2021-12-08T01:46:34Z

The vaule of the depth map car/106_12650_23736/depths/frame000001.jpg.geometric.png range from [16490-22649], which indicates the camera center is far from the object.

The color map of the depth (scaled by 1e-3):

davnov134 · 2021-12-08T13:12:55Z

Hi, we provide tooling for operating with the depth maps loaded using the Co3DDataset object.
Specifically, this function

co3d/dataset/visualize.py

Line 92 in 8ad0f03

point_cloud = get_rgbd_point_cloud(

contains an example of loading depth maps and converting to point clouds. Can you try to use it to unproject your depth maps? There are few more examples in /tests/test_dataset_visualize.py

jzhzhang · 2021-12-08T13:33:33Z

Thanks. I will follow the scripts and let you know how it turns out.

jzhzhang · 2021-12-10T14:06:10Z

@colesbury Sorry to bother you agiain. I want to know some details about the depth maps.

Do the depth maps directly come from the COLMAP? Have you made any post-processing modifications to the depth maps?

jzhzhang · 2021-12-11T02:10:43Z

I guess i found the resaon why the depth map is not working.

The depth value in the depth maps should be loaded as 16 bit float number. The same as the _load_16bit_depth

co3d/dataset/co3d_dataset.py

Line 728 in b22f145

def _load_16big_png_depth(depth_png):

This is how it looks:

AlexisStdp · 2021-12-20T21:59:24Z

I guess i found the resaon why the depth map is not working.

The depth value in the depth maps should be loaded as 16 bit float number. The same as the _load_16bit_depth

co3d/dataset/co3d_dataset.py

Line 728 in b22f145

def _load_16big_png_depth(depth_png):

This is how it looks:

Hello, this projection is exactly what I need in my current project. However I didn't fully understand how you obtained it, I tried quite a few provided functions in this repository but it's a bit unclear how to use them. Please, could you tell us what steps you followed and which functions you used to get this nice point cloud?

For example, loading depth as 16 bit had a similar result as your first projection.

shapovalov · 2021-12-23T12:49:53Z

@AlexisStdp Did you use the provided function to load depth? Please note the files are not a standard 16-bit PNG; the function reinterprets binary 16-bit values as floats.

AlexisStdp · 2021-12-31T12:52:53Z

@shapovalov Thank you very much for your answer! I indeed used some of the provided functions to load depth, and to load RGB image. And some functions to get the frame_annotations.
Ideally if I could have the following setting it would be perfect: given an RGB image and a depth image (and the .jgz annotations), we load RGB and depth, we get the intrinsic (and maybe scale adjustment), and we backproject to get the partial point cloud.

In the provided code, it seems that the function "get_rgbd_point_cloud" is exactly that, but I'm unfortunately unable to make it work because I'm not sure how to get the argument "camera: CamerasBase" (is there a simple way I could get it?).
There is a function "get_co3d_sequence_pointcloud" which I tried to use in order to get this "camera" (because it itself uses "get_rgdp_point_cloud"), but inside of it the Dataloader didn't work for various reasons.

Is there maybe an alternative approach? - I tried using open3d for example but this gives me a "flat" point cloud (where all points seem to lie on a 2D plane).

Thanks in advance, and happy new year!

shapovalov · 2022-01-14T12:56:25Z

@AlexisStdp Co3dDataset should load the data in the required format. In particular, frame_data.camera is in PyTorch3D format, which should be compatible with get_rgbd_point_cloud. If that does not work for you, could you share more details?

If you want to dig deeper, this is the code that reads the Cameras object: https://github.com/facebookresearch/co3d/blob/main/dataset/co3d_dataset.py#L490

Note that the camera is in NDC coordinate system. Please refer to PyTorch3D documentation for details: https://pytorch3d.org/docs/cameras

jzhzhang closed this as completed Dec 14, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Depth map and intrinsic #25

Depth map and intrinsic #25

jzhzhang commented Dec 7, 2021

jzhzhang commented Dec 8, 2021

davnov134 commented Dec 8, 2021

jzhzhang commented Dec 8, 2021

jzhzhang commented Dec 10, 2021

jzhzhang commented Dec 11, 2021

AlexisStdp commented Dec 20, 2021

shapovalov commented Dec 23, 2021

AlexisStdp commented Dec 31, 2021

shapovalov commented Jan 14, 2022

Depth map and intrinsic #25

Depth map and intrinsic #25

Comments

jzhzhang commented Dec 7, 2021

jzhzhang commented Dec 8, 2021

davnov134 commented Dec 8, 2021

jzhzhang commented Dec 8, 2021

jzhzhang commented Dec 10, 2021

jzhzhang commented Dec 11, 2021

AlexisStdp commented Dec 20, 2021

shapovalov commented Dec 23, 2021

AlexisStdp commented Dec 31, 2021

shapovalov commented Jan 14, 2022