About the camera intrinsics matrix #4

Closed
OasisYang opened this issue Jul 30, 2021 · 11 comments

@OasisYang

Hi! Thanks for your wonderful dataset!
I have a question about the camera intrinsics matrix. I found that for all the data the principal_point is [0, 0], which is quite rare for real-world cameras. Could you please explain this briefly? Thanks in advance.

@davnov134
Contributor

Hi, we store the intrinsics in the PyTorch3D convention. More info here:
https://pytorch3d.org/docs/cameras
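
For illustration, a minimal sketch of wrapping the stored values in a PyTorch3D camera object, assuming a single viewpoint annotation with the fields used later in this thread (focal_length, principal_point, R, T); it ignores the legacy vs. ndc_isotropic distinction discussed further below:

import torch
from pytorch3d.renderer import PerspectiveCameras

def camera_from_viewpoint(viewpoint):
    # focal_length and principal_point stay in the NDC convention;
    # PyTorch3D cameras interpret them directly, so no conversion to pixels is needed here.
    return PerspectiveCameras(
        focal_length=torch.tensor([viewpoint.focal_length]),
        principal_point=torch.tensor([viewpoint.principal_point]),
        R=torch.tensor([viewpoint.R]),
        T=torch.tensor([viewpoint.T]),
    )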

@OasisYang
Author

Thanks for your reply! I know I need to convert the given principal point to screen space. But what I mean is that, even after converting, the principal point is located exactly at the center of the image, which is not very common. Did you warp the images or apply other preprocessing steps to ensure this?

@davnov134
Contributor

The location of the principal point is decided by the COLMAP image rectification algorithm.
I just checked the raw COLMAP data, and it seems that the COLMAP undistorter also resamples the image so that the principal point coincides exactly with the center of the image.
Thanks for spotting this.

@liuyuan-pal

Thanks for sharing this dataset.

I finally figured out how to convert the annotations to OpenCV-style extrinsics and intrinsics, which may be helpful for others:

import numpy as np

def co3d_annotation_to_opencv_pose(entry):
    p = entry.viewpoint.principal_point
    f = entry.viewpoint.focal_length
    h, w = entry.image.size
    K = np.eye(3)
    s = (min(h, w) - 1) / 2
    K[0, 0] = f[0] * (w - 1) / 2
    K[1, 1] = f[1] * (h - 1) / 2
    K[0, 2] = -p[0] * s + (w - 1) / 2
    K[1, 2] = -p[1] * s + (h - 1) / 2

    R = np.asarray(entry.viewpoint.R).T   # note the transpose here
    T = np.asarray(entry.viewpoint.T)
    pose = np.concatenate([R, T[:, None]], 1)
    pose = np.diag([-1, -1, 1]).astype(np.float32) @ pose  # flip the direction of the x and y axes

    # "pose" is the extrinsic and "K" is the intrinsic
    # pose = [R|t]
    # x_img = K (R @ x_wrd + t)
    # x_img is in pixels
    return K, pose

However, I still have a question about how to convert points from the estimated depth maps into the coordinate system of "pointcloud.ply".
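
For the back-projection part of that question, here is a minimal numpy sketch of the standard pinhole unprojection, assuming K and pose come from the function above and that the depth values are consistent with the annotated cameras (the alignment with pointcloud.ply itself is not verified here):

import numpy as np

def depth_to_world(depth, K, pose):
    # depth: (h, w) depth map in the same units as the camera translation
    # K: 3x3 pixel-space intrinsics; pose: 3x4 world-to-camera [R|t]
    h, w = depth.shape
    v, u = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    pix = np.stack([u, v, np.ones_like(u)], -1).reshape(-1, 3).astype(np.float64)
    # back-project pixels to camera space: x_cam = depth * K^{-1} [u, v, 1]^T
    x_cam = (np.linalg.inv(K) @ pix.T).T * depth.reshape(-1, 1)
    # invert the world-to-camera pose: x_wrd = R^T (x_cam - t)
    R, t = pose[:, :3], pose[:, 3]
    return (x_cam - t) @ R  # same as (R.T @ (x_cam - t).T).T, shape (h*w, 3)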

@MaybeOjbk

(quoting @liuyuan-pal's comment above, including the conversion snippet and the question about pointcloud.ply)

Thanks a lot. Also, we should use the parameters of train_dataset[idx].camera instead of those of entry.viewpoint when we crop the images, because after cropping and resizing, principal_point and focal_length may change.

@MaybeOjbk

MaybeOjbk commented Dec 18, 2021

My test code is here:

import numpy as np

train_dataset = datasets['train']

def co3d_annotation_to_opencv_pose(idx):
    camera = train_dataset[idx].camera
    p = camera.principal_point[0]
    f = camera.focal_length[0]
    R = camera.R[0]
    T = camera.T[0]
    _, h, w = train_dataset[idx].image_rgb.size()
    K = np.eye(3)
    s = (min(h, w) - 1) / 2
    K[0, 0] = f[0] * (w - 1) / 2
    K[1, 1] = f[1] * (h - 1) / 2
    K[0, 2] = -p[0] * s + (w - 1) / 2
    K[1, 2] = -p[1] * s + (h - 1) / 2

    R = np.asarray(R).T   # note the transpose here
    T = np.asarray(T)
    pose = np.concatenate([R, T[:, None]], 1)
    pose = np.diag([-1, -1, 1]).astype(np.float32) @ pose  # flip the direction of the x and y axes
    return K, pose

@shapovalov
Contributor

Please note that the PyTorch3D NDC convention places the −1 and 1 coordinates at the corners of the image, not at the centres of the corner pixels, so you should not subtract 1 from h, w, and min(h, w).

Please try to use the provided data loaders. If they do not fulfil some of your needs, please let us know.

The reference for parsing the viewpoint (applying the crop if needed) is https://github.com/facebookresearch/co3d/blob/main/dataset/co3d_dataset.py#L490 .
For conversion to OpenCV format, PyTorch3D has a function
https://github.com/facebookresearch/pytorch3d/blob/main/pytorch3d/utils/camera_conversions.py#L65
with the actual implementation in
https://github.com/facebookresearch/pytorch3d/blob/main/pytorch3d/renderer/camera_conversions.py#L61 .
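
As a usage sketch of that conversion function, assuming cameras is a PerspectiveCameras batch such as train_dataset[idx].camera and (h, w) is the corresponding image size in pixels:

import torch
from pytorch3d.utils import opencv_from_cameras_projection

# Returns OpenCV-style rotation, translation and pixel-space intrinsics
# for each camera in the batch; image_size is (height, width) per camera.
R_cv, tvec_cv, K_cv = opencv_from_cameras_projection(
    cameras, image_size=torch.tensor([[h, w]])
)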

@Red-Fairy

(quoting @MaybeOjbk's test code above)

Thanks for sharing. I tried to obtain the cam2world matrix by using your code to get the pose and then inverting it. However, I found that the translation part of every cam2world matrix (i.e., [:, 3:]) has a negative third entry, indicating that the object is placed at z < 0, which seems very strange. Did you observe this phenomenon? Thanks in advance.
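
For reference, the inversion step can be written in closed form; a minimal numpy sketch, assuming pose is the 3x4 world-to-camera [R|t] returned above (the third entry of the returned translation column is the quantity being discussed):

import numpy as np

def invert_pose(pose):
    # pose: 3x4 world-to-camera [R|t]; returns the 3x4 camera-to-world [R^T | -R^T t]
    R, t = pose[:, :3], pose[:, 3]
    cam_center = -R.T @ t  # camera position in world coordinates
    return np.concatenate([R.T, cam_center[:, None]], 1)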

@AzmiHaider92

AzmiHaider92 commented Feb 16, 2024

Edited:

In these docs:
https://pytorch3d.org/docs/cameras

the focal lengths are:

s = min(w,h)
K[0, 0] = f[0] * s / 2
K[1, 1] = f[1] * s / 2

And in the code posted above, the focal lengths are:

K[0, 0] = f[0] * w / 2
K[1, 1] = f[1] * h / 2

Which one is correct?

Thanks

@shapovalov
Contributor

At some point after the CO3Dv1 release, PyTorch3D changed its NDC convention; the two conventions differ for non-square images.
If the viewpoint annotation contains an intrinsics_format: ndc_isotropic field, it uses the new format (your former snippet applies); otherwise it uses the legacy format (the latter snippet applies).

See https://github.com/facebookresearch/co3d/blob/main/co3d/dataset/data_types.py#L72

Hope it helps.
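
Putting the two conventions together, a hedged sketch of the NDC-to-pixel conversion keyed on that field (names follow the intrinsics_format field discussed above; this is illustrative, not the reference implementation in the provided data loaders):

import numpy as np

def ndc_intrinsics_to_pixels(f, p, h, w, intrinsics_format):
    # f, p: NDC-space focal length and principal point; h, w: image size in pixels
    K = np.eye(3)
    if intrinsics_format == "ndc_isotropic":
        # new convention: a single isotropic scale, half of the shorter image side
        sx = sy = min(h, w) / 2
    else:
        # legacy convention: per-axis scales
        sx, sy = w / 2, h / 2
    K[0, 0] = f[0] * sx
    K[1, 1] = f[1] * sy
    K[0, 2] = w / 2 - p[0] * sx
    K[1, 2] = h / 2 - p[1] * sy
    return K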

@AzmiHaider92

Yes. This is very helpful.
Thank you so much.
