
Matching the geometry to the input image #22

Closed · jamalknight opened this issue Mar 26, 2020 · 6 comments
Labels: question (Further information is requested)
@jamalknight

Hi there

I have a question about aligning the geometry to a camera in a 3D app like Maya.
Object classification/segmentation works fine on my example image.

The geometry obj file is created, but I was wondering if there is a way to align the geometry with a perspective camera that matches the original image.

  • The generated geometry sits in 3D space, not aligned to the camera
  • The geometry can be aligned manually by eye, but that might not be accurate

I would like it to match the image - is there a relatively simple way this could be done?

@gkioxari (Contributor) commented Mar 26, 2020

Hi @jamalknight! This is an excellent question! I want to give a complete (and thus somewhat long!) response, as I think it will be useful for others as well. My answer below assumes a perspective camera.
(Edit: my earlier answer misunderstood the issue, but this version should address your question!)

What does Mesh R-CNN output?

First, let's see what Mesh R-CNN outputs. Mesh R-CNN returns the 3D shape of an object in the camera coordinate system, confined to a 3D box that respects the aspect ratio of the object detected in the image. If you provide the focal length f of the camera and the actual depth t_z of the object's center, i.e. how far the center of the object is from the image plane along the Z axis, then Mesh R-CNN will pixel-align the predicted 3D shape with the image, and the prediction will correspond to the true metric size of the object: its actual scale in the real world!
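For concreteness, this is just the standard pinhole model (my notation, nothing specific to the codebase): a point (X, Y, Z) in camera coordinates projects through focal length f to

```latex
(u, v) \;=\; \left( f\,\frac{X}{Z},\; f\,\frac{Y}{Z} \right),
\qquad
(sX,\; sY,\; sZ) \;\mapsto\; \left( f\,\frac{sX}{sZ},\; f\,\frac{sY}{sZ} \right) = (u, v)
\quad\text{for any } s > 0 .
```

Scaling the shape and its depth together therefore leaves every pixel unchanged: pixel alignment alone can never pin down metric scale, which is exactly why the made-up (f, t_z) in the demo (see below) still pixel-aligns.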

Metric Scale

While for most images the focal length f is available from the image metadata, knowing t_z is difficult. Also note that Mesh R-CNN does not predict t_z, because Pix3D does not contain useful metric depth for its objects. In the Pix3D annotations, the provided tuple (f, t_z) corresponds neither to the actual camera metadata nor to the metric depth of the object; it is computed at annotation time by their annotation process and tool, and is thus somewhat ad hoc. This is why we don't tackle the problem of estimating t_z (also known as the scene layout prediction problem).

I don't care about metric scale. I just want to pixel align.

However, if you don't care about metric scale and only want to pixel-align the object to the image, that is possible with our demo! The demo runs with a default focal length f=20 (this is the Blender focal length assuming a 32mm sensor width; it is not the true focal length of the image, we make it up!). The demo also places the object at some arbitrary t_z > 0.0; again, this is not the true metric depth of the object. Given these choices of (f, t_z), the demo outputs an object shape placed at t_z. The metric size of the predicted object will not correspond to the true size of the object in the world, but it will be a scaled version of it. Now, to pixel-align the predicted shape with the image, all you need to do is render the 3D mesh with f=20. Note that the value 20 is inconsequential; you would get the same pixel alignment with any other f, but it's important that the value of f you pick when running the demo is also the one you use when rendering!
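If you'd rather skip Blender, here is a minimal PyTorch3D sketch of that last rendering step. To be clear, this is a sketch and not the demo's own code: the .obj filename is a made-up placeholder, the field of view is just the 20mm lens / 32mm sensor converted, and depending on axis conventions you may need to flip the mesh.

```python
import math
import torch
from pytorch3d.io import load_objs_as_meshes
from pytorch3d.renderer import (
    FoVPerspectiveCameras, MeshRasterizer, MeshRenderer, PointLights,
    RasterizationSettings, SoftPhongShader, TexturesVertex,
)

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Placeholder filename: use whatever .obj demo.py wrote for you.
mesh = load_objs_as_meshes(["0_mesh_sofa_0.994.obj"], device=device)
# The demo's .obj carries no texture, so give it flat white vertex colors.
mesh.textures = TexturesVertex(
    verts_features=torch.ones_like(mesh.verts_padded())
)

# A 20mm lens on a 32mm-wide sensor corresponds to FoV = 2*atan(16/20) ≈ 77.3°.
cameras = FoVPerspectiveCameras(
    device=device, fov=2.0 * math.degrees(math.atan(16.0 / 20.0))
)

renderer = MeshRenderer(
    rasterizer=MeshRasterizer(
        cameras=cameras,
        raster_settings=RasterizationSettings(image_size=512),
    ),
    shader=SoftPhongShader(
        device=device,
        cameras=cameras,
        lights=PointLights(device=device, location=[[0.0, 0.0, 0.0]]),
    ),
)

# The demo already placed the mesh at its chosen t_z along +Z, which is where
# PyTorch3D's default camera looks, so no extra transform should be needed
# (up to possible X/Y flips between conventions).
images = renderer(mesh)  # (1, 512, 512, 4) RGBA tensor
```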

Here is an example! When I run the demo on an input image (1st image), it recognizes the sofa (2nd image). I get a 3D shape prediction for the sofa which, after rendering in Blender with focal length f=20, gives the final result (3rd image).

[image: input]
[image: segmentation]
[image: rendered output]

@vadimkantorov commented Aug 5, 2021

@gkioxari Could you please share the Blender script (if you still have it) that you used to produce the third image? I also wonder whether it's possible to render the mesh onto the image using PyTorch3D directly. Still figuring this out (new to 3D)...

@vadimkantorov commented Aug 6, 2021

I've imported the "chair" mesh (that I obtained from running demo.py) into Blender, but the camera looks away from the object. Do I need to manually reset the camera position / orientation? If yes, what should it be (location, rotation angles, focal length, clip start/end)?

Thank you!

[screenshot: Blender viewport with the camera facing away from the imported mesh]

@vadimkantorov commented Aug 6, 2021

I am currently using: location (0, 0, 0), rotation (180°, 0°, 180°), focal length 20 mm, clip start/end 0.1 m / 100 m.
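In script form, this is what I'm setting (a minimal bpy sketch, assuming the default scene's "Camera" object; I'm not sure yet whether the 180° rotations are exactly right for the mesh's axis convention):

```python
import math
import bpy

cam = bpy.data.objects["Camera"]              # default scene camera
cam.location = (0.0, 0.0, 0.0)
cam.rotation_euler = (math.pi, 0.0, math.pi)  # (180°, 0°, 180°)

cam.data.lens = 20.0                          # focal length in mm (the demo's f)
cam.data.sensor_width = 32.0                  # the 32mm sensor the demo assumes
cam.data.clip_start = 0.1                     # metres
cam.data.clip_end = 100.0
```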

I'm still getting a gray render when I switch to camera view. Is it a problem with the lighting, or maybe clipping?

Here's the original image and mesh produced by MeshRCNN: chair3.zip

Thank you @gkioxari !

@vadimkantorov commented Aug 6, 2021

Hmm, enabling "Depth of Field" makes it show something. Not sure why "Depth of Field" is needed...

@vadimkantorov commented Aug 6, 2021

Now I get the render below. Does it make sense?

I'd welcome any advice on adjusting the camera/light parameters :) I'm a complete noob in Blender :(

[image: predicted chair mask]

[screenshot: the rendered result in Blender]
