
Generating 2d bounding boxes for the dataset-images, missing mesh_vertices_hdf5_file #18

Closed
yeshwanths88 opened this issue Mar 30, 2021 · 7 comments


@yeshwanths88

Hi, I was trying to generate the 2D bounding boxes for ai_001_001 using the script "dataset_generate_bounding_boxes.py".

The script fails with an assertion error. It requires the following files:

  • mesh_vertices.hdf5
  • mesh_faces_vi.hdf5
  • mesh_faces_oi.hdf5

I couldn't find these files in the downloaded dataset or in the checked-in code. Do I need to purchase the asset files to create the 2D bounding boxes?

Also, in issue #17 you mentioned that "re-rendering our dataset will cost roughly $51K USD". Could you please elaborate on what you mean by the re-rendering process? And do I have to pay for the re-rendering as well to generate the 2D bounding boxes for the existing images in your dataset?

@mikeroberts3000
Collaborator

Hi! Can you provide a bit more detail about what exactly you want?

If you only need 2D bounding boxes in image space, you can generate them yourself from the segmentation images we provide. If you need 3D bounding boxes in world space, where one axis is always aligned with the world-space gravity, we provide those too.

We don't provide the mesh files you're referring to. In order to obtain these files, you need to purchase the assets yourself. It costs about $6K to purchase the entire dataset, or roughly $15 per scene, and scenes can be purchased in small batches.

Rendering our entire dataset costs roughly $51K, and we describe our rendering pipeline in detail in our arXiv paper.

@yeshwanths88
Author

Hi @mikeroberts3000, thanks for your reply! As you mentioned, we only need 2D bounding boxes in image space. Do you already have a script to generate the bounding boxes from the segmentation images?

@mikeroberts3000
Collaborator

We don't provide code for this. Is there something preventing you from simply computing 2D bounding boxes directly from the segmentation masks?
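For example, something along the lines of the following sketch should work (illustrative only; the HDF5 file name and dataset key are assumptions about how you load the instance images, and unlabeled pixels are assumed to be marked with -1):

```python
# Minimal sketch: derive axis-aligned 2D bounding boxes from an instance
# segmentation image. The file name and dataset key are illustrative
# assumptions; adapt them to however you load the instance images.
import h5py
import numpy as np

def bounding_boxes_from_instance_image(instance_img):
    """Return {instance_id: (x_min, y_min, x_max, y_max)} for each labeled instance."""
    boxes = {}
    for instance_id in np.unique(instance_img):
        if instance_id < 0:
            continue  # assumed convention: negative IDs mark unlabeled pixels
        rows, cols = np.where(instance_img == instance_id)
        boxes[int(instance_id)] = (int(cols.min()), int(rows.min()),
                                   int(cols.max()), int(rows.max()))
    return boxes

# illustrative usage; the path and dataset name are assumptions
with h5py.File("frame.0000.semantic_instance.hdf5", "r") as f:
    instance_img = f["dataset"][:]
boxes = bounding_boxes_from_instance_image(instance_img)
```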

@americast

Hi @mikeroberts3000, I tried generating 2D bounding boxes using the semantic segmentation map, but I am running into issues because some annotations are missing. For instance, this is the semantic segmentation map of ai_038_006/images/scene_cam_00_geometry_preview/frame.0001.semantic.png:

[image: semantic segmentation map]

The original image looks like

[image: original rendered frame]

There are clearly many chairs in the scene, but they have not been labeled. Also, because of the chairs, the floor annotation in the semantic map has speckles here and there. As a result, the 2D bounding boxes I generate look odd.

Also, many images have overlapping annotations in the instance segmentation map, which is expected given the large number of objects in a scene. However, that leads to far too many bounding boxes, which also looks odd. I tried generating them here:
[image: generated 2D bounding boxes]

Do you have any suggestions on how I could obtain bounding box annotations suitable for training a detector?

@mikeroberts3000
Collaborator

mikeroberts3000 commented Jul 5, 2021

  • Most, but not all, of our pixels have semantic and instance annotations. In our paper, we make a note of this limitation, which arises because of the structure in some of our 3D assets. Our annotation pipeline assumes that 3D assets are grouped into low-level object parts, and we use our custom annotation tool to merge the parts into semantically meaningful objects and label them. However, sometimes the 3D assets are "under-segmented", i.e., a single (supposedly) low-level object part might already contain multiple objects spanning across multiple categories. In this case, our annotation pipeline has no way of splitting the under-segmented object part into multiple objects. Rather than group and label such assets incorrectly, we choose to leave them unlabeled.

  • To my eye, the floor in the image you posted looks correctly segmented. Why is this segmentation mask a problem in your application? Do you want to compute 2D bounding boxes in a way that ignores occlusions, i.e., do you want to compute "amodal" bounding boxes?

  • If there are lots of distinct objects in the scene, then it is expected that there will be lots of 2D bounding boxes. I don't know anything about your application, so I can't make filtering decisions on your behalf. You need to decide how to filter these bounding boxes in a way that makes sense for your application.
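For what it's worth, here is a toy example of one possible filtering heuristic, assuming `boxes` maps instance IDs to `(x_min, y_min, x_max, y_max)` tuples as in the earlier sketch; the area threshold is an arbitrary illustrative value, not a recommendation:

```python
# Illustrative only: drop boxes whose pixel area falls below a threshold.
def filter_small_boxes(boxes, min_area_px=100):
    kept = {}
    for instance_id, (x_min, y_min, x_max, y_max) in boxes.items():
        area = (x_max - x_min + 1) * (y_max - y_min + 1)
        if area >= min_area_px:
            kept[instance_id] = (x_min, y_min, x_max, y_max)
    return kept
```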

@americast

Thanks for your response @mikeroberts3000

To my eye, the floor in the image you posted looks correctly segmented. Why is this segmentation mask a problem in your application? Do you want to compute 2D bounding boxes in a way that ignores occlusions, i.e., do you want to compute "amodal" bounding boxes?

If there are lots of distinct objects in the scene, then it is expected that there will be lots of 2D bounding boxes. I don't know anything about your application, so I can't make filtering decisions on your behalf. You need to decide how to filter these bounding boxes in a way that makes sense for your application.

Yes, the floor image is correctly segmented, and I understand that a lot of distinct objects will come up and that this is expected. I wondered if there could be a different way of obtaining the 2D bounding boxes, such as converting them from the 3D bounding boxes. Would that be possible?

@mikeroberts3000
Collaborator

Yes, you can compute 2D bounding boxes by projecting each corner of each 3D bounding box into an image. We provide code to do this (see code/python/tools/scene_generate_images_bounding_box.py for details).

Using the 3D bounding boxes in this way will yield "amodal" 2D bounding boxes, i.e., bounding boxes that ignore occlusions. It is also worth mentioning that the 2D bounding boxes you compute this way will be conservative, in the sense that they will be the same size or larger than the tight 2D box you would get by projecting the object's full geometry (ignoring occlusions) into the image.
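For illustration, here is a minimal sketch of the corner-projection idea. The camera conventions below (a 4x4 world-to-camera matrix and a 3x3 pinhole intrinsics matrix `K`) are generic assumptions rather than the exact conventions used by our tools, and an axis-aligned world-space box is assumed for simplicity; for an oriented box you would enumerate its corners from its center, extents, and orientation instead.

```python
# Minimal sketch: project the 8 corners of a world-space 3D bounding box into
# an image and take the min/max in pixel coordinates. Camera conventions here
# are generic assumptions, not necessarily those used by the Hypersim tools.
import itertools
import numpy as np

def project_3d_box_to_2d(box_min, box_max, world_to_cam, K):
    """Return (x_min, y_min, x_max, y_max) of the projected 3D box corners."""
    # enumerate the 8 corners of the axis-aligned world-space box
    corners = np.array(list(itertools.product((box_min[0], box_max[0]),
                                              (box_min[1], box_max[1]),
                                              (box_min[2], box_max[2]))))
    # transform the corners into camera space (homogeneous coordinates)
    corners_h = np.hstack([corners, np.ones((8, 1))])
    cam = (world_to_cam @ corners_h.T)[:3]  # 3 x 8
    # perspective projection onto the image plane
    # (corners behind the camera would need to be clipped before this divide)
    pix = K @ cam
    pix = pix[:2] / pix[2]
    return (pix[0].min(), pix[1].min(), pix[0].max(), pix[1].max())
```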
