getting the mask of first frame without using XMem or SAM as a preprocessing #51

monajalal · 2024-04-11T17:43:44Z

This is a followup question.

Is there a way to make FoundationPose work with only the 2D bounding box of he object of interest?

Has anyone streamlined it so that there was no need for providing the mask of the first frame as a pre-requisite?

Also, I am not sure how I can provide the pre-req by clicking on one single point on the object. Can someone please walk me through?

wenbowen123 · 2024-04-11T18:57:33Z

Is there a way to make FoundationPose work with only the 2D bounding box of he object of interest?

Yes, you can convert the bbox to a segmentation mask and run the same way. It will work fine. To convert, make the pixels inside the box >0 and background==0.

monajalal · 2024-04-12T14:49:26Z

Thanks for your response. Could you please clarify this or please link me to a reference? Any chance you may be able to provide an example of this?

Yes, you can convert the bbox to a segmentation mask and run the same way. It will work fine. To convert, make the pixels inside the box >0 and background==0.

monajalal · 2024-04-12T14:50:13Z

Do you expect the performance to drop if I use 2D bbox instead of segmentation mask?

wenbowen123 · 2024-04-12T17:19:32Z

Suppose your bbox is [umin, vmin, umax, vmax]

mask = np.zeros((height, width), dtype=bool)
mask[vmin:vmax, umin:umax] = 1

wenbowen123 · 2024-04-12T17:19:55Z

no, it should work as good as the segmentation. I've tried this many times.

monajalal · 2024-04-15T18:53:14Z

@wenbowen123
Thanks a lot for your guidance. I just wanted to confirm I was able to perform FoundationPose with only 2D bbox of first frame in yolox format and converting it to binary mask.

wenbowen123 · 2024-04-16T04:17:24Z

yes

abhishekmonogram · 2024-04-25T15:28:09Z

@wenbowen123 In this case, the generated mask will be completely white. Will it still work?

wenbowen123 · 2024-04-25T17:56:36Z

@wenbowen123 In this case, the generated mask will be completely white. Will it still work?

the area inside the 2D box will be all white, yes, this will be fine.

monajalal closed this as completed Apr 12, 2024

wenbowen123 mentioned this issue Apr 25, 2024

How can I obtain an initial mask? #88

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

getting the mask of first frame without using XMem or SAM as a preprocessing #51

getting the mask of first frame without using XMem or SAM as a preprocessing #51

monajalal commented Apr 11, 2024

wenbowen123 commented Apr 11, 2024

monajalal commented Apr 12, 2024 •

edited

monajalal commented Apr 12, 2024

wenbowen123 commented Apr 12, 2024

wenbowen123 commented Apr 12, 2024

monajalal commented Apr 15, 2024

wenbowen123 commented Apr 16, 2024 •

edited

abhishekmonogram commented Apr 25, 2024

wenbowen123 commented Apr 25, 2024

getting the mask of first frame without using XMem or SAM as a preprocessing #51

getting the mask of first frame without using XMem or SAM as a preprocessing #51

Comments

monajalal commented Apr 11, 2024

wenbowen123 commented Apr 11, 2024

monajalal commented Apr 12, 2024 • edited

monajalal commented Apr 12, 2024

wenbowen123 commented Apr 12, 2024

wenbowen123 commented Apr 12, 2024

monajalal commented Apr 15, 2024

wenbowen123 commented Apr 16, 2024 • edited

abhishekmonogram commented Apr 25, 2024

wenbowen123 commented Apr 25, 2024

monajalal commented Apr 12, 2024 •

edited

wenbowen123 commented Apr 16, 2024 •

edited