-
Notifications
You must be signed in to change notification settings - Fork 120
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
getting the mask of first frame without using XMem or SAM as a preprocessing #51
Comments
Yes, you can convert the bbox to a segmentation mask and run the same way. It will work fine. To convert, make the pixels inside the box >0 and background==0. |
Thanks for your response. Could you please clarify this or please link me to a reference? Any chance you may be able to provide an example of this? Yes, you can convert the bbox to a segmentation mask and run the same way. It will work fine. To convert, make the pixels inside the box >0 and background==0. |
Do you expect the performance to drop if I use 2D bbox instead of segmentation mask? |
Suppose your bbox is [umin, vmin, umax, vmax]
|
no, it should work as good as the segmentation. I've tried this many times. |
@wenbowen123 |
yes |
@wenbowen123 In this case, the generated mask will be completely white. Will it still work? |
the area inside the 2D box will be all white, yes, this will be fine. |
This is a followup question.
Is there a way to make FoundationPose work with only the 2D bounding box of he object of interest?
Has anyone streamlined it so that there was no need for providing the mask of the first frame as a pre-requisite?
Also, I am not sure how I can provide the pre-req by clicking on one single point on the object. Can someone please walk me through?
The text was updated successfully, but these errors were encountered: