Skip to content
This repository has been archived by the owner on Jul 1, 2024. It is now read-only.

SOTA Model for Text Prompt Segmentation #575

Open
xiaobanni opened this issue Sep 26, 2023 · 7 comments
Open

SOTA Model for Text Prompt Segmentation #575

xiaobanni opened this issue Sep 26, 2023 · 7 comments

Comments

@xiaobanni
Copy link

I am looking for a state-of-the-art (SOTA) model for text prompt segmentation. Currently, I am aware of two choices: Grounded-Segment-Anything and SEEM. However, both of these models fail to meet my requirements.

Consider the following example: I want the model to segment the lane lines, but the results from the aforementioned methods are as follows (i hope they can segment the lane line in the road):

Grounded-Segment-Anything:
image

SEEM Model:
image

Unfortunately, neither of them can solve this problem effectively. I would greatly appreciate any recommendations you may have.

Any information regarding the timeline for the release of SAM text-prompt capabilities would be welcome.

@emi-dm
Copy link

emi-dm commented Sep 26, 2023

I recommend you this: https://github.com/luca-medeiros/lang-segment-anything

@xiaobanni
Copy link
Author

Thank you for the recommendation. However, I have tried it and found that it is just an easier-to-read version of Grounded-Segment-Anything. It uses the same method of using GroundingDINO to translate the text prompt to a box prompt and then sending it to SAM, resulting in similar outcomes to the Grounded-Segment-Anything mentioned earlier. I believe that an oriented text prompt segment model (rather than the two-stage invoking) is necessary to address the issue at hand and facilitate broader downstream applications.

@TerryYiDa
Copy link

I am looking for a state-of-the-art (SOTA) model for text prompt segmentation. Currently, I am aware of two choices: Grounded-Segment-Anything and SEEM. However, both of these models fail to meet my requirements.

Consider the following example: I want the model to segment the lane lines, but the results from the aforementioned methods are as follows (i hope they can segment the lane line in the road):

Grounded-Segment-Anything: image

SEEM Model: image

Unfortunately, neither of them can solve this problem effectively. I would greatly appreciate any recommendations you may have.

Any information regarding the timeline for the release of SAM text-prompt capabilities would be welcome.

Do you have any good solutions? I'm facing the same problem now

@xiaobanni
Copy link
Author

@TerryYiDa No. So, I hope this issue can track the progress of the advanced text-prompt segmentation model.

@iacopo97
Copy link

I have the same problem, do you find a solution?

@YuetianW
Copy link

Lol, really wish it was possible to open up the ability to use text prompts . A two-stage approach like Grounded-Segment-Anything is neither useful nor elegant.😣

@muhammadsr
Copy link

Anyone made progress with this issue?

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants