
feat: Segment Anything Model Integration #253

Merged: 20 commits into HumanSignal:master on May 10, 2023

Conversation

@shondle (Contributor) commented Apr 19, 2023

This adds the ability to use Facebook's Segment Anything Model (SAM) with Label Studio.

Users can place a smart keypoint to generate a brush annotation with SAM for any object in an image, then adjust it using the tools already provided in Label Studio. This makes it much easier to create annotations quickly for any image segmentation use case.

For this, I created an ML backend that takes the image and the smart keypoint a user places in Label Studio, and uses them to generate a mask for the selected object with SAM. The mask is then converted to RLE and passed back to Label Studio to be rendered as a brush annotation.

This has been tested in a computing cluster environment and with a CPU on a local machine (after adjusting the device parameter).

Example of a user selecting a class and the object being auto-annotated using SAM
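
For readers following along, here is a minimal sketch of the flow described above (keypoint in, brush RLE out). It is an illustration under assumptions, not the PR's exact code: the checkpoint path, device, click coordinates, and label/control names are placeholders.

import numpy as np
from PIL import Image
from segment_anything import sam_model_registry, SamPredictor
from label_studio_converter import brush  # RLE encoder for brush annotations

sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")
sam.to("cuda")  # set to "cpu" on a local machine, as noted above
predictor = SamPredictor(sam)

image = np.array(Image.open("example.jpg").convert("RGB"))
predictor.set_image(image)

x, y = 320, 240  # pixel coordinates of the smart keypoint placed in Label Studio
masks, scores, _ = predictor.predict(
    point_coords=np.array([[x, y]]),
    point_labels=np.array([1]),  # 1 marks a foreground click
    multimask_output=False,
)

rle = brush.mask2rle(masks[0].astype(np.uint8) * 255)  # mask -> RLE
result = {  # one brush prediction in Label Studio's result format
    "original_width": image.shape[1],
    "original_height": image.shape[0],
    "type": "brushlabels",
    "from_name": "tag",  # placeholder control/object names
    "to_name": "image",
    "value": {"format": "rle", "rle": rle, "brushlabels": ["Object"]},
}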

@Aciid commented Apr 19, 2023

Hi, thanks for the awesomeness!
Quick suggestion: could this be modified to work like Roboflow's Segment Anything integration? Re-clicking anywhere adds to the same segment, and re-clicking again removes from the segment.

Here are a few sample pictures. If you have not tried the one in Roboflow, it probably uses segment layers that are invisible to the user interface to handle this.
[Sample pictures: seg1, seg2]

This makes segmenting really quick rather than tedious. Choosing when to add/subtract from a segment, or when to create a new label, could of course use a modifier key such as Ctrl, or a GUI switch.

Thanks for the awesomeness!

@iraadit commented Apr 24, 2023

Hi @Aciid and @shondle

Thank you for your work @shondle !

In the https://www.trainyolo.com implementation, they use the following (see the sketch after this list):

  • left click to add to a segmentation
  • right click to remove from this segmentation
  • space to validate the segmentation (and start a new one)
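
For what it's worth, SAM's own predictor supports exactly this accumulation pattern: each click is a point labeled 1 (add) or 0 (remove), and all points are passed together on every prediction. A minimal sketch, reusing the `predictor` from the sketch above (the click handler itself is hypothetical):

import numpy as np

# Accumulated clicks for the current segment; label 1 adds, label 0 removes.
points, labels = [], []

def on_click(x, y, positive):
    """Hypothetical handler: re-predict the segment with every click so far."""
    points.append([x, y])
    labels.append(1 if positive else 0)
    masks, _, _ = predictor.predict(
        point_coords=np.array(points),
        point_labels=np.array(labels),
        multimask_output=False,
    )
    return masks[0]  # updated mask for the same segment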

@JaneX8 commented Apr 24, 2023

I can't wait to try this. This would bring the tool to a whole new level. :)

@vansin commented Apr 25, 2023

Thanks for contributing to the community!

We integrated it into OpenMMLab PlayGround; you are welcome to star the repo.

It supports Point2Label and BBox2Label:

https://github.com/open-mmlab/playground/tree/main/label_anything

[Demo GIF: SAM10]

@shondle (Contributor, Author) commented Apr 25, 2023

@vansin, that looks awesome! All I ask is that you credit my GitHub username for borrowing some of the code.

Also, I added a few more features that should lead to faster inference and a background eraser capability.

[Demo GIFs: fasterinference, labeleraser]

@vansin commented Apr 26, 2023

@vansin, that looks awesome! All I ask is that you credit my GitHub username for borrowing some of the code.

Also, I added a few more features that should lead to faster inference and a background eraser capability.

I added some information about this earlier:
[screenshot]

No problem, I will add your GitHub username in OpenMMLab PlayGround.

By the way, would you like to contribute code to OpenMMLab PlayGround?

https://github.com/open-mmlab/playground/tree/main/label_anything

@erinmikailstaples commented

Checking in on this — @KonstantinKorotaev, is there anything I can do to help get this across the line? 😊

@davidblom603 commented

+1 extremely interested in this

@shondle shondle changed the title Segment Anything Model Integration feat: Segment Anything Model Integration May 4, 2023
@ZhangYuef (Contributor) commented May 6, 2023

For "Eraser" feature: In my opinion, it should be used in adding and erasing smart keypoints for a same target label item.

@iraadit commented May 10, 2023

Hi,
When will this be merged?
Thanks a lot for the work!

@KonstantinKorotaev KonstantinKorotaev merged commit cb98234 into HumanSignal:master May 10, 2023
2 of 3 checks passed
@makseq (Member) commented May 11, 2023

@shondle could you please check the pytests and try to fix them in a new PR? It seems they timed out because the model took too long to load.

@croche2574 commented

Could you add functionality to convert bounding boxes to segmentations? This would be extremely useful for converting detection datasets to segmentation datasets for YOLO models.

@shondle (Contributor, Author) commented May 15, 2023

Could you add functionality to convert bounding boxes to segmentations? This would be extremely useful for converting detection datasets to segmentation datasets for YOLO models.

Here I added the ability to create the mask with a smart rectangle label. For converting an existing dataset, you would need to change the box input (x, y, width, and height) by gathering it from the tasks (what is already annotated on the image) instead of from the kwargs; a sketch follows below. Then you could select all of the images and send them through the model.

I am unsure about this, but it may be better (for faster inference over a large set of images) to do this with the PyTorch model from this commit instead of the ONNX model I referenced earlier.
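
A rough sketch of what that could look like, reading the rectangle from a task's existing annotation instead of from the kwargs. The helper is hypothetical; the field names follow Label Studio's rectangle result format, where x, y, width, and height are percentages:

import numpy as np

def box_from_task(task, img_w, img_h):
    """Hypothetical helper: convert an existing rectangle annotation to a SAM box prompt."""
    value = task["annotations"][0]["result"][0]["value"]  # percentages, per Label Studio
    x1 = value["x"] / 100 * img_w
    y1 = value["y"] / 100 * img_h
    x2 = x1 + value["width"] / 100 * img_w
    y2 = y1 + value["height"] / 100 * img_h
    return np.array([x1, y1, x2, y2])

# Then, for each task:
# masks, _, _ = predictor.predict(box=box_from_task(task, w, h), multimask_output=False)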

@shondle (Contributor, Author) commented May 15, 2023

@shondle could you please check the pytests and try to fix them in a new PR? It seems they timed out because the model took too long to load.

I have tried to fix it, but I am running into issues with Docker (which I do not have much experience with). I fixed a few issues before this PR was merged, but was unable to figure out a fix for the timeout. If anyone with more Docker experience is able to fix this issue and get it over the hump, I would appreciate it. Otherwise, I will try to get back to this later.
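
In case it helps whoever picks this up: one common cause of that kind of timeout is loading the SAM checkpoint at import time, so the container or test runner stalls before anything runs. A minimal sketch of deferring the load until first use (illustrative, not the PR's code; checkpoint path is a placeholder):

_PREDICTOR = None

def get_predictor():
    """Load SAM lazily so importing the backend stays fast."""
    global _PREDICTOR
    if _PREDICTOR is None:
        from segment_anything import sam_model_registry, SamPredictor
        sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")
        _PREDICTOR = SamPredictor(sam)
    return _PREDICTOR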

@Cupcee commented May 25, 2023

Hi, I tried installing this and everything seemed to go smoothly until I tried to get a prediction:

...
def predict(self, tasks, **kwargs):
    """Returns the predicted mask for a smart keypoint that has been placed."""

    # Use this to check times for your predictions
    # print(f"Current date and time: {str(datetime.datetime.now())}")

    results = []
    predictions = []
    predictor = PREDICTOR

    image_url = tasks[0]["data"]["image"]
    print(f"the kwargs are {kwargs}")
    print(f"the tasks are {tasks}")

    # getting the height and width of the image that you are annotating in real time
    height = kwargs["context"]["result"][0]["original_height"]  # ERROR
    ...

It fails when getting the context from predict's kwargs. Context is None for me.

the kwargs are {'login': None, 'password': None, 'context': None}

Any idea what is going wrong?
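
A guard along these lines (illustrative, not from the PR) would fail gracefully instead of crashing when the context is missing, since the context key is only populated for interactive preannotation requests:

# Illustrative guard for the failing line shown above.
context = kwargs.get("context")
if not context or not context.get("result"):
    return []  # no keypoint placed yet, nothing to predict
height = context["result"][0]["original_height"]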

@KonstantinKorotaev (Contributor) commented

Any idea what is going wrong?

Did you add the ML backend as interactive?

@Cupcee commented May 25, 2023

Did you add the ML backend as interactive?

If you mean this switch, it's checked:

[Screenshot 2023-05-25 at 12:34:52]

@KonstantinKorotaev (Contributor) commented

If you mean this switch, it's checked:

Great!
Do you have Webhooks enabled?

@Cupcee commented May 25, 2023

Do you have Webhooks enabled?

How do I check if they are enabled and how do I enable them? Are they not enabled by default? The installation instructions mentioned nothing about this.

EDIT: I have this:

[Screenshot 2023-05-25 at 21:34:56]

@KonstantinKorotaev (Contributor) commented

Please deactivate the option "Show predictions to annotators in the Label Stream and Quick View" in the Machine Learning settings for your project and check one more time.

@Marouene-Oueslati commented

The ML backend HTTP address seems not to work properly. Any fix for the SAM backend?

@aiyou9 commented Jun 6, 2023

Could you add functionality to convert bounding boxes to segmentations? This would be extremely useful for converting detection datasets to segmentation datasets for YOLO models.

I am also looking forward to this feature, especially in conjunction with this project:

https://github.com/facebookresearch/ov-seg (demo: https://huggingface.co/spaces/facebook/ov-seg)

In this test example, vietanhdev/anylabeling#89, the labels on other cells in the image can be manually corrected to the cells that need to be labeled and converted into instance segmentation, which greatly improves labeling efficiency.
[screenshot]

Another project:
https://github.com/luca-medeiros/lang-segment-anything

So image annotation could be combined with a text prompt.
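
As a rough illustration of that combination, based on lang-segment-anything's documented example (treat the exact signature as an assumption to verify against the repo; the image path and prompt are placeholders):

from PIL import Image
from lang_sam import LangSAM

model = LangSAM()  # loads GroundingDINO + SAM weights
image_pil = Image.open("cells.png").convert("RGB")  # placeholder image
# The text prompt drives the segmentation; returns masks plus matched phrases.
masks, boxes, phrases, logits = model.predict(image_pil, "cell")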

@jackmead515 commented

Hi, I tried installing this and everything seemed to go smoothly until I tried to get a prediction: [...] It fails when getting the context from predict's kwargs. Context is None for me. [...] Any idea what is going wrong?

Bumping this issue up. Seems like this is a persistent issue on my end as well. Is there a solution to this?

@DimIsaev commented

Why is this problem not solved? It has been raised 5-6 times and is still not solved!
Only messages saying it will be considered sometime...

@shondle (Contributor, Author) commented Jul 18, 2023

Why is this problem not solved? It has been raised 5-6 times and is still not solved!

Only messages saying it will be considered sometime...

Hi, are you using the smart keypoint tool on the toolbar while selecting one of the bottom two labels provided in the labeling config in the README?

Other things to check are whether Auto Annotation and Auto accept annotation are turned on in the image tab before you use the smart keypoint with one of the bottom labels to make a prediction.

Make sure "Use for interactive preannotations" is toggled on when you add your model. In the Machine Learning tab, make sure only the bottom toggle, "Show predictions to annotators in the Label Stream and Quick View", is activated.

[screenshot]

@DimIsaev commented

Yes, I have this problem:

[Screenshot: 788]

@shondle (Contributor, Author) commented Jul 18, 2023

Yes, I have this problem:

Once the smart keypoint is selected, can you click off to the side of the screen to unselect it? If that doesn't keep it selected, and you need to toggle through the options, just keep clicking the purple box in the toolbar, not the sidebar.

@DimIsaev commented

Once the smart keypoint is selected, can you click off to the side of the screen to unselect it? If that doesn't keep it selected, and you need to toggle through the options, just keep clicking the purple box in the toolbar, not the sidebar.

[Screenshot: 789]

  1. It works illogically, or I don't understand the logic.

  2. But what about this error? It is more important:
    feat: Segment Anything Model Integration #253 (comment)

Thanks
