Add an initial Segformer implementation #1617

jimexist · 2024-01-24T05:31:28Z

This is a port of the Hugging Face PyTorch model.

It is inference-only, so I omitted the training-only parts such as dropout.

for image classification with `num_labels`

The model outputs a tensor of shape `[batch_size, num_labels]`, where the last dimension holds the logits.

for image segmentation with `num_labels`, `height` and `width`

The model outputs a tensor of shape `[batch_size, num_labels, height // 4, width // 4]`, holding the logits for each 4x4-pixel patch of the input.
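To illustrate how the segmentation output could be turned into a per-pixel mask, here is a minimal sketch in plain Python. It is not the code from this PR: the helper names `argmax_label_map` and `upsample_nearest` are hypothetical, the toy logits are made up, and nearest-neighbor upsampling by a factor of 4 is an assumption chosen for simplicity (other pipelines may interpolate the logits before taking the argmax).

```python
def argmax_label_map(logits):
    """Pick the best label per location.

    logits: nested list shaped [num_labels][h][w] (one batch element).
    Returns a nested list shaped [h][w] of label indices.
    """
    num_labels = len(logits)
    h, w = len(logits[0]), len(logits[0][0])
    return [
        [max(range(num_labels), key=lambda c: logits[c][y][x]) for x in range(w)]
        for y in range(h)
    ]


def upsample_nearest(label_map, factor=4):
    """Repeat every label `factor` times along both axes (assumed strategy)."""
    out = []
    for row in label_map:
        wide = [v for v in row for _ in range(factor)]
        out.extend(list(wide) for _ in range(factor))
    return out


# Toy input: 2 labels on a 1x2 low-resolution grid, i.e. a 4x8 image.
logits = [
    [[0.1, 2.0]],   # scores for label 0
    [[1.5, -0.3]],  # scores for label 1
]
label_map = argmax_label_map(logits)
full_mask = upsample_nearest(label_map, factor=4)
```

With these toy values, `label_map` has one row and two columns, and `full_mask` has 4 rows of 8 labels, matching the `height // 4, width // 4` relationship described above.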

Classification example

This should give you the hamburger label:

classification logits [3.275261e-5, 0.0008562019, 0.0008868563, 0.9977506, 0.0002465068, 0.0002241473, 2.846596e-6]
label: hamburger
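The printed values sum to approximately 1, so they appear to be softmax outputs rather than raw logits. The sketch below shows how raw logits could be mapped to such scores and then to a label. It is an illustration only: the `softmax` helper, the `raw_logits` values, and the one-entry `id2label` mapping are assumptions, not taken from the PR's code.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a flat list of logits."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Scores copied from the example output above.
scores = [3.275261e-5, 0.0008562019, 0.0008868563, 0.9977506,
          0.0002465068, 0.0002241473, 2.846596e-6]

# Index of the highest score.
best = max(range(len(scores)), key=scores.__getitem__)

# Hypothetical mapping: only the winning index is labeled, per the output above.
id2label = {3: "hamburger"}
label = id2label.get(best, f"label_{best}")

# Made-up raw logits, to show the softmax step itself.
raw_logits = [0.0, 1.0, 5.0]
probs = softmax(raw_logits)
```

The `id2label.get(...)` fallback is one way to handle a config without label names, which becomes relevant since the field is optional.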

Segmentation example

input image (url is https://huggingface.co/datasets/hf-internal-testing/fixtures_ade20k/resolve/main/ADE_val_00000001.jpg):

output segmentation image:

LaurentMazare · 2024-02-03T13:24:44Z

This looks like a neat addition, thanks for working on this.
Would you mind adding a couple of sample inputs and outputs to the PR to illustrate what to expect from the model?

jimexist · 2024-02-19T03:46:17Z

@LaurentMazare I added some to the PR description.

LaurentMazare · 2024-02-25T19:57:31Z

Apologies if I misread your update, but I was expecting some example images together with the segmentation masks generated by the model.

jimexist · 2024-03-01T15:05:02Z

Sample image added and README updated.

jimexist · 2024-03-01T15:06:57Z

Segmentation image added.

LaurentMazare · 2024-03-02T16:36:54Z

Actually, I meant adding some details to the PR rather than to the repo. It's a bit tricky to add more image files to the repo, as it's already on the large side and we would prefer to limit how much more we add. Could you remove the image files and replace them with some wget commands in the README?

jimexist · 2024-03-03T14:24:18Z

Ah, I see. I have included the image in the pull request itself, updated the README accordingly, and removed the image files from the git history.

LaurentMazare · 2024-03-03T15:01:51Z

Thanks!

LaurentMazare · 2024-03-03T15:14:09Z

Also, I would encourage you to advertise this new model on reddit/X/... or via some form of blog post. It would be great to attract some attention to it, as it seems like a potentially pretty useful model and it can be nicely illustrated with your sample pics.

jimexist force-pushed the add-segformer branch from 35874e1 to 0418879 Compare January 24, 2024 17:23

jimexist marked this pull request as ready for review January 24, 2024 17:23

jimexist force-pushed the add-segformer branch 3 times, most recently from 1944792 to 513ae54 Compare February 3, 2024 01:38

jimexist force-pushed the add-segformer branch 2 times, most recently from 56fe84e to 7862d0c Compare March 1, 2024 14:34

jimexist force-pushed the add-segformer branch from 429a39a to cf33bf2 Compare March 1, 2024 15:35

jimexist force-pushed the add-segformer branch from ec728a4 to 79fe76a Compare March 3, 2024 14:18

add segformer

6595922

jimexist force-pushed the add-segformer branch from 4c51d7a to 6595922 Compare March 3, 2024 14:24

Make the id2label field optional.

87337f8

LaurentMazare approved these changes Mar 3, 2024

LaurentMazare merged commit 924ccae into huggingface:main Mar 3, 2024
10 checks passed

jimexist deleted the add-segformer branch March 4, 2024 00:00
