[FEATURE] Segment Anything backbones

Hello,

Maybe it's out of the scope of timm so far, but it could be interestint to add support for the 3 models released by Meta (ViT-B/L/H) in their [Segment Anything](https://github.com/facebookresearch/segment-anything/tree/main) project. These models are pretrained using MAE and then finetuned on 11M images for segmentation. An interesting feature is that these models have been trained with an image size of 1024x1024. The ViT are implemented in pure pytorch [here](https://github.com/facebookresearch/segment-anything/blob/main/segment_anything/modeling/image_encoder.py) without using `scaled_dot_product_attention`. Curious to see how these models transfer to classification tasks :) 

Thanks,
Simon

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[FEATURE] Segment Anything backbones #1757

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

[FEATURE] Segment Anything backbones #1757

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions