[FEATURE] Segment Anything backbones #1757

@SimJeg

Hello,

Maybe it's out of scope for timm so far, but it could be interesting to add support for the three models released by Meta (ViT-B/L/H) in their Segment Anything project. These models are pretrained with MAE and then fine-tuned on 11M images for segmentation. An interesting feature is that they were trained with an image size of 1024x1024. The ViTs are implemented in pure PyTorch here, without using scaled_dot_product_attention. Curious to see how these models transfer to classification tasks :)
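
To give a rough sense of why the 1024x1024 input size and the lack of scaled_dot_product_attention matter together, here is a back-of-the-envelope sketch. The 16x16 patch size, 12 heads, and fp16 scores are assumptions for illustration, not something stated above, and the estimate assumes full global attention over all tokens, ignoring any windowing the actual encoder may use.

```python
# Rough estimate of the attention score matrix a non-fused attention
# implementation materializes per image and per layer.
# Assumptions (illustrative only): 16x16 patches, 12 heads, 2-byte (fp16) scores,
# global attention over every token.

def attention_cost(image_size, patch_size=16, num_heads=12, bytes_per_value=2):
    """Return (token count, bytes needed for one layer's full score matrix)."""
    tokens_per_side = image_size // patch_size
    num_tokens = tokens_per_side ** 2
    # One (num_tokens x num_tokens) score matrix per head.
    score_bytes = num_heads * num_tokens * num_tokens * bytes_per_value
    return num_tokens, score_bytes


for size in (224, 1024):
    tokens, score_bytes = attention_cost(size)
    print(f"{size}x{size}: {tokens} tokens, "
          f"{score_bytes / 2**20:.1f} MiB of attention scores per layer")
```

Under these assumptions the token count grows from 196 to 4096, so the score matrix grows by a factor of several hundred, which is where a fused attention kernel would likely help most.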

Thanks,
Simon

Labels: enhancement (New feature or request)
