Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEATURE] Support EfficientViT #1815

Closed
Randl opened this issue May 18, 2023 · 7 comments
Closed

[FEATURE] Support EfficientViT #1815

Randl opened this issue May 18, 2023 · 7 comments
Labels
enhancement New feature or request help wanted Extra attention is needed

Comments

@Randl
Copy link

Randl commented May 18, 2023

Add models from
EfficientViT: Memory Efficient Vision Transformer with Cascaded Group Attention https://arxiv.org/abs/2305.07027
https://github.com/microsoft/Cream/tree/main/EfficientViT

Looks like fast and high-quality models, would be nice to have them in timm

@Randl Randl added the enhancement New feature or request label May 18, 2023
@rwightman
Copy link
Collaborator

rwightman commented May 18, 2023

@Randl I noticed that one, also related are their mini/tiny vit. Look like reasonable arch, blend of LeViT / EfficientFormer w/ Swin and other ideas. BUT, they all need fairly extensive refactoring w/ checkpoint mapping to fit timm, get feat extraction working, etc so not currently something I have bandwidth for.

If anyone wants to tackle this or the others, criteria for accepting:

@rwightman rwightman added the help wanted Extra attention is needed label May 18, 2023
@seefun
Copy link
Contributor

seefun commented Aug 2, 2023

I tried to implement EfficientViT (MSRA) in here: #1894

There is another job with the same name before this EfficientViT, which looks good and has also been added.

@youssefadr
Copy link

youssefadr commented Aug 2, 2023

Hello @rwightman, is there a model I can work on? I would be very happy to contribute!

@rwightman
Copy link
Collaborator

@youssefadr this one is done, I currently have MobileOne, FastViT and Inception_neXt (and a prototype underway... but #1842 (FasterViT) hasn't been tackled, although it looks like it's an easy adaptation, I'd want the downsamples moved, and some other aspects cleaned up, so that can be a bit of fun...

@youssefadr
Copy link

youssefadr commented Sep 3, 2023

Thank you for the answer, I will take a look at FasterViT soon 👍

@Kaschi14
Copy link

Kaschi14 commented Oct 6, 2023

In the EfficientVit https://github.com/mit-han-lab/efficientvit there are weights for higher res models which are currently not supported in timm. Is it planned to include the higher res (1024x2048) Models in Timm as well? :)

@seefun
Copy link
Contributor

seefun commented Oct 6, 2023

@Kaschi14 I noticed the origin repo added the L-series efficientvit weights and the SAM distillation weights, but the higher res (1024x2048) models is from cityscape segmentation models.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

5 participants