
Efficientformer #20459

Merged: 34 commits into huggingface:main on Jan 20, 2023
Conversation

@Bearnardd (Contributor) commented Nov 25, 2022

This PR adds EfficientFormer, a model with latency comparable to MobileNets that achieves better accuracy on ImageNet. It is based on the closed PR #18296.

Paper: https://arxiv.org/abs/2206.01191
Code and weights: https://github.com/snap-research/EfficientFormer

Fixes #18041
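
For context, a minimal inference sketch of the model this PR adds; the checkpoint name is assumed from the snap-research release and is not part of this PR's diff:

    # Hedged usage sketch; checkpoint name assumed, not taken from this PR.
    import requests
    import torch
    from PIL import Image
    from transformers import AutoImageProcessor, EfficientFormerForImageClassification

    image = Image.open(
        requests.get("http://images.cocodataset.org/val2017/000000039769.jpg", stream=True).raw
    )

    checkpoint = "snap-research/efficientformer-l1-300"  # assumed checkpoint name
    processor = AutoImageProcessor.from_pretrained(checkpoint)
    model = EfficientFormerForImageClassification.from_pretrained(checkpoint)

    inputs = processor(images=image, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits
    # Highest-scoring ImageNet label
    print(model.config.id2label[logits.argmax(-1).item()])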

Who can review?

@alaradirik @NielsRogge

@alaradirik (Contributor)
Hey @Bearnardd, thank you for working on this! Could you run make fixup to fix the failed style and code quality tests?

Also, type annotations on function arguments (e.g. def something(arg1: torch.Tensor):) cause errors when the type comes from a conditionally imported library (torch); you can see the failed test logs under the CI test details. Could you remove those annotations from test_modeling_efficientformer.py?
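
To illustrate the failure mode: annotations are evaluated when the function is defined, so a bare torch.Tensor annotation raises a NameError at import time if torch was never imported. A hedged sketch (the function name is illustrative); the fix requested here is simply to drop the annotation, though quoting it also defers evaluation:

    # Sketch of the failure mode; `something` is an illustrative name.
    from transformers.utils import is_torch_available

    if is_torch_available():
        import torch

    # Evaluated at definition time, so this raises NameError on import
    # when torch is unavailable:
    # def something(arg1: torch.Tensor): ...

    # Dropping the annotation fixes it; quoting it would also work,
    # since string annotations are not evaluated at import time:
    def something(arg1: "torch.Tensor"):
        return arg1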

@alaradirik mentioned this pull request on Nov 29, 2022.
@alaradirik (Contributor) left a comment

Thanks for working on this @Bearnardd! I left a couple of comments to fix inconsistencies and follow best practices. Most of them are easy to address; the main issues are:

  • EfficientFormerLastStage could use some restructuring to get rid of nn.Sequential (see the sketch after this list).
  • test_feature_extraction_efficientformer.py is missing (it can be copied from ViT).
  • EfficientFormerForImageClassificationWithTeacher uses a frozen external model (RegNetY-16GF) for distillation; I'm not sure the implementation is accurate, as it consists of a single dense layer that is trained alongside the model.
  • Some common modeling tests (defined in tests/test_modeling_common.py) are failing; you can run all of them with:
    pytest tests/models/efficientformer/test_modeling_efficientformer.py
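
On the first bullet, a hedged sketch of the kind of restructuring meant here (module and block names are illustrative, not the PR's actual code): named submodules plus an explicit forward pass make it straightforward to collect intermediate hidden states, which nn.Sequential hides.

    import torch
    from torch import nn

    class EfficientFormerLastStageSketch(nn.Module):
        # Illustrative restructuring only: explicit blocks instead of
        # nn.Sequential, so intermediate outputs can be surfaced.

        def __init__(self, dim: int, num_blocks: int):
            super().__init__()
            # Placeholder blocks; the real stage uses meta/attention blocks.
            self.blocks = nn.ModuleList([nn.Linear(dim, dim) for _ in range(num_blocks)])

        def forward(self, hidden_states: torch.Tensor):
            all_hidden_states = ()
            for block in self.blocks:
                hidden_states = block(hidden_states)
                # Every intermediate output is now returnable, which the
                # common output_hidden_states tests expect.
                all_hidden_states = all_hidden_states + (hidden_states,)
            return hidden_states, all_hidden_states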

Hope this helps :)

(Resolved review threads on src/transformers/__init__.py, README_ko.md, docs/source/en/index.mdx, and docs/source/en/model_doc/efficientformer.mdx ×2.)
@Bearnardd (Contributor, Author)
Hi @alaradirik - thank you very much for the detailed review! I will address the changes shortly :) I am aware of the failing tests, but I am not entirely sure how to count the expected number of attentions and hidden layers for this particular model, since it does not have a "standard" transformer-based architecture. I will address the current comments first, and as a next step ask you some questions about the expected attention and hidden-state outputs.

@alaradirik (Contributor)
Hey @Bearnardd, no problem at all! We define our own small model architecture within the test_modeling_efficientformer.py file, as it is faster to test with a small dummy model. You just need to check the num_hidden_layers and num_attention_heads attributes of the test class to see the expected numbers. It seems the model has the correct number of attention heads and hidden layers but doesn't return all of the outputs (attentions and hidden states from all layers).

If you are sure the implementation is correct and this is expected (in this case or other cases), you can always override the common tests within test_modeling_efficientformer.py by adding a method with the same name to the test class (EfficientFormerModelTester).
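
A hedged sketch of such an override; CommonTests stands in for ModelTesterMixin (defined in tests/test_modeling_common.py), and the method name is illustrative:

    import unittest

    class CommonTests:
        # Stand-in for ModelTesterMixin from tests/test_modeling_common.py.
        def test_attention_outputs(self):
            raise AssertionError("generic expectation does not fit this model")

    class EfficientFormerModelTestSketch(CommonTests, unittest.TestCase):
        # Redefining the same method name shadows the common test for this
        # model only.
        def test_attention_outputs(self):
            # A model-specific expectation would go here, e.g. attention maps
            # only from the final transformer stage rather than every layer.
            self.assertTrue(True)

    if __name__ == "__main__":
        unittest.main()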

@HuggingFaceDocBuilderDev commented Dec 19, 2022

The documentation is not available anymore as the PR was closed or merged.

@alaradirik (Contributor) left a comment

The changes look good to me!

I left a few comments, including a code snippet to fix the model initialization test failure. My main comment concerns the feature extractor: could you rename EfficientFormerFeatureExtractor to EfficientFormerImageProcessor?

We are changing the naming to avoid confusion, as users sometimes think the feature extractor is a model itself rather than the preprocessor. The renaming requires changing the filename (to image_processing_efficientformer.py) and the test filename, replacing all instances of feature extraction imports with the image processor, and adding it to models/auto/image_processing_auto.py (a sketch follows below).

You can see an example of this in this PR.
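
A rough sketch of the auto-mapping step; the mapping variable and the neighboring entry are assumed from image_processing_auto.py at the time, not copied from this PR's diff:

    from collections import OrderedDict

    # Sketch of the entry added to
    # src/transformers/models/auto/image_processing_auto.py; the mapping
    # name and the "beit" entry are assumptions, and the full table is
    # abbreviated here.
    IMAGE_PROCESSOR_MAPPING_NAMES = OrderedDict(
        [
            ("beit", "BeitImageProcessor"),
            ("efficientformer", "EfficientFormerImageProcessor"),
        ]
    )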

(Review threads on src/transformers/models/auto/modeling_auto.py, docs/source/en/model_doc/efficientformer.mdx, README_hd.md, and README_zh-hans.md.)
@sgugger (Collaborator) commented Jan 8, 2023

Thanks @Bearnardd !
@NielsRogge I'll let you have one last look and merge if you're happy :-)

@alaradirik (Contributor) left a comment

The PR is almost ready to be merged and looks great overall!

I left a few comments to update all repo names and fix the 3 failing slow tests. You can run all tests, including the slow ones, with:
RUN_SLOW=True pytest tests/models/efficientformer/test_modeling_efficientformer.py

@alaradirik merged commit 1b37fb5 into huggingface:main on Jan 20, 2023.
ts2095 pushed a commit to ts2095/transformers referencing this pull request on Jan 20, 2023:

- Adds EfficientFormer V1 to transformers
- PR co-authored by @novice03 and @Bearnardd

Co-authored-by: novice <pranavpulijala@gmail.com>
Co-authored-by: novice <44259234+novice03@users.noreply.github.com>
@novice03 (Contributor)
Thank you so much for working on this @Bearnardd!

venkat-natchi (Jan 22, 2023) and miyu386 (Feb 9, 2023) pushed commits to their forks of transformers referencing this pull request with the same message.