New Model Architectures - Implementation and Documentation Details #5319

jdsgomes · 2022-01-31T10:42:19Z

🚀 The feature

When adding a new model architecture there are some design/implementation details and documentation requirements that need to be taken into account. This issue intents to track such details in a dynamic manner, as it is possible to change over time.

Motivation, pitch

New Model Architectures - Implementation Details

Model development and training steps

When developing a new model there are some details not to be missed:

Implement a model factory function for each of the model variants
in the module constructor, pass layer constructor instead of instance for configurable layers like norm, activation, and log the api usage with _log_api_usage_once(self)
fuse layers together with existing common blocks if possible; For example consecutive conv, bn, activation layers could be replaced by ConvNormActivation
define __all__ in the beginning of the model file to expose model factory functions; import model public APIs (e.g. factory methods) in torchvision/models/__init__.py
create the model builder using the new API and add it to the prototype area. Here is an example on how to do this. The new API requires adding more information about the weights such as the preprocessing transforms necessary for using the model, meta-data about the model, etc
Make sure you write tests for the model itself (see _check_input_backprop, _model_params and _model_params in test/test_models.py) and for any new operators/transforms or important functions that you introduce
the new model should be torch scriptable (using torch.jit.script)
the new model should be fx compatible (using torch.fx.symbolic_trace)

Note that this list is not exhaustive and there are details here related to the code quality etc, but these are rules that apply in all PRs (see Contributing to TorchVision).

Once the model is implemented, you need to train the model using the reference scripts. For example, in order to train a classification resnet18 model you would:

go to references/classification
run the train command (for example torchrun --nproc_per_node=8 train.py --model resnet18)

After training the model, select the best checkpoint and estimate its accuracy with a batch size of 1 on a single GPU. This helps us get better measurements about the accuracy of the models and avoid variants introduced due to batch padding (read here for more details).

Finally, run the model test to generate expected model files for testing. Please include those generated files in the PR as well.:

EXPECTTEST_ACCEPT=1 pytest test/test_models.py -k {model_name}

Documentation and Pytorch Hub

docs/source/models.rst:
- add the model to the corresponding section (classification/detection/video etc.)
- describe how to construct the model variants (with and without pre-trained weights)
- add model metrics and reference to the original paper
hubconf.py:
- import the model factory functions
- submit a PR to https://github.com/pytorch/hub with a model page (or update an existing one)
README.md under the reference script folder:
- command(s) to train the model

Alternatives

No response

Additional context

No response

The text was updated successfully, but these errors were encountered:

datumbox · 2022-02-04T18:23:26Z

I've pinned the issue for now but we should consider making use of the Wiki pages, which are better suited for this kind of content. PyTorch core uses them extensively, so it might be worth aligning.

NicolasHug · 2022-02-07T10:09:02Z

Are we expecting these guidelines to change often?
Otherwise we might also consider just having this as a .md file in the repo (e.g. as part of CONTRIBUTING_MODELS.md) ?

jdsgomes · 2022-02-07T10:29:03Z

That is a good point, and has been discussed previously with @datumbox and also during this PR review. In short I think is fair so say that there are no strong feelings either way, but there were two main arguments to keep it in a ticket for now. First we didn't want to make the contribution guidelines too long, second the content can change. So I would still favour to keep it here for a while and if it seems stable enough we can move it to a .md file

datumbox · 2022-02-07T11:25:42Z

Things will eventually become stable. I think the biggest changes on this documentation will come from the following:

The Multi-weight support API rollout
The Model documentation revamping
Potentially by the model testing revamping work

Once things stabilise we can move to md if you prefer.

yassineAlouini · 2022-08-18T12:39:14Z

Thanks for this great guide @jdsgomes. There is a small typo here: ConvNormActication
(should be Activation)

jdsgomes mentioned this issue Jan 31, 2022

Model contrib guidelines #5315

Merged

datumbox pinned this issue Feb 4, 2022

datumbox mentioned this issue Jul 6, 2022

Add SwinV2 in TorchVision #6242

Closed

This was referenced Aug 12, 2022

[FEAT] Add MobileViT v1 & v2 #6404

Open

Add the S3D architecture to TorchVision #6412

Merged

NicolasHug unpinned this issue Feb 17, 2023

NicolasHug pinned this issue Feb 17, 2023

pmeier unpinned this issue Mar 22, 2023

senarvi mentioned this issue May 2, 2023

[RFC] Support YOLOX detection model #6341

Open

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New Model Architectures - Implementation and Documentation Details #5319

New Model Architectures - Implementation and Documentation Details #5319

jdsgomes commented Jan 31, 2022 •

edited by datumbox

Loading

datumbox commented Feb 4, 2022

NicolasHug commented Feb 7, 2022 •

edited

Loading

jdsgomes commented Feb 7, 2022

datumbox commented Feb 7, 2022

yassineAlouini commented Aug 18, 2022

New Model Architectures - Implementation and Documentation Details #5319

New Model Architectures - Implementation and Documentation Details #5319

Comments

jdsgomes commented Jan 31, 2022 • edited by datumbox Loading

🚀 The feature

Motivation, pitch

New Model Architectures - Implementation Details

Model development and training steps

Documentation and Pytorch Hub

Alternatives

Additional context

datumbox commented Feb 4, 2022

NicolasHug commented Feb 7, 2022 • edited Loading

jdsgomes commented Feb 7, 2022

datumbox commented Feb 7, 2022

yassineAlouini commented Aug 18, 2022

jdsgomes commented Jan 31, 2022 •

edited by datumbox

Loading

NicolasHug commented Feb 7, 2022 •

edited

Loading