[Minor] Expose bias options for both MLP and FusedMLP, use same defaults #220

blefaudeux · 2022-03-01T18:43:59Z

What does this PR do?

Looking into #219 exposed that MLP and FusedMLP were not using the same defaults (bias and no bias respectively), and that Timm defaulted to using a bias. This PR

exposes the bias flag
uses the same default for both MLP and FusedMLP
use default bias=True so that it's the same as Timm

Before submitting

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in Github issues there's a high chance it will not be merged.

blefaudeux · 2022-03-01T18:45:57Z

cc @jramapuram, but this is not the reason for your issue, since you were seeing the same number of parameters

codecov-commenter · 2022-03-01T19:15:51Z

Codecov Report

Merging #220 (2926d46) into main (a65c243) will not change coverage.
The diff coverage is 100.00%.

@@           Coverage Diff           @@
##             main     #220   +/-   ##
=======================================
  Coverage   91.72%   91.72%           
=======================================
  Files          60       60           
  Lines        3214     3214           
=======================================
  Hits         2948     2948           
  Misses        266      266

Flag	Coverage Δ
Python	`91.72% <100.00%> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
xformers/components/feedforward/fused_mlp.py	`91.30% <ø> (ø)`
xformers/components/feedforward/mlp.py	`100.00% <ø> (ø)`
xformers/triton/layer_norm.py	`76.92% <100.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update a65c243...2926d46. Read the comment docs.

blefaudeux · 2022-03-02T04:09:47Z

Guessing that this is small enough to land, not changing anything really deep, exposing existing params and aligning defaults

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Mar 1, 2022

expose the bias options for both MLP and FusedMLP, use the same defaults

19af415

blefaudeux force-pushed the align_mlp_fused_mlp branch from dc22706 to 19af415 Compare March 1, 2022 18:45

blefaudeux linked an issue Mar 1, 2022 that may be closed by this pull request

xformers ViT-B ImageNet MAE + Deepnorm training instability #219

Open

blefaudeux requested review from fmassa and dianaml0 March 1, 2022 18:53

blefaudeux changed the title ~~[Minor] expose the bias options for both MLP and FusedMLP, use the same defaults~~ [Minor] Expose bias options for both MLP and FusedMLP, use same defaults Mar 1, 2022

blefaudeux requested a review from jieru-hu March 1, 2022 22:48

using the same eps in layernorm as default torch (#221)

2926d46

blefaudeux merged commit d4c28fb into main Mar 2, 2022

blefaudeux deleted the align_mlp_fused_mlp branch March 2, 2022 04:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Minor] Expose bias options for both MLP and FusedMLP, use same defaults #220

[Minor] Expose bias options for both MLP and FusedMLP, use same defaults #220

blefaudeux commented Mar 1, 2022

blefaudeux commented Mar 1, 2022

codecov-commenter commented Mar 1, 2022 •

edited

blefaudeux commented Mar 2, 2022

[Minor] Expose bias options for both MLP and FusedMLP, use same defaults #220

[Minor] Expose bias options for both MLP and FusedMLP, use same defaults #220

Conversation

blefaudeux commented Mar 1, 2022

What does this PR do?

Before submitting

PR review

blefaudeux commented Mar 1, 2022

codecov-commenter commented Mar 1, 2022 • edited

Codecov Report

blefaudeux commented Mar 2, 2022

codecov-commenter commented Mar 1, 2022 •

edited