Benchmarks - Keep BatchNorm as fp32 for pytorch cnn models cast to fp16 #322

jeffdaily · 2022-03-03T16:38:27Z

Description
The BatchNorm operator is not numerically stable in fp16. PyTorch documentation recommends to keep the BN op in fp32 for fp16 AMP models. Refer to https://pytorch.org/docs/stable/amp.html#ops-that-can-autocast-to-float32. Preserving BN in fp32 for superbench more accurately reflects real workloads.

superbench/benchmarks/model_benchmarks/pytorch_cnn.py

abuccts · 2022-03-04T11:09:49Z

/azp run

azure-pipelines · 2022-03-04T11:10:09Z

Azure Pipelines successfully started running 3 pipeline(s).

codecov · 2022-03-04T15:57:35Z

Codecov Report

Merging #322 (80626c2) into main (425b9ff) will increase coverage by 0.01%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##             main     #322      +/-   ##
==========================================
+ Coverage   88.64%   88.66%   +0.01%     
==========================================
  Files          76       76              
  Lines        4500     4507       +7     
==========================================
+ Hits         3989     3996       +7     
  Misses        511      511

Flag	Coverage Δ
cpu-unit-test	`72.79% <100.00%> (+0.04%)`	⬆️
cuda-unit-test	`88.59% <100.00%> (+0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
...erbench/benchmarks/model_benchmarks/pytorch_cnn.py	`93.40% <100.00%> (+0.54%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 425b9ff...80626c2. Read the comment docs.

abuccts · 2022-03-06T13:00:56Z

/azp run

azure-pipelines · 2022-03-06T13:01:15Z

Azure Pipelines successfully started running 3 pipeline(s).

keep BatchNorm as fp32 for pytorch cnn models cast to fp16

309a562

jeffdaily requested a review from a team as a code owner March 3, 2022 16:38

cp5555 requested review from guoshzhao and abuccts March 3, 2022 23:07

cp5555 assigned guoshzhao Mar 3, 2022

cp5555 changed the title ~~keep BatchNorm as fp32 for pytorch cnn models cast to fp16~~ Benchmarks - Keep BatchNorm as fp32 for pytorch cnn models cast to fp16 Mar 4, 2022

cp5555 added benchmarks SuperBench Benchmarks model-benchmarks Model Benchmark Test for SuperBench Benchmarks labels Mar 4, 2022

abuccts reviewed Mar 4, 2022

View reviewed changes

superbench/benchmarks/model_benchmarks/pytorch_cnn.py Show resolved Hide resolved

lint fix

80626c2

cp5555 approved these changes Mar 4, 2022

View reviewed changes

guoshzhao approved these changes Mar 5, 2022

View reviewed changes

abuccts enabled auto-merge (squash) March 6, 2022 13:02

abuccts merged commit a9ef0f9 into microsoft:main Mar 6, 2022

cp5555 mentioned this pull request Mar 6, 2022

V0.5.0 Release Plan #280

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Benchmarks - Keep BatchNorm as fp32 for pytorch cnn models cast to fp16 #322

Benchmarks - Keep BatchNorm as fp32 for pytorch cnn models cast to fp16 #322

jeffdaily commented Mar 3, 2022

abuccts commented Mar 4, 2022

azure-pipelines bot commented Mar 4, 2022

codecov bot commented Mar 4, 2022 •

edited

Loading

abuccts commented Mar 6, 2022

azure-pipelines bot commented Mar 6, 2022

Benchmarks - Keep BatchNorm as fp32 for pytorch cnn models cast to fp16 #322

Benchmarks - Keep BatchNorm as fp32 for pytorch cnn models cast to fp16 #322

Conversation

jeffdaily commented Mar 3, 2022

abuccts commented Mar 4, 2022

azure-pipelines bot commented Mar 4, 2022

codecov bot commented Mar 4, 2022 • edited Loading

Codecov Report

abuccts commented Mar 6, 2022

azure-pipelines bot commented Mar 6, 2022

codecov bot commented Mar 4, 2022 •

edited

Loading