
Conversation

@minitu (Contributor) commented Sep 29, 2023

This PR adds support for hysteresis in the AMP gradient scale update.
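For context: "hysteresis" in dynamic loss scaling generally means the loss scale is not backed off on the first iteration with inf/NaN gradients, but only after overflows persist for a configurable number of iterations. The sketch below is a minimal, self-contained illustration of that idea; the `HysteresisLossScaler` class name, defaults, and reset policy are assumptions made for this example and do not reflect Apex's actual grad-scaler API or the implementation merged in this PR.

```python
# Minimal sketch of loss-scale hysteresis (illustrative only).
# The class name, defaults, and reset policy are assumptions for this example,
# not Apex's actual implementation.

class HysteresisLossScaler:
    """Dynamic loss scaler that backs off only after repeated overflows."""

    def __init__(self, init_scale=2.0**16, growth_factor=2.0,
                 backoff_factor=0.5, growth_interval=2000, hysteresis=2):
        self.scale = init_scale
        self.growth_factor = growth_factor
        self.backoff_factor = backoff_factor
        self.growth_interval = growth_interval
        self.hysteresis = hysteresis            # overflows tolerated before backoff
        self._growth_tracker = 0                # successful steps since last change
        self._hysteresis_tracker = hysteresis   # remaining overflow "budget"

    def update(self, found_inf: bool) -> None:
        if found_inf:
            self._growth_tracker = 0
            self._hysteresis_tracker -= 1
            # With hysteresis > 1, a single overflow no longer shrinks the scale;
            # the scale is reduced only once the overflow budget is exhausted.
            if self._hysteresis_tracker <= 0:
                self.scale *= self.backoff_factor
                self._hysteresis_tracker = self.hysteresis
        else:
            # A successful step restores the overflow budget and counts toward growth.
            self._hysteresis_tracker = self.hysteresis
            self._growth_tracker += 1
            if self._growth_tracker >= self.growth_interval:
                self.scale *= self.growth_factor
                self._growth_tracker = 0


if __name__ == "__main__":
    scaler = HysteresisLossScaler(hysteresis=2)
    scaler.update(found_inf=True)   # first overflow: scale unchanged
    scaler.update(found_inf=True)   # second consecutive overflow: scale halved
    print(scaler.scale)             # 32768.0
```

With `hysteresis=1`, this sketch reduces to the usual dynamic loss-scaling behavior, where every overflow immediately backs off the scale.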

Jaemin Choi and others added 5 commits (September 20, 2023 15:56):

…s by tricking torch autograd (NVIDIA#1715)

* input grad checks out

* adding clamp gamma

* Both the old and the proposed implementations check out

* 2 tests not yet passed due to numerical issues

* mem_eff works

* fast-layer-norm done

* Moving mem-eff to templates

* Relax tolerance for memory efficient backward

* Fix backward api of python

* Add distopt support for param syncs with non-floating-point dtypes

Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Update apex/contrib/optimizers/distributed_fused_adam.py

Co-authored-by: Masaki Kozuki <mkozuki@nvidia.com>

---------

Signed-off-by: Tim Moon <tmoon@nvidia.com>
Co-authored-by: Masaki Kozuki <mkozuki@nvidia.com>
@minitu (Contributor, Author) commented Sep 30, 2023

@crcrpar Thanks, addressed your comments

@crcrpar merged commit 6a77872 into NVIDIA:master on Sep 30, 2023