
Added TanhGrad. #9507

Merged
satyajandhyala merged 3 commits into master from sajandhy/add_tanh_grad on Oct 26, 2021
Conversation

@satyajandhyala (Contributor)

Description: Added Tanh gradient schema and kernel

Motivation and Context

  • Why is this change required? What problem does it solve?
  • If it fixes an open issue, please link to the issue here.
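The kernel added by this PR computes the gradient of tanh. As an illustration of the underlying math only (not the actual ONNX Runtime CPU/CUDA kernel), the backward pass can be computed from the forward output alone, using the identity d/dx tanh(x) = 1 − tanh(x)²; here is a minimal NumPy sketch checked against a finite difference:

```python
import numpy as np

def tanh_grad(dy: np.ndarray, y: np.ndarray) -> np.ndarray:
    """Backward pass for tanh, given the forward output y = tanh(x).

    Uses d/dx tanh(x) = 1 - tanh(x)^2: the incoming gradient dy is
    scaled elementwise by (1 - y^2), so x itself is not needed.
    """
    return dy * (1.0 - y * y)

# Sanity check against a central finite difference.
x = np.linspace(-2.0, 2.0, 9)
y = np.tanh(x)
dy = np.ones_like(x)                 # upstream gradient of all ones
analytic = tanh_grad(dy, y)

eps = 1e-6
numeric = (np.tanh(x + eps) - np.tanh(x - eps)) / (2 * eps)
assert np.allclose(analytic, numeric, atol=1e-6)
```

Computing the gradient from `y` rather than `x` is what lets a fused TanhGrad op replace the multi-kernel expression that ORT would otherwise emit for the backward graph.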

@satyajandhyala added the component:ortmodule and training (issues related to ONNX Runtime training; typically submitted using template) labels on Oct 23, 2021
@satyajandhyala force-pushed the sajandhy/add_tanh_grad branch 2 times, most recently from 6e4615e to 215aad6 on October 25, 2021 00:57
@satyajandhyala changed the title from "[WIP] Added TanhGrad." to "Added TanhGrad." on Oct 25, 2021
@SherlockNoMad (Contributor) previously approved these changes on Oct 25, 2021 and left a comment:


LGTM with nit comments.

@satyajandhyala (Contributor, Author) commented on Oct 25, 2021:

[Screenshot: nvprof profiling output]
Profiled test_tanh_grad with and without the TanhGrad changes using the following command:

nvprof python -m pytest -sv orttraining_test_ortmodule_api.py -k test_tanh_grad

Using the new TanhGrad operator reduced the gradient computation from 58 microseconds (36.5 + 21.5) to 21 microseconds, roughly a 2.8x speedup (about a 64% reduction).
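The quoted timings are easy to check directly; note that 58 µs → 21 µs works out to roughly a 2.8x speedup, i.e. about a 64% reduction:

```python
# Timings quoted in the profiling comment above (microseconds).
before = 36.5 + 21.5   # two kernels before the dedicated TanhGrad op
after = 21.0           # single TanhGrad kernel

speedup = before / after
reduction = 1.0 - after / before
print(f"{speedup:.2f}x speedup, {reduction:.0%} reduction")
# → 2.76x speedup, 64% reduction
```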

@satyajandhyala satyajandhyala merged commit f29057c into master Oct 26, 2021
@satyajandhyala satyajandhyala deleted the sajandhy/add_tanh_grad branch October 26, 2021 16:10
