Make output 1 of ConcatTraining Optional and place on CPU #7199
Merged: ashbhandare merged 2 commits into master on Apr 1, 2021.
Conversation
SherlockNoMad (Contributor) previously approved these changes on Mar 31, 2021 and left a comment:
looks good, but let's still update the name.
Contributor comment: looking good on the testing, collecting numbers shortly.
Force-pushed from 7d66cc0 to 2224a35.
SherlockNoMad previously approved these changes on Apr 1, 2021.
ytaous previously approved these changes on Apr 1, 2021.
Force-pushed from 2224a35 to 48da6d4.
SherlockNoMad approved these changes on Apr 1, 2021.
Output 1 of ConcatTraining is consumed by SplitTraining in the gradient graph, which expects it on the host, while ConcatTraining places it on the device. To avoid the resulting device-to-host (DtoH) memcpy, this PR places output 1 on the host and also makes its computation optional for the case where no other node consumes it.
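For illustration, here is a minimal sketch (not taken from the PR) of what the optional output means at the graph level, built with the standard onnx Python helper. The tensor names, the output names, and the axis value are assumptions for the example; the actual ConcatTraining schema lives in the com.microsoft contrib domain of ONNX Runtime.

```python
from onnx import helper

# Hypothetical ConcatTraining node that emits both outputs. With this PR,
# output 1 (the per-input lengths) is placed on CPU, so the SplitTraining
# consumer in the gradient graph no longer needs a DtoH copy.
node_with_lengths = helper.make_node(
    "ConcatTraining",
    inputs=["x0", "x1"],                            # illustrative input names
    outputs=["concatenated", "per_input_length"],   # illustrative output names
    domain="com.microsoft",
    axis=0,                                         # assumed concat axis
)

# If nothing consumes output 1, the graph can simply omit it; with this PR
# the kernel then skips computing it altogether.
node_without_lengths = helper.make_node(
    "ConcatTraining",
    inputs=["x0", "x1"],
    outputs=["concatenated"],
    domain="com.microsoft",
    axis=0,
)
```

The second form is what the gradient-graph builder can emit when the lengths output is unused, saving both the computation and the host placement.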