Update gather to use multiple threads #11524
Merged
RyanUnderhill merged 5 commits into master on May 17, 2022
Conversation
hariharans29 previously approved these changes on May 16, 2022
// Copyright (c) Microsoft Corporation. All rights reserved.
// Licensed under the MIT License.

+ #include <string>
Member
Just curious - why was this header inclusion required now?
Contributor
Author
There was a lint warning about not including it. I forget the exact text, but it was something about including headers for the types you use.
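For context, this is roughly what the rule looks like in practice (an illustration only; GatherOpName is a made-up function, and the exact lint text isn't quoted in this thread):

#include <string>  // include-what-you-use: this file uses std::string directly,
                   // so it should include <string> itself rather than relying
                   // on a transitive include from another header.

std::string GatherOpName() { return "GatherElements"; }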
hariharans29 approved these changes on May 17, 2022
Description: GatherElements wouldn't distribute its work across multiple threads.
Motivation and Context
A user compared the performance of ONNX Runtime against PyTorch and found that the latest PyTorch was 2x faster. A profile showed that PyTorch was using all CPU cores while ONNX Runtime was limited to one.
There is a separate issue where the user was also using ONNX Runtime inefficiently: there's a memcpy on every Run() call to copy the output tensor, which using I/O bindings avoids.
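As a rough sketch of that I/O-binding workaround (the tensor names "input" and "output" are placeholders, not taken from the user's model), the buffers are bound once up front so Run() writes the output in place:

#include <onnxruntime_cxx_api.h>

// Bind pre-allocated tensors so Run() writes the output in place instead of
// copying it out on every call.
void RunWithIoBinding(Ort::Session& session, Ort::Value& input_tensor,
                      Ort::Value& output_tensor) {
  Ort::IoBinding binding(session);
  binding.BindInput("input", input_tensor);    // placeholder input name
  binding.BindOutput("output", output_tensor); // placeholder output name
  session.Run(Ort::RunOptions{nullptr}, binding);
}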
Here's some performance data comparing the old and new versions. Note that there is a slight perf hit in the single-threaded case, since the work now has to be divided into independent chunks, whereas the old code could use a slightly faster incremental calculation between elements. A sketch of the chunking approach follows the numbers below.
New version using 8 threads:
onnx model: 0.770249s after 10000 iterations
pytorch model: 3.669060s after 10000 iterations
New version limited to one thread:
onnx model: 3.474726s after 10000 iterations
pytorch model: 3.579055s after 10000 iterations
Old version:
onnx model: 2.905542s after 10000 iterations
pytorch model: 3.634197s after 10000 iterations
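For reference, here is a minimal sketch of the chunking approach described above (not the PR code itself; it assumes a flat gather out[i] = data[indices[i]] and uses plain std::thread rather than the ORT thread pool):

#include <algorithm>
#include <cstdint>
#include <thread>
#include <vector>

// Split the output range into independent chunks, one per thread. Each chunk
// computes its own starting offset from scratch, which is what costs a little
// extra in the single-threaded case compared to the old incremental walk.
void ParallelGather(const float* data, const int64_t* indices, float* out,
                    int64_t n, int num_threads) {
  const int64_t chunk = (n + num_threads - 1) / num_threads;
  std::vector<std::thread> workers;
  for (int t = 0; t < num_threads; ++t) {
    const int64_t begin = t * chunk;
    const int64_t end = std::min(begin + chunk, n);
    if (begin >= end) break;
    workers.emplace_back([=] {
      for (int64_t i = begin; i < end; ++i) out[i] = data[indices[i]];
    });
  }
  for (auto& w : workers) w.join();
}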