synchronize inputs to onnx session on GPU #1061

isaacrob-roboflow · 2025-03-03T23:08:28Z

Description

Running GPU preprocessed inference while another process used the GPU sometimes caused the ONNX model to produce unexpected output. By asking the ONNX session to ensure that its input buffer is synchronized prior to running inference, we resolve this issue. We also cause faster inference. With CPU based preprocessing, my test based on our client's use case runs in 130ms. With GPU, we get 40ms. With synchronization, we get 22ms.

Type of change

Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
This change requires a documentation update

How has this change been tested, please provide a testcase or example of how you tested the change?

I ran inference while training a neural network in another process. Without synchronization, the model would sometimes produce confidences that were all 0s or all 1s. With synchronization, I no longer observe that behavior.

If there's interest I could try to build a test case that covers this, but it would have to run on GPU and may randomly pass anyway.

Any specific deployment considerations

For example, documentation changes, usability, usage/costs, secrets, etc.

Docs

Docs updated? What were the changes:

synchronize inputs to onnx session on GPU

c8d650d

isaacrob-roboflow requested review from PawelPeczek-Roboflow, grzegorz-roboflow, yeldarby, probicheaux, hansent and EmilyGavrilenko as code owners March 3, 2025 23:08

grzegorz-roboflow approved these changes Mar 6, 2025

View reviewed changes

grzegorz-roboflow merged commit bf4cb5b into main Mar 6, 2025
30 checks passed

grzegorz-roboflow deleted the onnx_input_synchronization branch March 6, 2025 09:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

synchronize inputs to onnx session on GPU #1061

synchronize inputs to onnx session on GPU #1061

isaacrob-roboflow commented Mar 3, 2025

synchronize inputs to onnx session on GPU #1061

synchronize inputs to onnx session on GPU #1061

Conversation

isaacrob-roboflow commented Mar 3, 2025

Description

Type of change

How has this change been tested, please provide a testcase or example of how you tested the change?

Any specific deployment considerations

Docs