[WebNN EP] Automatically use ml-tensor for outputs #24282

egalli · 2025-04-02T21:14:32Z

Description

If it would improve performance, this patch moves outputs to MLTensor backed Tensors.

Motivation and Context

We are currently performing an extra copy on output tensors located in the CPU when using the WebNN EP (MLTensor -(copy)-> wasm heap -(copy)-> JS). This patch removes this copy by moving the readback to JS instead of wasm. As an extra benefit, we can also start the readbacks and wait for them in parallel.

This change is similar to #23073

### Description If it would improve performance, this patch moves outputs to MLTensor backed Tensors. ### Motivation and Context We are currently performing an extra copy on output tensors located in the CPU when using the WebNN EP (MLTensor -(copy)-> wasm heap -(copy)-> JS). This patch removes this copy by moving the readback to JS instead of wasm. As an extra benefit, we can also start and wait for the readbacks in parallel.

snnn · 2025-04-03T16:19:41Z

/azp run all

azure-pipelines · 2025-04-03T16:19:47Z

No pipelines are associated with this pull request.

snnn · 2025-04-03T16:20:25Z

/azp run Linux QNN CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI, Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline,Windows x64 QNN CI Pipeline

azure-pipelines · 2025-04-03T16:20:49Z

Azure Pipelines successfully started running 5 pipeline(s).

js/web/lib/wasm/wasm-core-impl.ts

This reverts commit 11d5966.

fs-eire · 2025-04-13T06:37:13Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline,ONNX Runtime Web CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline

fs-eire · 2025-04-13T06:37:15Z

/azp run Linux QNN CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Linux Android Emulator QNN CI Pipeline,Android CI Pipeline,iOS CI Pipeline,ONNX Runtime React Native CI Pipeline,Linux DNNL CI Pipeline,Linux MIGraphX CI Pipeline,Linux ROCm CI Pipeline

azure-pipelines · 2025-04-13T06:37:33Z

Azure Pipelines successfully started running 5 pipeline(s).

azure-pipelines · 2025-04-13T06:37:40Z

Azure Pipelines successfully started running 7 pipeline(s).

snnn closed this Apr 3, 2025

snnn reopened this Apr 3, 2025

guschmue added the ep:WebNN label Apr 3, 2025

fs-eire reviewed Apr 7, 2025

View reviewed changes

js/web/lib/wasm/wasm-core-impl.ts Outdated Show resolved Hide resolved

egalli added 4 commits April 7, 2025 13:42

Missing case where developer gives a filled tensor as output

0331c6a

Remove 'ml-tensor-cpu-output'

11d5966

Revert "Remove 'ml-tensor-cpu-output'"

a600315

This reverts commit 11d5966.

Adding comment on 'ml-tensor-cpu-output'

78a90c6

fs-eire approved these changes Apr 11, 2025

View reviewed changes

fs-eire merged commit 1c2225e into microsoft:main Apr 16, 2025
71 of 82 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WebNN EP] Automatically use ml-tensor for outputs #24282

[WebNN EP] Automatically use ml-tensor for outputs #24282

egalli commented Apr 2, 2025

snnn commented Apr 3, 2025

azure-pipelines bot commented Apr 3, 2025

snnn commented Apr 3, 2025

azure-pipelines bot commented Apr 3, 2025

fs-eire commented Apr 13, 2025

fs-eire commented Apr 13, 2025

azure-pipelines bot commented Apr 13, 2025

azure-pipelines bot commented Apr 13, 2025

[WebNN EP] Automatically use ml-tensor for outputs #24282

[WebNN EP] Automatically use ml-tensor for outputs #24282

Conversation

egalli commented Apr 2, 2025

Description

Motivation and Context

snnn commented Apr 3, 2025

azure-pipelines bot commented Apr 3, 2025

snnn commented Apr 3, 2025

azure-pipelines bot commented Apr 3, 2025

fs-eire commented Apr 13, 2025

fs-eire commented Apr 13, 2025

azure-pipelines bot commented Apr 13, 2025

azure-pipelines bot commented Apr 13, 2025