test(e2e): expand winml perf coverage across EPs and devices by xieofxie · Pull Request #698 · microsoft/winml-cli

xieofxie · 2026-05-21T09:04:02Z

Summary

- Expands `tests/e2e/test_perf_e2e.py` with parametrized `--ep` tests across `qnn`, `vitisai`, `openvino`, `dml`, `nv_tensorrt_rtx`, `migraphx`, combined with `--device cpu/gpu/npu`. Adds a CPU `--monitor` test and a new `TestPerfHuggingFace` class that runs the same EP/device matrix against `microsoft/resnet-50` loaded via the perf command.
- Refactors duplicated CLI-invocation and result-assertion code into `_build_perf_args`, `_assert_hw_monitor_section`, and `_assert_monitor_result` helpers so each scenario is a few lines instead of a hand-rolled argv list.
- Hardens `WinMLEPRegistry`: when a provider's `ensure_ready()` raises, log a warning and skip it instead of aborting registry initialization, so one broken EP no longer prevents the rest from being discovered.
- `tests/e2e/require_ep.py`: short-circuit `DmlExecutionProvider` alongside `CPUExecutionProvider` since DML ships with ORT and is not enumerated by `WinMLEPRegistry`, so `require_ep("dml")` no longer spuriously skips.
- `WinMLSession`: drop the `WINMLSESSION_VERBOSE` env-var gating; "Compiling for device" / "Already compiled" now log unconditionally at INFO.
- `test_benchmark_auto`: accept that `--device auto` resolves to a concrete `gpu`/`npu` in the output, since `resolve_device` no longer leaves the literal `"auto"` in `benchmark_info`.

Closes #502
Closes #688
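
To make the EP/device matrix and the argv helper from the summary concrete, here is a minimal sketch. It is not the code from this PR: `build_perf_args` is a hypothetical stand-in for `_build_perf_args` (whose real signature is not shown here), passing the model as a positional argument after `perf` is an assumption, and the full cross product of EPs and devices is also an assumption, since the real tests may only pair each EP with the devices it supports.

```python
import itertools

# EP and device names are taken from the PR summary.
EPS = ["qnn", "vitisai", "openvino", "dml", "nv_tensorrt_rtx", "migraphx"]
DEVICES = ["cpu", "gpu", "npu"]


def build_perf_args(model, ep=None, device=None, monitor=False):
    """Hypothetical helper: build the argv list for one perf scenario.

    Assumption: the model is passed positionally after the "perf" subcommand.
    """
    args = ["perf", model]
    if ep is not None:
        args += ["--ep", ep]
    if device is not None:
        args += ["--device", device]
    if monitor:
        args.append("--monitor")
    return args


# Assumption: every EP is combined with every device.
MATRIX = list(itertools.product(EPS, DEVICES))

for ep, device in MATRIX[:3]:
    print(build_perf_args("model.onnx", ep=ep, device=device))
```

In a pytest suite, a list like `MATRIX` could be fed to `pytest.mark.parametrize("ep,device", MATRIX)` so that each pair becomes its own test case.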

Co-authored-by: hualxie <hualxie@microsoft.com>

hualxie added 6 commits May 21, 2026 15:14

- add cpu monitor (6148226)
- use shared functions (3fe4b2a)
- update tests (b843feb)
- add more tests (0b07bbb)
- test fix (3fe2bb1)
- auto should be removed (5f13213)

xieofxie requested a review from a team as a code owner May 21, 2026 09:04

- revert (992f755)

xieofxie commented May 21, 2026

Comment thread src/winml/modelkit/session/ep_registry.py
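
The registry hardening described in the summary (warn and skip when a provider's `ensure_ready()` raises) follows a common pattern, sketched below under stated assumptions. `FakeProvider` and `collect_ready_providers` are hypothetical names for illustration only; the actual `WinMLEPRegistry` code is not reproduced here and may differ.

```python
import logging

logger = logging.getLogger("ep_registry_sketch")


class FakeProvider:
    """Hypothetical provider used only to exercise the sketch."""

    def __init__(self, name, broken=False):
        self.name = name
        self.broken = broken

    def ensure_ready(self):
        if self.broken:
            raise RuntimeError(f"{self.name} is not installed correctly")


def collect_ready_providers(providers):
    """Keep providers whose ensure_ready() succeeds; warn and skip the rest."""
    ready = []
    for provider in providers:
        try:
            provider.ensure_ready()
        except Exception as exc:
            # One broken EP is logged and skipped instead of aborting discovery.
            logger.warning("Skipping EP %s: %s", provider.name, exc)
            continue
        ready.append(provider)
    return ready


providers = [
    FakeProvider("qnn"),
    FakeProvider("vitisai", broken=True),
    FakeProvider("openvino"),
]
ready_names = [p.name for p in collect_ready_providers(providers)]
print(ready_names)
```

Catching a broad `Exception` here is a deliberate trade-off in the sketch: discovery keeps going for the remaining providers, at the cost of turning a hard failure into a warning that has to be noticed in the logs.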

xieofxie commented May 21, 2026

Comment thread src/winml/modelkit/session/session.py
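
The `WinMLSession` logging change in the summary can be illustrated with a before/after sketch. This is an assumption-laden illustration rather than the real code: the function names, the exact message format, and the way the old code tested `WINMLSESSION_VERBOSE` (any non-empty value) are all hypothetical.

```python
import logging
import os


class ListHandler(logging.Handler):
    """Collects formatted log messages so the sketch can inspect them."""

    def __init__(self):
        super().__init__()
        self.messages = []

    def emit(self, record):
        self.messages.append(record.getMessage())


logger = logging.getLogger("winml_session_sketch")
logger.setLevel(logging.INFO)
logger.propagate = False
handler = ListHandler()
logger.addHandler(handler)


def log_compile_gated(device, environ=os.environ):
    """Before (assumed): log only when WINMLSESSION_VERBOSE is set."""
    if environ.get("WINMLSESSION_VERBOSE"):
        logger.info("Compiling for device %s", device)


def log_compile_unconditional(device):
    """After: always log at INFO; verbosity is left to the logging level."""
    logger.info("Compiling for device %s", device)


log_compile_gated("npu", environ={})  # emits nothing without the env var
log_compile_unconditional("npu")      # always emits at INFO
print(handler.messages)
```

With the gating removed, whether these messages appear is controlled by the configured logging level alone rather than by a separate environment variable.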

hualxie added 2 commits May 21, 2026 17:06

- to debug (666a72c)
- code fix (092a179)

timenick approved these changes May 22, 2026

xieofxie merged commit 87cb7c6 into main May 22, 2026
9 checks passed

xieofxie deleted the hualxie/vitisa branch May 22, 2026 02:29

xieofxie merged 9 commits into main from hualxie/vitisa
