Validate model task in config. by chinazhangchao · Pull Request #723 · microsoft/winml-cli

chinazhangchao · 2026-05-25T07:55:01Z

No description provided.

…to chao/validatetask

timenick

Three substantive findings. The preflight check works, but two of them are tied to recent changes elsewhere in the repo (#719 dedupe, CLAUDE.md import rules), and one is a coverage-asymmetry question that applies across CLI entrypoints.

…to chao/validatetask

zhenchaoni

A few concerns with the current acceptance semantics that I think are worth addressing before merge, because they interact with how the rest of the CLI resolves tasks.

1. `normalize_task` on both sides collapses modality

The check is:

normalized_supported = {normalize_task(t) for t in supported_tasks}
if normalize_task(task) in normalized_supported:
    return hf_config

normalize_task wraps TasksManager.map_from_synonym, which treats image-feature-extraction as a synonym of feature-extraction. Collapsing both sides through it means the gate cannot distinguish text vs image FE. Two consequences:

--task image-feature-extraction against a text-only arch (e.g. bert-base-uncased) passes — the normalized form feature-extraction is in BERT's supported list.
--task feature-extraction against a vision-only arch (e.g. dinov2) passes — vision arches register feature-extraction (Optimum-canonical).

The PR title says "Validate model task" but the validator can't catch cross-modality mismatches. Worse, the second case silently produces wrong downstream behavior: our four task registries (_EVALUATOR_REGISTRY, TASK_DATASET_MAPPING, TASK_SCHEMAS, TASK_TO_WINML_CLASS) are keyed by HF-pipeline task IDs, where feature-extraction is text-only (FeatureExtractionPipeline requires a PreTrainedTokenizer) and image-feature-extraction is a separate pipeline (ImageFeatureExtractionPipeline). So a vision model with loader.task = "feature-extraction" routes through text logic in eval/quantize.

Suggested change: compare without normalizing, falling back to normalized only when the verbatim check fails. Something like:

if task in supported_tasks:
    return hf_config
if normalize_task(task) in {normalize_task(t) for t in supported_tasks}:
    # Synonym match — log a hint about the canonical spelling instead of silently accepting
    logger.warning("Task %r matches via synonym; consider using canonical %r", task, normalize_task(task))
    return hf_config

2. HF-pipeline-only task names are falsely rejected

normalize_task doesn't consult TASK_SYNONYM_EXTENSIONS in io.py. Names handled there (next-sentence-prediction → text-classification, mask-generation preserved) won't be in supported_tasks and won't normalize to something in it. Same for sentence-similarity (HF-only, not in Optimum's synonym map). The if not supported_tasks: return hf_config escape only fires when Optimum knows nothing about the arch — it doesn't help for mainstream text models.

Repro: winml build -m bert-base-uncased --task next-sentence-prediction works pre-PR (handled in export/io.py) and will be rejected post-PR.

Suggested change: short-circuit on names that the rest of the codebase knows about:

from ..loader.task import KNOWN_TASKS
from ..export.io import TASK_SYNONYM_EXTENSIONS

if task in TASK_SYNONYM_EXTENSIONS or task in KNOWN_TASKS:
    # Accept names that other CLI commands accept; let downstream resolution
    # raise ONNXConfigNotFoundError if the arch can't actually export it.
    return hf_config

3. Acceptance set now diverges from `winml config` / `winml export` / `winml perf`

The docstring notes that the other commands rely on resolve_cfg → ONNXConfigNotFoundError. Those paths use resolve_task_and_model_class + KNOWN_TASKS, which is broader and lists image-feature-extraction as a distinct entry. Result: a config.json produced by winml config (and accepted by winml export) can be rejected by winml build. That asymmetry will surface as bug reports.

Suggested change: either widen the acceptance set in this PR to match the rest of the CLI (combination of fixes 1 + 2), or move the validation up into resolve_task_and_model_class so all commands share one definition.

4. Test coverage

The current test test_rejects_incompatible_config_task_and_model only exercises the rejection path. Suggest adding:

Vision-arch + --task feature-extraction → currently passes the gate (documenting the limitation).
Text-arch + --task image-feature-extraction → currently passes (documenting the limitation, or fixing).
Text-arch + --task next-sentence-prediction → currently rejected (regression vs. existing behavior).
Same task accepted by winml config -m ... --task ... should also be accepted by winml build.

Summary

The PR is solving a real problem, but the gate's notion of "valid" is Optimum's collapse, which is the wrong granularity for the four HF-pipeline-keyed registries that consume loader.task downstream. As-is, this introduces false rejections for some HF-pipeline names and silently accepts modality-mismatched inputs. Happy to pair on a follow-up if useful — there's overlapping work on the quantize/eval side where we're hitting the same canonical-name boundary.

…to chao/validatetask

chinazhangchao added 2 commits May 25, 2026 15:54

validate model task in config

9072792

Merge branch 'main' of https://github.com/microsoft/WinML-ModelKit in…

0b90129

…to chao/validatetask

chinazhangchao changed the title ~~Chao/validatetask~~ Validate model task in config. May 25, 2026

chinazhangchao linked an issue May 25, 2026 that may be closed by this pull request

[winml build] [P1] Config loader.task is not validated against --model (or against the model architecture); incompatible pair fails mid-build with a cryptic upstream error #520

Closed

chinazhangchao marked this pull request as ready for review May 25, 2026 07:56

chinazhangchao requested a review from a team as a code owner May 25, 2026 07:56

chinazhangchao requested review from timenick, vortex-captain, xieofxie and zhenchaoni May 25, 2026 08:39

Merge branch 'main' into chao/validatetask

3350150

Copilot started work on behalf of chinazhangchao May 25, 2026 08:48 View session

Copilot stopped work on behalf of chinazhangchao due to an error May 25, 2026 08:51
The session was cancelled by the user.

chinazhangchao added 6 commits May 25, 2026 17:14

fix test

b98bf6d

revert test

d5ae8e4

Merge branch 'main' of https://github.com/microsoft/WinML-ModelKit in…

39fb54d

…to chao/validatetask

Merge branch 'main' of https://github.com/microsoft/WinML-ModelKit in…

ffeefe3

…to chao/validatetask

Merge branch 'main' into chao/validatetask

48df30a

Merge branch 'main' into chao/validatetask

10562b3

This comment was marked as outdated.

Sign in to view

timenick reviewed May 26, 2026

View reviewed changes

Comment thread src/winml/modelkit/loader/config.py Outdated

Comment thread src/winml/modelkit/commands/build.py Outdated

Comment thread src/winml/modelkit/commands/build.py

chinazhangchao added 3 commits May 26, 2026 11:32

Merge branch 'main' of https://github.com/microsoft/WinML-ModelKit in…

5b6b7d7

…to chao/validatetask

fix comments

42e58c3

Merge branch 'main' into chao/validatetask

97433f3

chinazhangchao requested a review from timenick May 26, 2026 05:48

chinazhangchao added 4 commits May 26, 2026 15:39

Merge branch 'main' into chao/validatetask

d0f104a

Merge branch 'main' into chao/validatetask

a6a467a

Merge branch 'main' into chao/validatetask

f4ec418

Merge branch 'main' into chao/validatetask

5168a67

timenick approved these changes May 27, 2026

View reviewed changes

zhenchaoni requested changes May 27, 2026

View reviewed changes

chinazhangchao added 2 commits May 28, 2026 14:37

fix comments

7c6ed9b

Merge branch 'main' of https://github.com/microsoft/WinML-ModelKit in…

acaf432

…to chao/validatetask

chinazhangchao requested a review from zhenchaoni May 28, 2026 06:39

chinazhangchao added 3 commits May 28, 2026 15:58

Merge branch 'main' into chao/validatetask

e5d0fbf

Merge branch 'main' into chao/validatetask

53e0e7c

Merge branch 'main' into chao/validatetask

6384905

zhenchaoni approved these changes May 29, 2026

View reviewed changes

chinazhangchao merged commit bb1a67c into main May 29, 2026
9 checks passed

chinazhangchao deleted the chao/validatetask branch May 29, 2026 07:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Validate model task in config.#723

Validate model task in config.#723
chinazhangchao merged 21 commits into
mainfrom
chao/validatetask

chinazhangchao commented May 25, 2026

Uh oh!

This comment was marked as outdated.

Uh oh!

timenick left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

zhenchaoni left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

chinazhangchao commented May 25, 2026

Uh oh!

This comment was marked as outdated.

Uh oh!

timenick left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

zhenchaoni left a comment

Choose a reason for hiding this comment

1. normalize_task on both sides collapses modality

2. HF-pipeline-only task names are falsely rejected

3. Acceptance set now diverges from winml config / winml export / winml perf

4. Test coverage

Summary

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

1. `normalize_task` on both sides collapses modality

3. Acceptance set now diverges from `winml config` / `winml export` / `winml perf`