Skip to content

Per-flow action types for ScriptFlow/TaskFlow and TaskFlow reliability benchmark#2080

Merged
hillary-mutisya merged 1 commit intomicrosoft:mainfrom
hillary-mutisya:authoring6
Mar 26, 2026
Merged

Per-flow action types for ScriptFlow/TaskFlow and TaskFlow reliability benchmark#2080
hillary-mutisya merged 1 commit intomicrosoft:mainfrom
hillary-mutisya:authoring6

Conversation

@hillary-mutisya
Copy link
Copy Markdown
Collaborator

Replace single executeScriptFlow/executeTaskFlow routing with per-flow action types that preserve named parameters in both grammar rules and dynamic schema. This fixes schema validation rejecting flow-specific parameters (P2) and gives the LLM translator richer type information for semantic matching.

  • ScriptFlow/TaskFlow: grammar rules now generate actionName per flow (e.g. "listFiles", "createTopSongsPlaylist") instead of "executeScriptFlow"/ "executeTaskFlow" with flowName param
  • Dynamic schema generates per-flow types with typed parameters instead of a single generic type with [key: string]: unknown
  • Index entries store parameter metadata for schema generation
  • ScriptFlow benchmark evaluator updated for new action name extraction
  • TaskFlow benchmark: 52 scenarios across 8 categories (seeding, grammar, LLM translation, execution, CRUD, recording, step patterns, error handling)
  • Baseline: 100% grammar match, 100% parameter extraction, 62.2% overall

…y benchmark

 Replace single executeScriptFlow/executeTaskFlow routing with per-flow action
  types that preserve named parameters in both grammar rules and dynamic schema.
  This fixes schema validation rejecting flow-specific parameters (P2) and gives
  the LLM translator richer type information for semantic matching.

  - ScriptFlow/TaskFlow: grammar rules now generate actionName per flow (e.g.
    "listFiles", "createTopSongsPlaylist") instead of "executeScriptFlow"/
    "executeTaskFlow" with flowName param
  - Dynamic schema generates per-flow types with typed parameters instead of
    a single generic type with [key: string]: unknown
  - Index entries store parameter metadata for schema generation
  - ScriptFlow benchmark evaluator updated for new action name extraction
  - TaskFlow benchmark: 52 scenarios across 8 categories (seeding, grammar,
    LLM translation, execution, CRUD, recording, step patterns, error handling)
  - Baseline: 100% grammar match, 100% parameter extraction, 62.2% overall
@hillary-mutisya hillary-mutisya added this pull request to the merge queue Mar 26, 2026
Merged via the queue into microsoft:main with commit 4b1cca6 Mar 26, 2026
13 of 15 checks passed
@hillary-mutisya hillary-mutisya deleted the authoring6 branch March 26, 2026 04:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant