Per-flow action types for ScriptFlow/TaskFlow and TaskFlow reliability benchmark#2080
Merged
hillary-mutisya merged 1 commit intomicrosoft:mainfrom Mar 26, 2026
Merged
Conversation
…y benchmark
Replace single executeScriptFlow/executeTaskFlow routing with per-flow action
types that preserve named parameters in both grammar rules and dynamic schema.
This fixes schema validation rejecting flow-specific parameters (P2) and gives
the LLM translator richer type information for semantic matching.
- ScriptFlow/TaskFlow: grammar rules now generate actionName per flow (e.g.
"listFiles", "createTopSongsPlaylist") instead of "executeScriptFlow"/
"executeTaskFlow" with flowName param
- Dynamic schema generates per-flow types with typed parameters instead of
a single generic type with [key: string]: unknown
- Index entries store parameter metadata for schema generation
- ScriptFlow benchmark evaluator updated for new action name extraction
- TaskFlow benchmark: 52 scenarios across 8 categories (seeding, grammar,
LLM translation, execution, CRUD, recording, step patterns, error handling)
- Baseline: 100% grammar match, 100% parameter extraction, 62.2% overall
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Replace single executeScriptFlow/executeTaskFlow routing with per-flow action types that preserve named parameters in both grammar rules and dynamic schema. This fixes schema validation rejecting flow-specific parameters (P2) and gives the LLM translator richer type information for semantic matching.