skills(activation): close coverage gaps + tighten descriptions #673
- Add uipath-maestro-bpmn.jsonl (50) and uipath-llm-configuration-byo-connections.jsonl (45) for two skills that landed after the activation eval (#596) without coverage.
- Replace uipath-gov-access-policy.jsonl + uipath-gov-aops-policy.jsonl with a single uipath-governance.jsonl (50 curated rows). The two skills were merged into a unified uipath-governance skill in #601, so the split criteria could never produce true positives.
- Update activation.yaml dataset.paths and success_criteria (now 20 skills, was 19) and refresh README counts.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
- Fix stale `uipath-gov-aops-policy` redirect (skill merged to uipath-governance in #601): llm-configuration-byo-connections.
- Rewrite uipath-diagnostics description: lead with brand, quote, narrow scope, add → redirects to maestro-flow / maestro-bpmn / rpa / platform (per repo rule).
- Drop forbidden `[PREVIEW]` prefix from uipath-review (rule: preview status belongs in body, not description); add Preview callout under H1.
- Add "Always invoke for X" anchors (gold-standard pattern from uipath-maestro-flow) to skills with deterministic file/CLI signals: rpa (.xaml/.cs), maestro-bpmn (.bpmn + 5 package files), coded-apps (app.config.json/action-schema.json), maestro-case (caseplan.json), agents (agent.json), solution-design (sdd.md/pdd.md), rpa-legacy (project.json targetFramework=Legacy).
- Add cross-redirects between sibling collisions: human-in-the-loop ↔ tasks (authoring vs operating); maestro-case adds .bpmn redirect; platform adds workflow-code redirects to rpa / maestro-flow / maestro-bpmn / agents.
- Tighten uipath-planner trigger ("Use only for ambiguous or multi-product…") and uipath-rpa-legacy trigger (drop "or missing", which is too broad).
- Add → uipath-diagnostics redirect from uipath-feedback to disambiguate "something is wrong" between report-bug and investigate-error.

All 20 descriptions pass the 1024-char validator.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
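For readers unfamiliar with the two description rules this commit relies on (the 1024-character limit and the ban on a `[PREVIEW]` prefix), here is a minimal sketch of what such a check could look like. The repo's actual validator is not shown in this thread, so the function name `check_description` and its return shape are hypothetical.

```python
# Hypothetical sketch: the real validator in the repo is not shown in this PR.
MAX_DESCRIPTION_CHARS = 1024  # limit quoted in the commit message


def check_description(description: str) -> list[str]:
    """Return a list of rule violations for one skill description."""
    problems = []
    if len(description) > MAX_DESCRIPTION_CHARS:
        problems.append(
            f"description is {len(description)} chars, "
            f"limit is {MAX_DESCRIPTION_CHARS}"
        )
    # Rule cited in the commit: preview status belongs in the body.
    if description.lstrip().startswith("[PREVIEW]"):
        problems.append("description must not start with [PREVIEW]")
    return problems
```

An empty list means the description passes both rules; a description of exactly 1024 characters is treated as valid in this sketch.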
Consider adding smoke tests before merging.
Claude finished @bai-uipath's task in 3m 51s
PR Review: skills(activation): close coverage gaps + tighten descriptions
Summary
Rewrites 15 skill …
Change-by-Change Review
1. …
| Area | Status | Notes |
|---|---|---|
| Frontmatter | OK | All 15 SKILL.md files have valid YAML, name matches folder, descriptions well under 1024 chars |
| E2E Tests | N/A | This PR adds activation-level (classification) tests, not behavior tests. Pre-existing smoke gaps are noted above |
| Skill Body | OK | Only uipath-review body changed (Preview callout added) — correct |
| References & Assets | N/A | No reference or asset files changed |
| Repo Hygiene | OK | No secrets, no cross-skill references, CODEOWNERS already covers uipath-governance. Scoped changes |
Issues for Manual Review
- Activation false-positive risk from shared filenames in `uipath-maestro-bpmn`: The "Always invoke for `project.uiproj`, `entry-points.json`, `operate.json`, `bindings_v2.json`, `package-descriptor.json`" anchor claims 5 non-unique filenames. Whether this causes real false activations in practice depends on the skill matching implementation, so it is worth monitoring in the activation benchmark results.
- `uipath-planner` narrowing: The description was tightened from "non-trivial or ambiguous" to "ambiguous or multi-product … where the project type is unclear." Verify this doesn't under-activate for legitimate multi-step requests where the project type IS known but the user needs planning help.
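One way to monitor the shared-filename risk is to list every anchor claimed by more than one skill. The sketch below assumes the anchors have already been extracted into a mapping from skill name to a set of anchor strings; that mapping, the function name `shared_anchors`, and the skill `example-other-skill` are all hypothetical and only illustrate the idea.

```python
from collections import defaultdict


def shared_anchors(anchors_by_skill: dict[str, set[str]]) -> dict[str, list[str]]:
    """Map each anchor claimed by two or more skills to the skills claiming it."""
    owners = defaultdict(list)
    for skill, anchors in anchors_by_skill.items():
        for anchor in anchors:
            owners[anchor].append(skill)
    return {a: sorted(s) for a, s in owners.items() if len(s) > 1}


# Illustrative data only: "example-other-skill" is made up for this sketch.
example = {
    "uipath-maestro-bpmn": {".bpmn", "project.uiproj", "entry-points.json"},
    "uipath-maestro-flow": {".flow"},
    "example-other-skill": {"project.uiproj"},
}
collisions = shared_anchors(example)
```

With this input, `collisions` contains only `project.uiproj`, which matches the reviewer's suggestion that unique extensions such as `.bpmn` and `.flow` are safer imperative anchors than package filenames.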
Conclusion
Clean, well-structured PR. The governance test consolidation, two new activation test sets, and description tightening are all correct and internally consistent. One Medium concern: the uipath-maestro-bpmn description's "Always invoke" anchor claims 5 shared filenames alongside .bpmn — recommend narrowing the imperative to .bpmn only to avoid false activations, consistent with how uipath-maestro-flow anchors only on .flow. Everything else looks good to merge.
Claude finished @bai-uipath's task in 1m 28s
Coder-eval task lint (advisory)
1 task YAML changed; verdict counts: 0 Critical, 0 High, 0 Medium, 0 Low, 1 OK. Rubric: .claude/commands/lint-task.md. This check is advisory and never blocks merge.
Evidence of passing run
✅ Author confirms passing run: "Mean per-skill
Per-task lint
* tests(activation): close skill coverage gaps

  - Add uipath-maestro-bpmn.jsonl (50) and uipath-llm-configuration-byo-connections.jsonl (45) for two skills that landed after the activation eval (#596) without coverage.
  - Replace uipath-gov-access-policy.jsonl + uipath-gov-aops-policy.jsonl with a single uipath-governance.jsonl (50 curated rows). The two skills were merged into a unified uipath-governance skill in #601, so the split criteria could never produce true positives.
  - Update activation.yaml dataset.paths and success_criteria (now 20 skills, was 19) and refresh README counts.

  Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* docs(skills): tighten activation descriptions

  - Fix stale `uipath-gov-aops-policy` redirect (skill merged to uipath-governance in #601): llm-configuration-byo-connections.
  - Rewrite uipath-diagnostics description: lead with brand, quote, narrow scope, add → redirects to maestro-flow / maestro-bpmn / rpa / platform (per repo rule).
  - Drop forbidden `[PREVIEW]` prefix from uipath-review (rule: preview status belongs in body, not description); add Preview callout under H1.
  - Add "Always invoke for X" anchors (gold-standard pattern from uipath-maestro-flow) to skills with deterministic file/CLI signals: rpa (.xaml/.cs), maestro-bpmn (.bpmn + 5 package files), coded-apps (app.config.json/action-schema.json), maestro-case (caseplan.json), agents (agent.json), solution-design (sdd.md/pdd.md), rpa-legacy (project.json targetFramework=Legacy).
  - Add cross-redirects between sibling collisions: human-in-the-loop ↔ tasks (authoring vs operating); maestro-case adds .bpmn redirect; platform adds workflow-code redirects to rpa / maestro-flow / maestro-bpmn / agents.
  - Tighten uipath-planner trigger ("Use only for ambiguous or multi-product…") and uipath-rpa-legacy trigger (drop "or missing", which is too broad).
  - Add → uipath-diagnostics redirect from uipath-feedback to disambiguate "something is wrong" between report-bug and investigate-error.

  All 20 descriptions pass the 1024-char validator.

  Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

---------

Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Summary
Tightens skill activation by rewriting all 20 skill `description` fields with the "Always invoke for X" anchor pattern (unique file extensions and CLI tokens), merges the two stale governance skills (`gov-access-policy` + `gov-aops-policy`) into one curated `uipath-governance` test set, and adds activation test coverage for two previously untested skills (`uipath-maestro-bpmn`, `uipath-llm-configuration-byo-connections`).

Mean per-skill `recall.yes` moved from 0.483 → 0.683 (+0.20 absolute) and negative TN rate from 96% → 100%, with no precision cost (FP rate held at 0.14%).
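To make the headline metrics concrete, the sketch below shows one plausible way to compute a mean per-skill recall and a true-negative rate from labelled activation rows. The row shape `(skill, expected, activated)` and both function names are assumptions for illustration; the benchmark's real JSONL schema and its exact definition of `recall.yes` are not shown in this PR.

```python
from collections import defaultdict

# Assumed row shape: (skill, expected_to_activate, actually_activated).
Row = tuple[str, bool, bool]


def mean_per_skill_recall(rows: list[Row]) -> float:
    """Average, over skills, of hits / positives for rows expected to activate."""
    hits = defaultdict(int)
    positives = defaultdict(int)
    for skill, expected, activated in rows:
        if expected:
            positives[skill] += 1
            hits[skill] += int(activated)
    recalls = [hits[s] / positives[s] for s in positives]
    return sum(recalls) / len(recalls)


def true_negative_rate(rows: list[Row]) -> float:
    """Share of rows expected NOT to activate that indeed stayed inactive."""
    negatives = [r for r in rows if not r[1]]
    return sum(1 for r in negatives if not r[2]) / len(negatives)


# Made-up sample rows, not data from the benchmark.
sample = [
    ("skill-a", True, True),
    ("skill-a", True, False),
    ("skill-b", True, True),
    ("skill-b", False, False),
]
```

Averaging per skill rather than over all rows keeps a skill with a large test set from dominating the figure, which is presumably why the summary reports the per-skill mean.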