Skip to content

feat(skills): add skill-creator assets and eval-viewer#758

Merged
JeremyDev87 merged 2 commits intomasterfrom
feat/skill-creator-assets-743
Mar 21, 2026
Merged

feat(skills): add skill-creator assets and eval-viewer#758
JeremyDev87 merged 2 commits intomasterfrom
feat/skill-creator-assets-743

Conversation

@JeremyDev87
Copy link
Owner

Summary

  • Add assets/eval_review.html: self-contained dark-mode trigger accuracy evaluation viewer with query cards (color-coded by should_trigger), edit/add/delete UI, and eval_set.json download button. Placeholders: __EVAL_DATA_PLACEHOLDER__, __SKILL_NAME_PLACEHOLDER__, __SKILL_DESCRIPTION_PLACEHOLDER__
  • Add assets/skill-template.md: default SKILL.md template matching codingbuddy pattern (Core principle, Iron Law, When to Use, When NOT to Use, The Process phases, Additional resources). Placeholders: {{SKILL_NAME}}, {{SKILL_DISPLAY_NAME}}
  • Add eval-viewer/generate_review.py: benchmark result HTML viewer generator with side-by-side with_skill/baseline comparison, --benchmark flag for stats summary (pass_rate, tokens, time mean±stddev), collapsible evidence display, --previous-workspace delta comparison, --static output option. Python 3.8+ standard library only

Test plan

  • eval_review.html opens in browser with placeholder data
  • Query add/edit/delete UI works, download produces valid JSON
  • No external CDN dependencies (self-contained)
  • skill-template.md placeholders match init_skill.sh expectations
  • python generate_review.py --help shows all arguments
  • python generate_review.py with sample data generates valid HTML
  • --benchmark flag loads and displays summary stats
  • --previous-workspace shows delta comparison
  • --static writes to file
  • Python syntax validation passes
  • markdownlint passes

Closes #743

- assets/eval_review.html: self-contained dark-mode trigger accuracy
  evaluation viewer with query cards, edit/add/delete UI, and
  eval_set.json download
- assets/skill-template.md: SKILL.md template with codingbuddy pattern
  (Core principle, Iron Law, Phases, Red Flags)
- eval-viewer/generate_review.py: benchmark result HTML viewer generator
  with side-by-side with_skill/baseline comparison, assertion pass/fail
  coloring, feedback collection, and delta comparison support
- skill-template.md: match exact issue template (TODO placeholders,
  "Use this ESPECIALLY when:", "The Process" header, Additional resources)
- generate_review.py: add --benchmark flag for benchmark.json stats,
  summary section (pass_rate, tokens, time with mean±stddev),
  collapsible evidence display, per-scenario token/time metadata,
  pass_rate delta in previous iteration comparison
@JeremyDev87 JeremyDev87 added feat sub-issue 상위 이슈의 하위 작업 skill New skill addition to .ai-rules/skills/ labels Mar 21, 2026
@vercel
Copy link

vercel bot commented Mar 21, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

Project Deployment Actions Updated (UTC)
codingbuddy-landing Ready Ready Preview, Comment Mar 21, 2026 1:56pm

@JeremyDev87 JeremyDev87 self-assigned this Mar 21, 2026
@JeremyDev87 JeremyDev87 merged commit 420ff00 into master Mar 21, 2026
25 checks passed
@JeremyDev87 JeremyDev87 deleted the feat/skill-creator-assets-743 branch March 21, 2026 14:23
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

feat skill New skill addition to .ai-rules/skills/ sub-issue 상위 이슈의 하위 작업

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[E] Implement skill-creator assets/ + eval-viewer/ viewers and templates

1 participant