ref(evals): Move eval harness into workspace package #331

Merged
dcramer merged 2 commits into main from ref/evals-workspace-package
May 18, 2026

@dcramer (Member) commented May 18, 2026

Move Warden evals out of the production package tree and into a private packages/evals workspace package. The root package now delegates eval commands into that package, while npm packaging and Warden config keep eval fixtures out of shipped and runtime surfaces.

Fixture Provenance

Source-captured fixtures now include repository, source_ref, fixture-to-source mappings, and copied license support files. The scaffold command now requires and captures a root license so new fixtures carry the same metadata.
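
The provenance fields described above could be modeled roughly as follows. This is a minimal sketch, not the harness's actual schema: only `repository` and `source_ref` are named in the description, while `files`, `license_files`, and `validateProvenance` are hypothetical names chosen for illustration, and treating `source_ref` as a hex commit SHA is an assumption.

```typescript
// Hypothetical shape for fixture provenance metadata. Only `repository` and
// `source_ref` are named in the PR description; the rest is assumed.
interface FixtureProvenance {
  repository: string; // upstream repository the fixture was captured from
  source_ref: string; // upstream commit the fixture was captured at (assumed to be a SHA)
  files: Record<string, string>; // fixture path -> upstream source path (assumed name)
  license_files: string[]; // copied license support files (assumed name)
}

// Returns a list of human-readable problems; an empty list means the record
// carries every piece of provenance checked here.
function validateProvenance(p: Partial<FixtureProvenance>): string[] {
  const problems: string[] = [];
  if (!p.repository) {
    problems.push("missing repository");
  }
  if (!p.source_ref || !/^[0-9a-f]{7,40}$/.test(p.source_ref)) {
    problems.push("source_ref should be a commit SHA");
  }
  if (!p.files || Object.keys(p.files).length === 0) {
    problems.push("missing fixture-to-source mappings");
  }
  if (!p.license_files || p.license_files.length === 0) {
    problems.push("missing license support files");
  }
  return problems;
}
```

A scaffold step could refuse to write a fixture whenever the returned list is non-empty, which mirrors the "requires and captures a root license" behavior described above.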

Repository Tooling

Lint now runs `oxlint .` with explicit generated and fixture ignores, eval CI watches `packages/evals/**`, and the docs and playbooks point at the new package layout.

Move the eval harness, fixtures, and scaffold command into a private workspace package so they stay out of the production package surface.

Backfill source provenance and license support files for captured fixtures so scenarios can be reproduced at the recorded upstream SHA.

Co-Authored-By: GPT-5 Codex <noreply@anthropic.com>

@cursor cursor Bot left a comment

Cursor Bugbot has reviewed your changes and found 1 potential issue.

Reviewed by Cursor Bugbot for commit d79da46.

Comment thread packages/evals/src/verify.ts
Pass verifier supporting fixture files through to eval repo setup so source licenses and other support context are copied before verification runs.

Add a regression test covering the verifier path.

Co-Authored-By: GPT-5 Codex <noreply@anthropic.com>
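
The follow-up commit above passes supporting fixture files through to eval repo setup. The copy step might look something like the sketch below. It is an illustration under stated assumptions, not the code in `packages/evals/src/verify.ts`: the helper name `copySupportFiles`, its parameters, and the temporary directory layout are all hypothetical.

```typescript
import { copyFileSync, existsSync, mkdirSync, mkdtempSync, writeFileSync } from "node:fs";
import { tmpdir } from "node:os";
import { dirname, join } from "node:path";

// Hypothetical helper: copy each supporting file (given relative to the
// fixture directory) into the eval repo, preserving its relative path.
// Throws if a listed file is missing so verification never runs without it.
function copySupportFiles(fixtureDir: string, repoDir: string, supportFiles: string[]): string[] {
  const copied: string[] = [];
  for (const rel of supportFiles) {
    const from = join(fixtureDir, rel);
    if (!existsSync(from)) {
      throw new Error(`supporting fixture file not found: ${rel}`);
    }
    const to = join(repoDir, rel);
    mkdirSync(dirname(to), { recursive: true }); // create nested target directories
    copyFileSync(from, to);
    copied.push(to);
  }
  return copied;
}

// Usage: stage a LICENSE file from a throwaway fixture directory.
const fixtureDir = mkdtempSync(join(tmpdir(), "fixture-"));
const repoDir = mkdtempSync(join(tmpdir(), "eval-repo-"));
writeFileSync(join(fixtureDir, "LICENSE"), "MIT\n");
const copied = copySupportFiles(fixtureDir, repoDir, ["LICENSE"]);
```

Failing loudly on a missing support file is a design choice in this sketch; a regression test for the verifier path could assert that the license exists in the eval repo before verification starts.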
@dcramer dcramer marked this pull request as ready for review May 18, 2026 04:31
@dcramer dcramer merged commit 2fff92b into main May 18, 2026
19 checks passed
@dcramer dcramer deleted the ref/evals-workspace-package branch May 18, 2026 04:31