L/small changes 12 by lorenss-m · Pull Request #386 · hud-evals/hud-python

lorenss-m · 2026-04-01T01:05:09Z

Note

Medium Risk
Moderate risk: changes core evaluation helpers to be async (including subprocess execution) and extends CLI task discovery/import behavior, which could affect existing integrations and task loading in real projects.

Overview
Adds a significantly expanded hud.native evaluation toolkit: graders are now fully async, BashGrader uses async subprocesses with timeout_seconds, and a new Grade.gather() runs multiple graders/subscores in parallel. This also introduces LLMJudgeGrader (rubric-based LLM judging) plus built-in answer comparison/normalization helpers, and exports them via hud.native.

Improves CLI task collection to better match real project layouts: supports importing a directory as a package (running its own discovery), recursively finds **/task.py, and adds project-root sys.path injection to fix cross-module imports; includes extensive new tests for these edge cases.

Updates agent console output to show a tool discovery table at init and a structured per-step tool call/result summary at INFO, and refreshes docs for the new graders API plus minor doc link/typo fixes.

^{Written by Cursor Bugbot for commit 2d5b407. This will update automatically on new commits. Configure here.}

mintlify · 2026-04-01T01:05:26Z

Preview deployment for your docs. Learn more about Mintlify Previews.

Project	Status	Preview	Updated (UTC)
hud	🟢 Ready	View Preview	Apr 1, 2026, 1:05 AM

…/small-changes-12

cursor

Cursor Bugbot has reviewed your changes and found 2 potential issues.

^{Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

lorenss-m added 3 commits March 31, 2026 18:01

small changes

da7967e

rf

e08f36b

sm

d988a0d

mintlify Bot deployed to staging - docs April 1, 2026 01:05 View deployment

cursor Bot reviewed Apr 1, 2026

View reviewed changes

Comment thread hud/cli/utils/collect.py

lorenss-m added 2 commits March 31, 2026 18:56

mg

8c912cd

pg

e312d80

mintlify Bot deployed to staging - docs April 1, 2026 01:57 View deployment

d

5a3882e

mintlify Bot deployed to staging - docs April 1, 2026 01:58 View deployment

Merge branch 'main' of https://github.com/hud-evals/hud-python into l…

56b76df

…/small-changes-12

mintlify Bot deployed to staging - docs April 1, 2026 02:01 View deployment

cursor Bot reviewed Apr 1, 2026

View reviewed changes

Comment thread hud/native/graders.py Outdated

lorenss-m added 2 commits March 31, 2026 19:04

fx mg

211ab1b

sm

afbe2c5

cursor Bot reviewed Apr 1, 2026

View reviewed changes

Comment thread hud/native/graders.py

lorenss-m added 2 commits March 31, 2026 19:21

tn

8ef1b99

dcs

73a1552

mintlify Bot deployed to staging - docs April 1, 2026 02:24 View deployment

cursor Bot reviewed Apr 1, 2026

View reviewed changes

Comment thread hud/native/__init__.py Outdated

Comment thread hud/native/graders.py Outdated

ok

2d5b407

lorenss-m merged commit c5f1c23 into main Apr 1, 2026
10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

L/small changes 12#386

L/small changes 12#386
lorenss-m merged 12 commits into
mainfrom
l/small-changes-12

lorenss-m commented Apr 1, 2026 •

edited by cursor Bot

Loading

Uh oh!

mintlify Bot commented Apr 1, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cursor Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

lorenss-m commented Apr 1, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mintlify Bot commented Apr 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cursor Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

lorenss-m commented Apr 1, 2026 •

edited by cursor Bot

Loading

mintlify Bot commented Apr 1, 2026 •

edited

Loading