fix(reflect): let a fresh mental model short-circuit forced retrieval (no extra LLM call) by nicoloboschi · Pull Request #2011 · vectorize-io/hindsight

nicoloboschi · 2026-06-05T12:58:30Z

Summary

Fixes #1971.

Reflect forced the full hierarchical retrieval path on the first iterations:

search_mental_models -> search_observations -> recall

The root cause is one mechanism: those iterations set tool_choice to a named function, and a named tool_choice forbids the model from emitting done. So even when search_mental_models returned a fresh, directly-relevant mental model, the agent could not answer — it was obligated to call search_observations and then recall first, paying for the lower layers (and often re-reading the very facts the mental model already summarized).

The fix

After the forced search_mental_models result, decide deterministically — with no extra LLM call — whether to keep forcing:

If the call is low/mid budget and every retrieved mental model is explicitly fresh (is_stale is False) with non-empty content → stop forcing from the next iteration onward.
That next iteration happens regardless; it now runs under auto tool choice, so the agent either answers directly, or — having just read the mental model — issues its own targeted search_observations/recall.
Stale, empty, or missing mental models keep the full forced path.
High budget always keeps the full verification path.

The key insight: a reflect iteration always makes exactly one call_with_tools; tool_choice only changes what the model may emit, not whether a call happens. So the sufficiency decision can be folded into the next agentic step that already occurs, rather than added as a separate classifier call.

Why not a sufficiency-classifier LLM call

An alternative (see #2005) inserts a dedicated structured "is this mental model sufficient?" LLM call after the mental-model layer. That re-introduces a cost on the very path the issue is about:

Scenario	Before	Classifier approach	This PR
mental model sufficient	3 forced rounds	mm + classifier + done → net −1	mm + done → net −2
mental model not sufficient	3 forced rounds	mm + classifier + obs + recall → net +1	mm + obs/recall under auto → net 0

This PR never adds an LLM round-trip in any scenario, and the targeted follow-up query the issue asks for comes for free: under auto the model composes its own observations/recall query having just read the mental model, so it naturally targets the gap/conclusion.

The deterministic freshness/usability guard (is_stale is False + non-empty content; missing flag treated as unsafe) is what makes "release to auto" safe — stale/empty results never short-circuit. Reliability-over-latency callers use high budget, which preserves the full forced path.

Behavior

No mental models found → continue to the enabled lower layers (unchanged).
Any retrieved mental model stale or empty → continue to the lower layers.
high budget → full forced verification path (unchanged).
Fresh + usable mental models on low/mid budget → forced lower-level retrieval stops; the next iteration is auto.

Tests

hindsight-api-slim/tests/test_reflect_agent.py:

TestMentalModelFreshnessHelper — the deterministic guard (fresh, stale, missing-flag, blank-content, empty-list).
test_fresh_mental_model_releases_forced_retrieval — fresh mm → second iteration is auto, obs/recall never called, and no extra llm.call.
test_short_circuited_agent_may_still_retrieve_under_auto — after release the agent can still recall, by its own choice, with its own query.
test_stale_mental_model_keeps_forced_retrieval / test_no_mental_models_keeps_forced_retrieval — full forced path preserved.
test_high_budget_keeps_forced_path_for_fresh_mental_model — high budget unaffected.

This is a deterministic control-flow change (no prompt or model-interpretation change), so fast MockLLM unit tests are the appropriate coverage.

Commands run:

./scripts/hooks/lint.sh
uv run ty check hindsight_api/engine/reflect/agent.py
uv run pytest tests/test_reflect_agent.py (50 passed)

Reflect forced the full hierarchical path search_mental_models -> search_observations -> recall via a named tool_choice on the first iterations. Because a named tool_choice forbids the model from emitting `done`, the agent could never answer off a fresh, directly-relevant mental model — it always paid for the lower layers too (issue #1971). Fix: after the forced search_mental_models result, decide deterministically (no extra LLM call) whether to keep forcing. If the call is low/mid budget and every retrieved mental model is explicitly fresh (is_stale is False) with non-empty content, stop forcing from the next iteration on. That iteration — which happens regardless — now runs under `auto`, so the agent either answers directly or, having just read the mental model, issues its own targeted search_observations/recall. Stale, empty, or missing mental models keep the full forced path; high budget always keeps it. This reuses the agentic step that already occurs instead of adding a separate sufficiency-classifier LLM call, so the sufficient path saves two forced rounds and no path ever adds a round.

Two hs_llm_core end-to-end tests drive the real agent loop (stubbed search functions, real llm_config) to verify behaviour the deterministic MockLLM tests cannot: - fresh + sufficient mental model: the released agent answers off it and never calls search_observations/recall (judge-verified grounding); - stale mental model: no short-circuit, lower layers stay forced, and the agent corrects the stale summary using the freshly retrieved raw fact. The stale case (forcing is deterministic) is used rather than a "fresh-but-incomplete model retrieves deeper on its own" case, because whether a released model chooses to dig deeper is model-dependent and not something the fix guarantees — only release-to-auto is guaranteed.

nicoloboschi added 2 commits June 5, 2026 14:57

nicoloboschi mentioned this pull request Jun 5, 2026

fix(reflect): gate mental-model retrieval by policy #2005

Closed

nicoloboschi merged commit c255d35 into main Jun 5, 2026
80 checks passed

nicoloboschi mentioned this pull request Jun 8, 2026

fix: prefer mental model abstraction in reflect to avoid unnecessary low-level retrieval #2012

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(reflect): let a fresh mental model short-circuit forced retrieval (no extra LLM call)#2011

fix(reflect): let a fresh mental model short-circuit forced retrieval (no extra LLM call)#2011
nicoloboschi merged 2 commits into
mainfrom
fix/reflect-mental-model-short-circuit

nicoloboschi commented Jun 5, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

nicoloboschi commented Jun 5, 2026

Summary

The fix

Why not a sufficiency-classifier LLM call

Behavior

Tests

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant