You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Added explicit document-visibility tiers for benchmark runs and downstream analysis: surfaced_docids for the full system-exposed retrieval pool, previewed_docids for result-page items actually shown to the model, and agent_docids for the union of documents the agent opened or cited. The benchmark runner, judge evaluation, run summarization, and Markdown reports now surface these tiers so retrieval diagnostics can distinguish hidden top-k availability from model-visible evidence and agent behavior.
Changed
Reconnected BM25 helper-side preview rendering to the active pi-search Anserini adapter, so search(...) once again hydrates top BM25 hits with cheap title/excerpt previews instead of showing only docid, score, and the fallback “No snippet available from this backend” message. This restores meaningful result-page visibility for the agent on the BM25 path without requiring extra read_document(...) calls just to understand top-ranked hits.
Moved the repo-local Anserini integration stack from src/bm25/ to src/search-providers/anserini/, keeping the package-owned pi-search adapter surface separate from provider-owned transport/process construction and updating docs/tests to reflect the clearer provider boundary. (commit 82e51cd)
Changed retrieval-evaluation/report wording from the ambiguous legacy "agent-set" framing toward explicit surfaced/previewed/agent-behavior semantics while retaining compatibility aliases for older run artifacts and downstream consumers.