Skip to content

v0.2.3

Choose a tag to compare

@justram justram released this 03 Apr 13:53
· 20 commits to main since this release

v0.2.3

Added

  • Added explicit document-visibility tiers for benchmark runs and downstream analysis: surfaced_docids for the full system-exposed retrieval pool, previewed_docids for result-page items actually shown to the model, and agent_docids for the union of documents the agent opened or cited. The benchmark runner, judge evaluation, run summarization, and Markdown reports now surface these tiers so retrieval diagnostics can distinguish hidden top-k availability from model-visible evidence and agent behavior.

Changed

  • Reconnected BM25 helper-side preview rendering to the active pi-search Anserini adapter, so search(...) once again hydrates top BM25 hits with cheap title/excerpt previews instead of showing only docid, score, and the fallback “No snippet available from this backend” message. This restores meaningful result-page visibility for the agent on the BM25 path without requiring extra read_document(...) calls just to understand top-ranked hits.
  • Moved the repo-local Anserini integration stack from src/bm25/ to src/search-providers/anserini/, keeping the package-owned pi-search adapter surface separate from provider-owned transport/process construction and updating docs/tests to reflect the clearer provider boundary. (commit 82e51cd)
  • Changed retrieval-evaluation/report wording from the ambiguous legacy "agent-set" framing toward explicit surfaced/previewed/agent-behavior semantics while retaining compatibility aliases for older run artifacts and downstream consumers.