Releases: teamchong/pxpipe
Releases · teamchong/pxpipe
Release list
v0.7.1
Fixed
- Relocated env block is now wrapped in
<system-reminder>tags. The
volatile# Environmenttext that pxpipe moves out of the cached system
prefix used to be appended to the last user message as bare prose — on an
empty or short user turn it could read as the user's entire message, and
models would mis-attribute it ("your message consisted of environment
metadata"). The block now carries an explicit provenance header
("Context relocated by pxpipe from the system prompt … not written by the
user"), fixing attribution. No cache impact: the wrapper rides the volatile
tail behind all cache breakpoints (~60 chars/request).
v0.7.0 — Fable 5 demo, telemetry, slab stability
Added
- Per-request telemetry:
stop_reason+ safety-flag logging. Every proxied
request now records how it ended, so refusal/classifier trips are measurable
instead of anecdotal. - Headless bench: multi-turn
claude -pdriver +events.jsonlscorer for
fast, non-interactive A/B runs; plus a constant-cost render-style eval harness. PXPIPE_DUMP_DIRpersists rendered PNGs per request for demo/debug
inspection of exactly what the model saw.- Dashboard/factsheet: one-time cache-create losses tagged in the recent
table; factsheet carries occurrence counts with ticket-style codes. - Demos:
claude-sonnet-5arm support; fable arm runsclaude-fable-5[1m]
(1M ctx) to match opus/sonnet.
Fixed
- Imaged slab frozen at first render. Volatile content (skill listings, cwd
caches) stays out of the imaged prefix so turn-2 system sha matches turn-1 —
no more silent cache-create churn between turns. - Volatile env text relocated behind all cache breakpoints (not just the
first), plus cross-session slab stability. - Refusal-classifier defusing: provenance-framed slab banner and reworded
tool-docs stub/header — eliminates spuriousreasoning_extractionrefusals
on compressed context. - Render fit to 1568×728 (~1.15 MP) on the Anthropic path for WYSIWYG
glyphs (what we rasterize is what the model samples). - Demo cost-ab arms run
--no-chromefor reproducible token baselines.
Removed
compressSchemasknob (superseded by slab stability work).
Docs
- Fable 5 side-by-side demo in the README with verified numbers from the
recording — same two tasks, same answers: plain $42.21 / 96% context vs
pxpipe $4.51 — plus the honest caveat (compressed arm needed one nudge for
single-reply format) and the full attempt log in
demo/effective-context/ATTEMPTS.md. - Node transform hook documented as kill-switch only.