faultline v0.4.0
Deterministic silent-failure testing for AI agents — no LLM judge.
Highlights
- Token-only hosted push (
FAULTLINE_TOKENis all you need) - Loud, false-green-proof results (
.ok/.failed/.assert_ok()) - Six CLI modes that all gate CI: run, probe, fuzz, scenarios, replay, mine
- Marketplace-grade GitHub Action (mode + fail-on-silent inputs, verdict outputs)
- Detector upgrades: display-arg FP fix, derived-value FN layer, pandas + dict support
Measured (85-case adversarial benchmark, independently audited, reproducible — see benchmark/MEASUREMENT.md): 97.5% recall · 2.2% false-alarm · 100% deterministic.
pip install faultline