Product Manager, writer, and occasional strength coach. Building deeply technical products that don't feel that way to use.
Pinned
- debate-degrades-reasoning (Public)
  Single-round debate degrades LLM reasoning in symmetric settings: 2,100 evaluations, 11 conditions, two benchmarks
  Python
- receipt-gated-pipelines (Public)
  Receipt-Gated Pipelines: Cryptographic Verification of Tool-Call Claims in Multi-Agent LLM Systems
  Python
- structure-beats-scale (Public)
  Structure Beats Scale: How Structured Review Outperforms Brute-Force Generation in LLM Code Synthesis
  Python



