Research Publication Release
This release accompanies the publication of the technical note:
Grounding LLM Boolean Reasoning with a Deterministic Verification Engine: Architecture and Hallucination Benchmark across Six Models
DOI: https://doi.org/10.5281/zenodo.20623462
Summary
This work investigates failure modes of large language models on exact Boolean reasoning tasks and presents a deterministic verification framework that combines:
- Natural language to Boolean expression translation
- Formal Boolean evaluation over complete truth tables
- Quine-McCluskey minimization
- Machine-verifiable satisfiability checking
The accompanying benchmark evaluates multiple open and frontier LLMs on Boolean satisfiability tasks with deterministic ground truth, revealing systematic hallucination patterns and model-specific reasoning biases.
Resources
- Technical Note (Zenodo DOI): https://doi.org/10.5281/zenodo.20623462
- PyPI Package: https://pypi.org/project/boolean-algebra-engine/
- Source Code: https://github.com/Shrivastava-Aditya/boolean-algebra-engine
Citation
If you use this work in research or benchmarking, please cite the Zenodo publication associated with this release.