Research Publication Release

This release accompanies the publication of the technical note:

Grounding LLM Boolean Reasoning with a Deterministic Verification Engine: Architecture and Hallucination Benchmark across Six Models

DOI: https://doi.org/10.5281/zenodo.20623462

Summary

This work investigates failure modes of large language models on exact Boolean reasoning tasks and presents a deterministic verification framework that combines:

Natural language to Boolean expression translation
Formal Boolean evaluation over complete truth tables
Quine-McCluskey minimization
Machine-verifiable satisfiability checking

The accompanying benchmark evaluates multiple open and frontier LLMs on Boolean satisfiability tasks with deterministic ground truth, revealing systematic hallucination patterns and model-specific reasoning biases.

Resources

Technical Note (Zenodo DOI): https://doi.org/10.5281/zenodo.20623462
PyPI Package: https://pypi.org/project/boolean-algebra-engine/
Source Code: https://github.com/Shrivastava-Aditya/boolean-algebra-engine

Citation

If you use this work in research or benchmarking, please cite the Zenodo publication associated with this release.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Research Publication v1.0

Choose a tag to compare

Sorry, something went wrong.