Research Publication v1.0

Shrivastava-Aditya released this 10 Jun 08:54
e97eddf

Research Publication Release

This release accompanies the publication of the technical note:

Grounding LLM Boolean Reasoning with a Deterministic Verification Engine: Architecture and Hallucination Benchmark across Six Models

DOI: https://doi.org/10.5281/zenodo.20623462

Summary

This work investigates failure modes of large language models on exact Boolean reasoning tasks and presents a deterministic verification framework that combines:

  • Natural language to Boolean expression translation
  • Formal Boolean evaluation over complete truth tables
  • Quine-McCluskey minimization
  • Machine-verifiable satisfiability checking

The accompanying benchmark evaluates six open and frontier LLMs on Boolean satisfiability tasks with deterministic ground truth, revealing systematic hallucination patterns and model-specific reasoning biases.
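
To make the idea of deterministic ground truth concrete, here is a minimal Python sketch of truth-table evaluation and a brute-force satisfiability check. It is an illustration only and is not taken from the released code: the names `truth_table`, `find_model`, and `verify_claim` are hypothetical, and it assumes the natural-language input has already been translated into a Python callable over named Boolean variables.

```python
from itertools import product
from typing import Callable, Dict, List, Optional, Sequence, Tuple

Assignment = Dict[str, bool]
Formula = Callable[[Assignment], bool]


def truth_table(variables: Sequence[str], expr: Formula) -> List[Tuple[Assignment, bool]]:
    """Evaluate expr on every one of the 2**n assignments (hypothetical helper)."""
    rows = []
    for values in product([False, True], repeat=len(variables)):
        assignment = dict(zip(variables, values))
        rows.append((assignment, bool(expr(assignment))))
    return rows


def find_model(variables: Sequence[str], expr: Formula) -> Optional[Assignment]:
    """Return the first satisfying assignment, or None if expr is unsatisfiable."""
    for assignment, result in truth_table(variables, expr):
        if result:
            return assignment
    return None


def verify_claim(variables: Sequence[str], expr: Formula, claimed_satisfiable: bool) -> bool:
    """Check a claimed SAT/UNSAT answer (e.g. from an LLM) against the exhaustive result."""
    return (find_model(variables, expr) is not None) == claimed_satisfiable


# Example formulas, written by hand for illustration.
formula = lambda v: (v["a"] or v["b"]) and not v["a"]   # (a OR b) AND NOT a
contradiction = lambda v: v["a"] and not v["a"]         # a AND NOT a

print(find_model(["a", "b"], formula))
print(verify_claim(["a"], contradiction, claimed_satisfiable=True))
```

Exhaustive enumeration grows as 2^n in the number of variables, so this approach suits the small formulas where an exact answer is cheap to compute; the sketch does not attempt Quine-McCluskey minimization.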

Citation

If you use this work in research or benchmarking, please cite the Zenodo publication associated with this release.