Zero-source LLM Hallucination Detection with Human-like Criteria Probing

Jiahao Yang, Shuhai Zhang, Hailong Kang, Feng Liu, Qi Chen, Mingkui Tan

✨ Abstract

Large language models (LLMs) often hallucinate by generating factually incorrect or unfaithful content, posing significant risks to their safe use. Detecting such hallucinations is particularly challenging under the zero-source constraint, where no model internals or external references are available, and detection must rely solely on the textual query–answer pair. In this paper, we propose Human-like Criteria Probing for Hallucination Detection (HCPD), a paradigm that emulates the multi-faceted reasoning of human evaluators. Its core is an Human-like Criteria Probing (HCP) mechanism, in which an LLM agent adaptively decomposes its judgment into a weighted set of interpretable criteria and aggregates criterion-specific scores into a final truthfulness measure. To achieve this adaptive capability, we introduce a reward-based alignment scheme using only weak supervision from semantic consistency. At inference, we employ a multi-sampling aggregation strategy to ensures robust decisions while preserving full interpretability. We further provide theoretical analysis supporting the reliability of our approach. Extensive experiments show that HCPD consistently outperforms state-of-the-art baselines, offering an effective and explainable solution for zero-source hallucination detection.

⚙️ Requirements

GPU: 2 × NVIDIA RTX GPUs with 80 GB memory
CUDA: 12.4
Python: 3.11
PyTorch: 2.6.0

💡 Virtual Environment

Create a virtual environment and install all required dependencies for training and evaluation.

bash setup.sh
conda activate HCPD

📂 Data and Pre-trained Models

Dataset: We use four widely adopted QA benchmarks (TriviaQA, SciQ (train), NQ Open, and CoQA) to construct the hallucination detection datasets. The generated datasets can be obtained and stored in ./generated_datasets by running the command below:

bash generate_datasets.sh

Pre-trained models: We adopt the Qwen2.5-7B-Instruct as the scoring agent and choose Llama-3.1-8B, Qwen3-8B as the evaluated target LLMs in the main experiments.

The datasets and pre-trained models will be automatically downloaded to ./.cache. Alternatively, they can be downloaded manually from the corresponding official repositories. After downloading, please configure the MODEL_PATH in the run scripts.

🚀 Quick Start

Pretrained checkpoints are provided in Google Drive. The results can be quickly verified using the following bash scripts.

bash quick_validation.sh

▶️ Main Experiments

Training and evaluation pipelines are provided through the following bash scripts.

TriviaQA:

bash run_TriviaQA.sh

SciQ:

bash run_SciQ.sh

NQ Open:

bash run_NQOpen.sh

CoQA:

bash run_CoQA.sh

Output Directory: Model checkpoints generated during training are saved to ./data_{metric}. Evaluation logs and test results are saved to ./logs.

📖 Citation

If you find this work useful in your research, please consider citing:

@inproceedings{yang2026zerosource,
  title={Zero-source {LLM} Hallucination Detection with Human-like Criteria Probing},
  author={Jiahao Yang and Shuhai Zhang and Hailong Kang and Feng Liu and Qi Chen and Mingkui Tan},
  booktitle={Forty-third International Conference on Machine Learning},
  year={2026},
  url={https://openreview.net/forum?id=s4Jn6bKYGI}
}

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
assets		assets
generated_datasets		generated_datasets
recipes		recipes
src		src
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
generate_datasets.sh		generate_datasets.sh
quick_validation.sh		quick_validation.sh
run_CoQA.sh		run_CoQA.sh
run_NQOpen.sh		run_NQOpen.sh
run_SciQ.sh		run_SciQ.sh
run_TriviaQA.sh		run_TriviaQA.sh
setup.cfg		setup.cfg
setup.py		setup.py
setup.sh		setup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Zero-source LLM Hallucination Detection with Human-like Criteria Probing

Jiahao Yang, Shuhai Zhang, Hailong Kang, Feng Liu, Qi Chen, Mingkui Tan

✨ Abstract

⚙️ Requirements

💡 Virtual Environment

📂 Data and Pre-trained Models

🚀 Quick Start

▶️ Main Experiments

📖 Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Zero-source LLM Hallucination Detection with Human-like Criteria Probing

Jiahao Yang, Shuhai Zhang, Hailong Kang, Feng Liu, Qi Chen, Mingkui Tan

✨ Abstract

⚙️ Requirements

💡 Virtual Environment

📂 Data and Pre-trained Models

🚀 Quick Start

▶️ Main Experiments

📖 Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages