This is the repo for our paper: HaluEval-Wild: Evaluating Hallucinations of Language Models in the Wild, in which we introduce HaluEval-Wild, the first benchmark specifically designed to evaluate LLM hallucinations in the wild.
HaluEval-Wild is released under the MIT License.
@misc{HaluEval-Wild,
  author        = {Zhu, Zhiying and Yang, Yiming and Sun, Zhiqing},
  title         = {{HaluEval-Wild}: Evaluating Hallucinations of Language Models in the Wild},
  year          = {2024},
  eprint        = {2403.04307},
  archivePrefix = {arXiv},
  url           = {https://arxiv.org/abs/2403.04307},
}