CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models

Co-first author: Tong Zhang (Sichuan University), Peixin Qin (Sichuan University)

This is the benchmark test dataset, called CLAMBER, which is used to evaluate LLMs using a well-organized taxonomy in terms of identifying and clarifying ambiguous information needs.

Paper

Click Me

Ambiguity Taxonomy in the era of LLM

Dataset Information

Name	Meaning	Values
question	user query	string
context	context of user query	string
clarifying_question	suggested clarifying question	string
require_clarification	If user query is ambiguous	0/1
category	ambiguity type	{"FD": "Epistemic Misalignment", "MC": "Aleatoric Output", "LA": "Linguistic Ambiguity"}
subclass	sub-type	{"whom": "WHOM", "what": "WHAT", "when": "WHEN", "where": "WHERE", "NK": "UNFAMILIAR", "ICL": "CONTRADICTION", "co-reference": "SEMANTIC", "polysemy": "LEXICAL"}

Reference

If you make advantage of the DREditor in your research, please cite the following in your manuscript:

@misc{zhang2024clamber,
      title={CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models}, 
      author={Tong Zhang and Peixin Qin and Yang Deng and Chen Huang and Wenqiang Lei and Junhong Liu and Dingnan Jin and Hongru Liang and Tat-Seng Chua},
      year={2024},
      booktitle = {Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL)},
}

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md
clamber_benchmark.jsonl		clamber_benchmark.jsonl
taxonomy.png		taxonomy.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models

Paper

Ambiguity Taxonomy in the era of LLM

Dataset Information

Reference

About

Releases

Packages

zt991211/CLAMBER

Folders and files

Latest commit

History

Repository files navigation

CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models

Paper

Ambiguity Taxonomy in the era of LLM

Dataset Information

Reference

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages