QRData

Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data [Paper] [Project Website]

Benchmark

The questions for quantitative reasoning with data (QRData) are in benchmark/QRData.json. The file contains 411 questions, each with the following keys:

data_description
question
answer
data_files: a list of names of data files
meta_data: a dict containing reference, keywords, question_type, and multiple_choices (the possible choices when question_type is 'multiple_choice').
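The key layout above can be checked with a short loader. This is a minimal sketch, not part of the repository: it assumes the file is a top-level JSON list of question objects (the README does not state this), and the helper names `load_questions`, `missing_keys`, and `count_by_type` are hypothetical.

```python
import json
from collections import Counter

# Keys listed in this README for QRData questions.
REQUIRED_KEYS = {"data_description", "question", "answer", "data_files", "meta_data"}


def load_questions(path):
    """Load a benchmark file, ASSUMING it is a JSON list of question dicts."""
    with open(path, encoding="utf-8") as f:
        questions = json.load(f)
    if not isinstance(questions, list):
        raise ValueError("expected a top-level JSON list of questions")
    return questions


def missing_keys(question, required=REQUIRED_KEYS):
    """Return the documented keys that a question dict does not have."""
    return sorted(set(required) - set(question))


def count_by_type(questions):
    """Count questions per meta_data['question_type']."""
    return Counter(q["meta_data"]["question_type"] for q in questions)
```

For example, `count_by_type(load_questions("benchmark/QRData.json"))` would summarize how many questions of each type the file holds, provided the layout assumption is correct.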

Data files related to the questions are in benchmark/data.zip.
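To pair a question with its data, the names in data_files can be looked up inside the archive. The sketch below is an illustration under assumptions: the internal folder layout of benchmark/data.zip is not described here, so the hypothetical helper `read_data_files` matches archive members by base name only and silently skips names it cannot find.

```python
import zipfile
from pathlib import PurePosixPath


def read_data_files(zip_path, names):
    """Return {name: raw bytes} for each requested file found in the archive.

    Members are matched by base name because the archive's directory
    layout is an unknown here (assumption, not documented behaviour).
    """
    wanted = set(names)
    found = {}
    with zipfile.ZipFile(zip_path) as archive:
        for member in archive.namelist():
            if member.endswith("/"):
                continue  # skip directory entries
            base = PurePosixPath(member).name
            if base in wanted and base not in found:
                found[base] = archive.read(member)
    return found
```

A call such as `read_data_files("benchmark/data.zip", question["data_files"])` returns the raw bytes of each file, which can then be passed to whatever parser suits the file type.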

The questions for quantitative reasoning with text (QRText) are in benchmark/QRText.json. The file contains 290 questions, each with the following keys:

data_description
question
answer
meta_data: a dict containing reference, keywords, question_type, and multiple_choices (the possible choices when question_type is 'multiple_choice').

The evaluation script is in benchmark/eval.py.

Citation

Please cite our paper if this repository inspires your work.

@article{liu2024llms,
    title={Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data},
    author={Liu, Xiao and Wu, Zirui and Wu, Xueqing and Lu, Pan and Chang, Kai-Wei and Feng, Yansong},
    journal={arXiv preprint arXiv:2402.17644},
    year={2024}
}
