zchuz/TimeBench

TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models

This repository contains the evaluation data, instructions, and demonstrations for the paper TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models (Chu et al., 2023).

Datasets

Symbolic Temporal Reasoning

Commonsense Temporal Reasoning

Event Temporal Reasoning

Models

Performance

Citation

If you find our work helpful, you can cite this paper as:

@misc{chu2023timebench,
      title={TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models}, 
      author={Zheng Chu and Jingchang Chen and Qianglong Chen and Weijiang Yu and Haotian Wang and Ming Liu and Bing Qin},
      year={2023},
      eprint={2311.17667},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2311.17667}
}
