Code and Data for our Findings of ACL 2021 paper titled 'Improving Automated Evaluation of Open Domain Dialog via Diverse Reference Augmentation. Varun Gangal *, Harsh Jhamtani *, Eduard Hovy, Taylor Berg-Kirkpatrick'
- Relevant original and augmented reference files in are present in 'ref_files/' in the required format
- Human ratings file: 'human_rating_correlation/mturk_rating_processed_output.csv'. Please consider citing Gupta et al if you use the human ratings file.
Code and script to compute metric correlations with human ratings can be found in 'human_rating_correlation/' directory
- Python 3.7.5
- bert_score (0.3.7)
- nlgeval(Accessed: December 2020)
- scipy 1.1.0
@inproceedings{acl2021dialogeval,
title={Improving Automated Evaluation of Open Domain Dialog via Diverse Reference Augmentation},
author={Gangal, Varun and Jhamtani, Harsh and Hovy, Ed and Berg-Kirkpatrick, Taylor},
booktitle={Findings of ACL},
year={2021}
}