GitHub - SmartDataAnalytics/transformers_dialogue_evaluators: Resources to reproduce the results reported in the paper: "Language Model Transformers as Evaluators for Open-domain Dialogues".

Language Model Transformers as Evaluators for Open-domain Dialogues

This repository provides the resources to reproduce the results reported in the paper: "Language Model Transformers as Evaluators for Open-domain Dialogues" (link).

These are the instructions for reproducing the results. We provide the following scripts and resources:

Details about the contents

transformers_dialogue_evaluators.py
- the scripts compute probability scores for the ConvAI1 and ConvAI2 datasets using BERT, XLNet and GPT2
- Depending on the available hardware the script can take a day or even longer to execute and compute the results.
- just execute the script to obtain the results:
  - python -u transformers_dialogue_evaluators.py
convai(1|2)_results.pickle.bz2 - we provide the already computed probability scores as a shortcut for the correlation analysis
convai(1|2)_corr.ipynb - Jupyter notebooks that:
- calculate the various aggregated scores for dialogues
- compute the correlation scores
- visualize them in an interactive spreadsheet

Instructions

Python 3.6 is used to run the scripts. We recommend using a virtual environment like (Ana|Mini)conda. Steps:

Install dependencies
- pip install jupyter requests numpy scipy scikit-learn seaborn tqdm torch==1.3.1 transformers==2.2.1 pandas qgrid
Activate qgrid Jupyter extension
- jupyter nbextension enable --py --sys-prefix qgrid
- Skipping this step would prevent Jupyter from rendering an interactive spreadsheet with the correlation scores
Start Jupyter:
- jupyter notebook
Open and run all the cells in the notebooks
- the correlation scores should be computed and visualized
- sample dialogues used in the paper are shown

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
convai1_corr.ipynb		convai1_corr.ipynb
convai1_results.pickle.bz2		convai1_results.pickle.bz2
convai2_corr.ipynb		convai2_corr.ipynb
convai2_results.pickle.bz2		convai2_results.pickle.bz2
readme.md		readme.md
transformers_dialogue_evaluators.py		transformers_dialogue_evaluators.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Language Model Transformers as Evaluators for Open-domain Dialogues

Details about the contents

Instructions

About

Releases

Packages

Languages

SmartDataAnalytics/transformers_dialogue_evaluators

Folders and files

Latest commit

History

Repository files navigation

Language Model Transformers as Evaluators for Open-domain Dialogues

Details about the contents

Instructions

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages