Akshay Kumar Gupta, Shantanu Kumar, Surag Nair, Barun Patra
Automatic evaluation of dialogue response generation systems is a long-standing challenge in the field: most automatic metrics in common use have been shown to correlate weakly, if at all, with human judgments of dialogue quality. We propose a novel automatic evaluation method that uses a trained deep learning model to score responses. We hope that this method addresses the shortcomings of traditional evaluation metrics and aligns more closely with human scoring.