You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Would it be possible to add some details on when each metric is useful and how to invoke only 1 metric to the readme?
The text was updated successfully, but these errors were encountered:
bionicles
changed the title
please add some description of metrics and why / why not to use them
please add more background on metrics and describe how to invoke just 1 in Readme
Dec 31, 2018
Our paper http://arxiv.org/pdf/1706.09799 describes all the metrics very briefly and cites the papers that first proposed these metrics so you could read those in more details. In the research community, there is not much of a consensus on which of these metrics work better (people measure correlation with human evaluation to figure out which metrics are more suited for their task and results vary a lot) so people usually report several metrics. From what I have observed, BLEU-4 and METEOR are the most widely used ones but CIDEr usage has been increasing.
Would it be possible to add some details on when each metric is useful and how to invoke only 1 metric to the readme?
The text was updated successfully, but these errors were encountered: