GitHub - csnlp/nlp_metrics: Common metrics for dialogue system evaluations

NLP 对话领域的常用自动评价指标

BLEU

关于BLEU的讲解:BLEU 关于BLEU的代码:BLEU 相关依赖: nltk 使用方法:

import bleu

from bleu import cal_bleu

# sentence: a str separated by space; e.g. "Hello World."
# references: a list of sentences: e.g. ["Hello Me", "Good World"]
# weights: a list of number which be summed together is 1; [0.1, 0.2, 0.3, 0.4]. This is for BLEU-4
  The weight for unigram, bigram, 3-gram, 4-gram is [0.1, 0.2, 0.3, 0.4]

BLEU-SCORE = cal_bleu(sentence, references, weights)

ROUGE

DISTINCT

distinct is firstly proposed by Jiwei Li et.al in paper for the diversity evaluation of generated responses.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
README.md		README.md
bleu.py		bleu.py
commit.sh		commit.sh
distinct.py		distinct.py
distinct_test.txt		distinct_test.txt
utils.py		utils.py
utils.pyc		utils.pyc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BLEU

ROUGE

DISTINCT

About

Releases

Packages

Languages

csnlp/nlp_metrics

Folders and files

Latest commit

History

Repository files navigation

BLEU

ROUGE

DISTINCT

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages