Skip to content
/ KoROUGE Public

Calculating ROUGE score for Korean (Wrapper for ROUGE-1.5.5.pl script)

License

Notifications You must be signed in to change notification settings

hong8e/KoROUGE

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

46 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

KoROUGE

Calculating ROUGE scores for Korean (Wrapper for original ROUGE score perl script ROUGE-1.5.5.pl)
Existing the ROUGE summarization evaluation package does not support Korean. KoROUGE is designed to make getting ROUGE scores easier by automatically analyzing your summaries into morphemes and converting them into alphabets.
Given two files (system, reference) with the same number of lines, The KoROUGE calculates the average ROUGE scores of each line.
KoROUGE uses files2rouge by Paul Tardy : https://github.com/pltrdy/files2rouge.

1. Requirements and Installation

  • Python version >= 3.6
  • To install KoROUGE
pip install konlpy
pip install -U git+https://github.com/pltrdy/pyrouge

git clone https://github.com/pltrdy/files2rouge.git     
cd files2rouge
python setup_rouge.py
python setup.py install

git clone https://github.com/hong8e/KoROUGE.git
cd KoROUGE
python setup.py install

sudo apt install default-jdk
apt-get install libxml-parser-perl

2.Run

  • All sentences in the system summary and reference summary must start with <t> and end with </t>
    e.g. <t> 나는 학교에 간다 </t> <t> 나는 공부한다. </t>
KoROUGE $system_summary $gold_summary
# Basically, KoROUGE automatically analyzes words into morphemes and calcuate ROUGE scores,
# if you want to calculate ROUGE with word-level just add options "--no_morpheme_analyze".
  • Options
usage: KoROUGE [-h] [-nm] [-nl] [-n N_GRAM] [-s SAVETO] system reference

positional arguments:
  system                Path to system summary file
  reference             Path to reference file

optional arguments:
  -h, --help            show this help message and exit
  -nm, --no_morpheme_analyze
                        Turn off morphological analyzer
  -nl, --no_rouge_l     Do not calculate ROUGE-L
  -n N_GRAM, --n_gram N_GRAM
                        Change max-ngram
  -s SAVETO, --saveto SAVETO
                        File to save scores
  • Output
morph analyzing
calculating rouge
Preparing documents... 0 line(s) ignored
Running ROUGE...
---------------------------------------------
1 ROUGE-1 Average_R: 1.00000 (95%-conf.int. 1.00000 - 1.00000)
1 ROUGE-1 Average_P: 1.00000 (95%-conf.int. 1.00000 - 1.00000)
1 ROUGE-1 Average_F: 1.00000 (95%-conf.int. 1.00000 - 1.00000)
---------------------------------------------
1 ROUGE-2 Average_R: 1.00000 (95%-conf.int. 1.00000 - 1.00000)
1 ROUGE-2 Average_P: 1.00000 (95%-conf.int. 1.00000 - 1.00000)
1 ROUGE-2 Average_F: 1.00000 (95%-conf.int. 1.00000 - 1.00000)
---------------------------------------------
1 ROUGE-3 Average_R: 1.00000 (95%-conf.int. 1.00000 - 1.00000)
1 ROUGE-3 Average_P: 1.00000 (95%-conf.int. 1.00000 - 1.00000)
1 ROUGE-3 Average_F: 1.00000 (95%-conf.int. 1.00000 - 1.00000)
---------------------------------------------
1 ROUGE-L Average_R: 1.00000 (95%-conf.int. 1.00000 - 1.00000)
1 ROUGE-L Average_P: 1.00000 (95%-conf.int. 1.00000 - 1.00000)
1 ROUGE-L Average_F: 1.00000 (95%-conf.int. 1.00000 - 1.00000)

License

KoROUGE is MIT-licensed.

About

Calculating ROUGE score for Korean (Wrapper for ROUGE-1.5.5.pl script)

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages