Skip to content
No description, website, or topics provided.
Branch: master
Clone or download
KodairaTomonori
KodairaTomonori [modify]
Latest commit c8a6933 Mar 23, 2016
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
BCCWJ_target_location [modify] Mar 22, 2016
Script fix Nov 30, 2015
annotation_data [modify] Mar 22, 2016
substitutes [modify] Mar 22, 2016
LICENCE Create LICENCE Jan 5, 2016
README.md [fix] English version Dec 1, 2015
README_ja.md

README.md

Evaluation Dataset for Japanese Lexical Simplification

Notes:

Sentences selected from BCCWJ, so they are not published.
Here, program which extract sentence is published.
This program is made by Python 2.7 .

Procedure:

git clone https://github.com/KodairaTomonori/EvaluationDataset
cd Script
python get\sent_from_BCCWJ.py xxxx/BCCWJ/SUW/
python extract_sentence_from_location.py

other Notes:

substitution ranking is in substitutes folder.
subs.csv: target word list
ave_rank.csv and mle_rank.csv: Substitutes in these file is sorted by average score and MLE score.
Cmma is indicated different rank, and space is indicated same rank.


Affiliation:
Tokyo Metropolitan University
System Design - Komachi Lab
Name: Kodaira Tomonori
E-mail: kodaira-tomonori-at-ed.tmu.ac.jp


You can’t perform that action at this time.