Skip to content

Commit

Permalink
first commit
Browse files Browse the repository at this point in the history
  • Loading branch information
moguranosenshi committed Jul 4, 2016
1 parent 4139656 commit 5fd18bd
Showing 1 changed file with 9 additions and 2 deletions.
11 changes: 9 additions & 2 deletions README.md
@@ -1,2 +1,9 @@
# sscorpus
A monolingual parallel corpus for sentence simplification
# sscorpus: A monolingual parallel corpus for sentence simplification

This corpus contains 492,993 aligned sentences extracted by pairing Simple English Wikipedia with English Wikipedia.
These source data were downloaded in May 2016.

The form of each line in the corpus:
`original sentence <TAB> simple sentence <TAB> similarity score`

For questions, please contact [Tomoyuki Kajiwara at Tokyo Metropolitan University](https://sites.google.com/site/moguranosenshi/).

0 comments on commit 5fd18bd

Please sign in to comment.