No description, website, or topics provided.
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
bin/snippet
raw
settings
src
.classpath
.project
20-half.py
80-20.py
README.md

README.md

fyp

crawl.java ~ to get text data from websites

Match.java ~ to align the english and chinese data by paragraph

sentences.java ~ to separate the paragraph data to sentences

MergeFile.java ~ to merge many files to one

80-20.py and 20-half.py ~ to split the files to train(80%), development(10%) and test(10%) data