No description, website, or topics provided.
Switch branches/tags
Nothing to show
Clone or download
Pull request Compare This branch is 1 commit ahead, 1 commit behind kite1988:master.
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
README.md
smsCorpus_en_sql_2015.03.09_all.zip
smsCorpus_en_xml_2015.03.09_all.zip
smsCorpus_zh_sql_2015.03.09.zip
smsCorpus_zh_xml_2015.03.09.zip

README.md

NUS SMS Corpus

Due to some technicial problems, the NUS SMS Corpus website http://wing.comp.nus.edu.sg/SMSCorpus is temporally unavailable. For your convenience, we upload the most recent release (Mar 9, 2015) of the corpus here.

Please cite the following paper if you use our corpus. Thanks!

Tao Chen and Min-Yen Kan (2013). Creating a Live, Public Short Message Service Corpus: The NUS SMS Corpus. Language Resources and Evaluation, 47(2)(2013), pages 299-355.

Language File Format Size Number of Messages
English SQL 2,045K 55,835
English XML 2,359K 55,835
English JSON 2,740K 55,835
Chinese SQL 979K 31,465
Chinese XML 1,182K 31,465
Chinese JSON 1,700K 31,465

Our dataset has been added to Kaggle! Please consider participating a competition!