-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
how to get the file rcv1.tar.xz #1
Comments
You should get and compile CoNLL 2003 corpus. |
I see the 000README: NOTE: ADDED 16 AUGUST 2016 The Reuters Corpus is not distributed on a cd anymore but as a single
This should generate the three files eng.train, eng.testa and eng.testb Contact: erikt(at)xs4all.nl however, i only got rcv1.tar.gz from internet would you like share the rcv1.tar.xz to me? |
Sorry, I haven't that file. |
great |
how to generate
in https://github.com/patverga/torch-ner-nlp-from-scratch/tree/master/data/conll2003, I see: eng.testa move around some files, add data to repo cuz fuckit a year ago |
rcv1 is a set of 1393 news of Reuters Press. Which can be download from here https://trec.nist.gov/data/reuters/reuters.html |
I find rcv1.tar.gz on the internet
but it is not *.xz,
The text was updated successfully, but these errors were encountered: