Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

how to use lang-8 and CoNLL data? #6

Closed
XiaoranJin opened this issue Jun 8, 2016 · 5 comments
Closed

how to use lang-8 and CoNLL data? #6

XiaoranJin opened this issue Jun 8, 2016 · 5 comments

Comments

@XiaoranJin
Copy link

XiaoranJin commented Jun 8, 2016

Hi, I've got lang-8 and CoNLL dataset. Any clue about how to input the data into your training scripts?

Would really appreciate if you can show what the data structure is in "nlc-train.tar" and "nlc-valid.tar".

Thanks!

@avati
Copy link
Collaborator

avati commented Jun 8, 2016

Something like this:

$ tar tvf nlc-valid.tar 
-rw-r--r-- avati/avati  312524 2016-04-11 14:17 valid.x.txt
-rw-r--r-- avati/avati  323750 2016-04-11 14:17 valid.y.txt

$ tar tvf nlc-train.tar 
-rw-r--r-- avati/users 50842953 2016-05-19 10:44 train.x.txt
-rw-r--r-- avati/users 51878318 2016-05-19 10:45 train.y.txt

You can also look at nlc_data.py to see what the code checks for.

@avati avati closed this as completed Jun 8, 2016
@kbpranay
Copy link

kbpranay commented May 26, 2017

hey guys anyone got a link for lang 8 data for training? If yes please send me a download link. @XiaoranJin @avati

@ghost
Copy link

ghost commented Apr 7, 2018

@XiaoranJin
Did you understand what the data structure is in "nlc-train.tar" and "nlc-valid.tar" ?

@morusu
Copy link

morusu commented Aug 3, 2018

@rajism did you have the "nlc-train.tar" and "nlc-valid.tar" ? can u share me?

@morusu
Copy link

morusu commented Aug 3, 2018

@avati did you have the "nlc-train.tar" and "nlc-valid.tar" ? can u share me?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants