CoNLL-2003 data #1

Closed
malagus opened this issue Jun 14, 2020 · 2 comments

Comments

@malagus

malagus commented Jun 14, 2020

Hi. I'm working on my master's thesis on using neural networks for text capitalization. I was wondering whether you apply any additional data cleaning (for example, removing sentences that are fully capitalized) or use the data as is?

@raymondhs
Owner

Hello, sorry for the late reply. Yes, news headlines, which are written entirely in upper case, are discarded (this is also mentioned in Footnote 1 of the paper).
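
For anyone reproducing this, here is a minimal sketch of such a filter. It is not the repository's actual preprocessing code. It assumes the usual CoNLL-2003 column layout (token in the first column, blank lines between sentences, `-DOCSTART-` marker lines), and it assumes a sentence counts as a headline when every token that contains a letter is fully upper case. The helper names and the sample lines are made up for illustration.

```python
from typing import Iterable, Iterator, List


def read_sentences(lines: Iterable[str]) -> Iterator[List[str]]:
    """Yield each sentence as a list of tokens from CoNLL-style lines."""
    sentence: List[str] = []
    for line in lines:
        line = line.strip()
        # Blank lines and -DOCSTART- markers end the current sentence.
        if not line or line.startswith("-DOCSTART-"):
            if sentence:
                yield sentence
                sentence = []
            continue
        # Assumption: the token is the first whitespace-separated column.
        sentence.append(line.split()[0])
    if sentence:
        yield sentence


def is_all_upper(tokens: List[str]) -> bool:
    """True if at least one token has a letter and all such tokens are upper case."""
    with_letters = [t for t in tokens if any(c.isalpha() for c in t)]
    return bool(with_letters) and all(t.isupper() for t in with_letters)


def drop_uppercase_sentences(sentences: Iterable[List[str]]) -> List[List[str]]:
    """Keep only sentences that are not written entirely in upper case."""
    return [s for s in sentences if not is_all_upper(s)]


# Illustrative lines in CoNLL-2003 layout (not copied from the dataset).
SAMPLE = """-DOCSTART- -X- -X- O

SOCCER NN B-NP O
- : O O
JAPAN NNP B-NP B-LOC
WINS VBZ B-VP O
. . O O

Japan NNP B-NP B-LOC
won VBD B-VP O
. . O O
""".splitlines()

kept = drop_uppercase_sentences(read_sentences(SAMPLE))
print(kept)  # [['Japan', 'won', '.']]
```

Punctuation and number tokens are ignored when deciding, so a headline such as `SOCCER - JAPAN WINS .` is still dropped. A stricter or looser criterion may be what the paper uses, so check Footnote 1 before relying on this.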

@malagus
Author

malagus commented Jul 3, 2020

Thanks for the information. Somehow I missed that footnote.
