Data in JSON format #1

Open
napsternxg opened this issue Feb 1, 2017 · 0 comments

Comments

@napsternxg

The data is in CSV format, and the train_bodies.txt file contains column values that span multiple lines. This may cause problems for many CSV readers and for anyone who simply splits the text on newlines and delimiters. I have converted the data to JSON and included a script that does the conversion. I think this will give participants an easier-to-use version of the data.
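To illustrate the problem, here is a minimal sketch and not the script from the pull request. It assumes the multi-line values are quoted, and the column names `Body ID` and `articleBody` and the sample rows are made up for the example. Python's `csv` module keeps a quoted multi-line field intact, while a plain newline split does not.

```python
import csv
import io
import json

# Hypothetical sample data (not from the real file): the first body
# contains a line break inside a quoted field.
SAMPLE = 'Body ID,articleBody\n0,"First line.\nSecond line."\n1,"Single line."\n'


def csv_to_records(handle):
    """Read CSV rows as dicts; quoted fields may span multiple lines."""
    return list(csv.DictReader(handle))


# Naive approach: splitting on newlines turns two data rows into three.
naive_rows = SAMPLE.strip().split("\n")[1:]

# csv-aware approach: the quoted newline stays inside the field.
records = csv_to_records(io.StringIO(SAMPLE))

# Serialize to JSON, where the line break becomes an escaped "\n".
as_json = json.dumps(records, indent=2)

print(len(naive_rows), "rows from the naive split")
print(len(records), "rows from csv.DictReader")
print(as_json)
```

When reading from a real file instead of a string, the `csv` documentation recommends opening it with `newline=""` so that line breaks inside quoted fields are passed through unchanged.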

Will send a pull request.

napsternxg added a commit to napsternxg/fnc-1 that referenced this issue Feb 1, 2017
* Single `train_combined.json` file with all data
* Included jupyter notebook with details on parsing the data
* Updated README with format details

Fixes FakeNewsChallenge#1