Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parsing NLP-progress into a structured JSON #186

Merged
merged 13 commits into from
Jan 15, 2019

Conversation

rstojnic
Copy link
Contributor

Parse the unstructured Markdown files into a structured JSON format where the tasks, subtasks, datasets, subdatasets (ie dataset partitions), metrics and data/paper/code links have all been parsed and converted into a structured JSON format.

Also modifies grammatical_error_correction.md to follow the common way of specifying dataset partitions (ie as used in other files).

@sebastianruder
Copy link
Owner

Thanks, Robert! This is great and very appreciated. Will test the pipeline soon and get back to you.

@sebastianruder
Copy link
Owner

Hi @rstojnic,
Sorry that it took me so long to get to this. Just tested it and seems to work great. One minor thing: Is the empty requirements.txt there in case a future version of the extractor might require some dependencies?
I'm happy to merge the PR unless you want to make more changes in this branch.

@sebastianruder sebastianruder merged commit 3bc7646 into sebastianruder:master Jan 15, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants