Parsing NLP-progress into a structured JSON #186

rstojnic · 2018-12-17T18:25:35Z

Parses the unstructured Markdown files into a structured JSON format in which the tasks, subtasks, datasets, subdatasets (i.e. dataset partitions), metrics, and data/paper/code links are all represented explicitly.

Also modifies grammatical_error_correction.md to follow the common way of specifying dataset partitions (i.e. the way used in the other files).
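To illustrate the general idea, here is a minimal sketch of turning a Markdown results table into JSON. It is not the parser from this PR: the function names (`split_row`, `row_to_entry`, `parse_markdown`), the JSON keys, and the sample Markdown with its example.com links are all hypothetical, and it assumes one `#` heading per task, `###` headings for datasets, and a `Model` column in each table.

```python
import json
import re

# Hypothetical sample in the style of a task file; not taken from the repository.
SAMPLE_MD = """\
# Example task

### Example dataset

| Model | F1 | Paper / Source | Code |
| ------------- | :-----:| --- | --- |
| Model A (Author et al., 2018) | 91.2 | [Paper A](https://example.com/a) | [Official](https://example.com/code-a) |
| Model B (Author et al., 2017) | 89.5 | [Paper B](https://example.com/b) | |
"""

LINK_RE = re.compile(r"\[([^\]]+)\]\(([^)]+)\)")  # [title](url)
SEP_RE = re.compile(r"^:?-+:?$")                  # table separator cell, e.g. :---:


def split_row(line):
    """Split one Markdown table row into stripped cell strings."""
    line = line.strip()
    if line.startswith("|"):
        line = line[1:]
    if line.endswith("|"):
        line = line[:-1]
    return [cell.strip() for cell in line.split("|")]


def row_to_entry(header, cells):
    """Map one table row to a dict with model name, metrics and links."""
    entry = {"model": None, "metrics": {}, "links": []}
    for name, cell in zip(header, cells):
        if name.lower() == "model":
            entry["model"] = cell
            continue
        links = LINK_RE.findall(cell)
        if links:
            entry["links"].extend({"title": t, "url": u} for t, u in links)
        elif cell:
            # Assumption: any other non-empty, link-free cell is a metric value.
            entry["metrics"][name] = cell
    return entry


def parse_markdown(md):
    """Parse task heading, dataset headings and result tables into a dict."""
    task = {"task": None, "datasets": []}
    header = None
    for line in md.splitlines():
        if line.startswith("# "):
            task["task"] = line[2:].strip()
        elif line.startswith("### "):
            task["datasets"].append({"dataset": line[4:].strip(), "sota": []})
            header = None
        elif line.lstrip().startswith("|") and task["datasets"]:
            cells = split_row(line)
            if header is None:
                header = cells  # first table row is the header
            elif all(SEP_RE.match(c) for c in cells):
                continue  # skip the |---|---| separator row
            else:
                task["datasets"][-1]["sota"].append(row_to_entry(header, cells))
        else:
            header = None  # any non-table line ends the current table
    return task


print(json.dumps(parse_markdown(SAMPLE_MD), indent=2))
```

Keeping links in a separate list from metrics means a consumer of the JSON does not have to guess which columns hold scores and which hold references.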

sebastianruder · 2018-12-18T16:01:23Z

Thanks, Robert! This is great and very appreciated. Will test the pipeline soon and get back to you.

sebastianruder · 2019-01-14T13:33:42Z

Hi @rstojnic,
Sorry that it took me so long to get to this. Just tested it and seems to work great. One minor thing: Is the empty requirements.txt there in case a future version of the extractor might require some dependencies?
I'm happy to merge the PR unless you want to make more changes in this branch.

rstojnic added 9 commits December 16, 2018 18:27

First complete version of the parser

5862e37

First complete run, now just need to fix the edge cases.

1152669

Rename the folder, better handling of tables

0bdccbf

Handling of subdatasets

67e2a42

Make this file consistent with how subdatasets are specified in other markdown files.

8b18976

Extract all the links in the dataset description and put them into a structured format

f9ce215

Merge remote-tracking branch 'upstream/master'

3a36428

tweak wording

90ac471

Missing comments

cec6800

rstojnic and others added 4 commits January 6, 2019 17:37

Merge remote-tracking branch 'upstream/master'

4dcc3bb

Add attribution

2214afd

Model name now only extracts the model name - not the author tags as well

4eac155

Merge branch 'new_model_name' into HEAD

8454f1a

sebastianruder merged commit 3bc7646 into sebastianruder:master Jan 15, 2019
