# Trial data for SemEval-2021 Task 11: NLPContributionGraph
The repository is organized as follows:
README.md
[task-name-folder]/ # machine-translation, named-entity-recognition, question-answering, relation-classification, text-classification
├── [article-counter-folder]/ # ranges from 0 to 9, since 10 articles were annotated per task
│   ├── [articlename].pdf # scholarly article PDF
│   ├── [articlename]-Grobid-out.txt # plaintext output from the [Grobid parser](https://github.com/kermitt2/grobid)
│   ├── [articlename]-Stanza-out.txt # plaintext preprocessed output from [Stanza](https://github.com/stanfordnlp/stanza)
│   ├── sentences.txt # annotated Contribution sentences in the article
│   ├── entities.txt # annotated entities in the Contribution sentences
│   ├── info-units/ # folder containing the information units in JSON format
│   │   ├── research-problem.json # the mandatory `research problem` information unit in JSON format
│   │   ├── model.json # the `model` information unit in JSON format; in some articles it is called `approach`
│   │   └── ... # there are 12 information units in all, and each article may be annotated with 3 or 6 of them
│   └── triples/ # folder containing the information-unit triples, one per line
│       ├── research-problem.txt # `research problem` triples (one research-problem statement per line)
│       ├── model.txt # `model` triples (one statement per line)
│       └── ... # there are 12 information units in all, and each article may be annotated with 3 or 6 of them
└── ... # ten articles are annotated per task, so this repeats nine more times
... # five tasks were selected overall, so this repeats four more times
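
To make the layout concrete, below is a minimal Python sketch that walks the tree and loads each article's annotations. It assumes the repository root is the current working directory, UTF-8 text files, and the five task-folder names listed above; the variable names and the printed summary are illustrative, not part of the data release.

```python
import json
from pathlib import Path

# Task folders as listed in the tree above.
TASKS = [
    "machine-translation",
    "named-entity-recognition",
    "question-answering",
    "relation-classification",
    "text-classification",
]

root = Path(".")  # assumed: run from the repository root

for task in TASKS:
    # Article folders are named 0..9, one per annotated article.
    for article_dir in sorted((root / task).glob("[0-9]")):
        # Annotated Contribution sentences, one per line.
        sentences = (article_dir / "sentences.txt").read_text(encoding="utf-8").splitlines()

        # Information units, keyed by unit name (e.g. "research-problem").
        info_units = {
            f.stem: json.loads(f.read_text(encoding="utf-8"))
            for f in (article_dir / "info-units").glob("*.json")
        }

        # Triples, one statement per line, grouped by information unit.
        triples = {
            f.stem: f.read_text(encoding="utf-8").splitlines()
            for f in (article_dir / "triples").glob("*.txt")
        }

        print(f"{task}/{article_dir.name}: "
              f"{len(sentences)} sentences, units: {sorted(info_units)}")
```

Since each article's `info-units/` and `triples/` folders contain only the units actually annotated for that article, iterating over the folder contents (rather than a fixed list of the 12 unit names) keeps the sketch robust to articles with fewer units.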