IBM/sciqa-arcade198-dataset

AI2 Reasoning Challenge Annotated Dataset (ARCADE198)

This is ARCADE198, the human-annotated AI2 Reasoning Challenge (ARC) dataset from the following paper:

A Systematic Classification of Knowledge, Reasoning, and Context within the ARC Dataset 
Boratko, M.; Padigela, H.; Mikkilineni, D.; Yuvraj, P.; Das, R.; McCallum, A.; Chang, M.; Fokoue, A.; Kapanipathi, P.; Mattei, N.; Musa, R.; Talamadupula, K.; and Witbrock, M.
ACL 2018 Machine Reading for Question Answering (MRQA) Workshop

The ARCADE198 dataset was generated using the annotation system from:

An Interface for Annotating Science Questions 
Boratko, M.; Padigela, H.; Mikkilineni, D.; Yuvraj, P.; Das, R.; McCallum, A.; Chang, M.; Fokoue, A.; Kapanipathi, P.; Mattei, N.; Musa, R.; Talamadupula, K.; and Witbrock, M.
EMNLP 2018 System Demonstration Program

Use of the ARCADE198 Dataset

To use this dataset, please:

  • Cite the two papers above using the following BibTeX entries:
@inproceedings{BoPaMiYu18,
	Author = {M. Boratko and H. Padigela and D. Mikkilineni and P. Yuvraj and R. Das and A. McCallum and M. Chang and A. Fokoue-Nkoutche and P. Kapanipathi and N. Mattei and R. Musa and K. Talamadupula and M. Witbrock},
	Booktitle = {{Proceedings of the Machine Reading for Question Answering (MRQA) Workshop at ACL 2018}},
	Date-Added = {2018-06-06 19:16:13 +0000},
	Date-Modified = {2018-06-06 19:18:45 +0000},
	Title = {A Systematic Classification of Knowledge, Reasoning, and Context within the ARC Dataset},
	Year = {2018}}
@inproceedings{BoPaMiYu18-2,
	Author = {M. Boratko and H. Padigela and D. Mikkilineni and P. Yuvraj and R. Das and A. McCallum and M. Chang and A. Fokoue-Nkoutche and P. Kapanipathi and N. Mattei and R. Musa and K. Talamadupula and M. Witbrock},
	Booktitle = {{Proceedings of the Empirical Methods in Natural Language Processing (EMNLP) 2018 System Demonstration Program}},
	Title = {An Interface for Annotating Science Questions},
	Year = {2018}}

Link to Dataset

Please download here: ARCADE198 Dataset

Blogpost

Here is a blog post that describes the dataset and discusses the associated work in more detail.