Skip to content

omrikeren/ParaShoot

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

ParaShoot

A Hebrew question and answering dataset in the style of SQuAD, based on articles scraped from Wikipedia. The dataset contains a few thousand crowdsource-annotated pairs of questions and answers, in a setting suitable for few-shot learning.

For more details and quality analysis, see the paper.

Dataset Statistics

#Items #Articles #Paragraphs
Train 1792 295 565
Dev 221 33 63
Test 1025 165 319
Total 3038 493 947

Citing

If you use ParaShoot in your research, please cite the ParaShoot paper:

@inproceedings{keren2021parashoot,
  title={ParaShoot: A Hebrew Question Answering Dataset},
  author={Keren, Omri and Levy, Omer},
  booktitle={Proceedings of the 3rd Workshop on Machine Reading for Question Answering},
  pages={106--112},
  year={2021}
}

About

A Hebrew Question Answering Dataset

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors