Introduction

This repository provides code that converts the zipped XML datasets for the English Subtask B of SemEval-2016 and SemEval-2017 Task 3 (Community Question Answering) into a single JSON dataset.
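As a rough illustration of the general technique (zipped XML in, JSON out), here is a minimal sketch. It is not the code in this repository: the function names `zipped_xml_to_records` and `demo`, the `source` and `text` keys, and the toy `<root>`/`<item>` layout are all assumptions made for the example, and the real SemEval files have their own schema.

```python
import io
import json
import xml.etree.ElementTree as ET
import zipfile


def zipped_xml_to_records(archive):
    """Parse every .xml member of a zip archive into a list of plain dicts.

    Hypothetical helper: each direct child of the XML root becomes one record.
    """
    records = []
    with zipfile.ZipFile(archive) as zf:
        for name in sorted(zf.namelist()):
            if not name.endswith(".xml"):
                continue  # skip READMEs, scorers, and other non-XML members
            root = ET.fromstring(zf.read(name))
            for child in root:
                record = dict(child.attrib)  # XML attributes become JSON keys
                record["source"] = name  # remember which file the record came from
                record["text"] = "".join(child.itertext()).strip()
                records.append(record)
    return records


def demo():
    """Build a tiny in-memory archive so the sketch runs without any download."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as zf:
        zf.writestr(
            "sample.xml",
            '<root><item id="1">first</item><item id="2">second</item></root>',
        )
    buffer.seek(0)
    return json.dumps(zipped_xml_to_records(buffer), indent=2)
```

Calling `print(demo())` shows the two toy records serialized as a JSON list.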

Usage

Use the program as follows:

$ pip install -r requirements.txt
$ python __main__.py
 79% [........................................................               ] 17186816 / 21555267

The resulting dataset is written to the result.json file.
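Once the conversion has finished, the output can be inspected with the standard `json` module. The sketch below assumes only that result.json is valid JSON. The helper name `summarize` is hypothetical, and nothing here relies on the actual layout of the dataset, which this README does not describe.

```python
import json


def summarize(path="result.json"):
    """Load the converted dataset and report the shape of its top level."""
    with open(path, encoding="utf-8") as f:
        data = json.load(f)
    summary = {
        "type": type(data).__name__,
        # Only containers have a meaningful size.
        "size": len(data) if isinstance(data, (list, dict)) else None,
    }
    if isinstance(data, dict):
        # Show a few top-level keys as a starting point for exploration.
        summary["keys"] = sorted(data)[:5]
    return summary
```

Running `print(summarize())` in the directory that contains result.json prints the top-level type and size, which is a reasonable first step before writing code against the dataset.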

References

Please cite the following papers in your publications whenever you use this resource:

@InProceedings{nakov-EtAl:2016:SemEval,
  author    = {Nakov, Preslav and M\`{a}rquez, Llu\'{i}s and Magdy, Walid and Moschitti, Alessandro and Glass, Jim and Randeree, Bilal},
  title     = {{SemEval}-2016 Task 3: Community Question Answering},
  booktitle = {Proceedings of the 10th International Workshop on Semantic Evaluation},
  series    = {SemEval '16},
  month     = {June},
  year      = {2016},
  address   = {San Diego, California},
  publisher = {Association for Computational Linguistics},
}

@InProceedings{SemEval-2017:task3,
  author    = {Nakov, Preslav and Hoogeveen, Doris and M\`{a}rquez, Llu\'{i}s and Moschitti, Alessandro and Mubarak, Hamdy and Baldwin, Timothy and Verspoor, Karin},
  title     = {{SemEval}-2017 Task 3: Community Question Answering},
  booktitle = {Proceedings of the 11th International Workshop on Semantic Evaluation},
  series    = {SemEval '17},
  month     = {August},
  year      = {2017},
  address   = {Vancouver, Canada},
  publisher = {Association for Computational Linguistics},
}
