Skip to content

felixgiov/UDST-DurationQA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 

Repository files navigation

Dataset from the paper "Improving Event Duration Question Answering by Leveraging Existing Temporal Information Extraction Data"

Introduction

Understanding event duration is essential for understanding natural language. However, the amount of training data for tasks like duration question answering, i.e., McTACO, is very limited, suggesting a need for external duration information to improve this task. The duration information can be obtained from existing temporal information extraction tasks, such as UDS-T and TimeBank, where more duration data is available. A straightforward two-stage fine-tuning approach might be less likely to succeed given the discrepancy between the target duration question answering task and the intermediary duration classification task. This paper resolves this discrepancy by automatically recasting an existing event duration classification task from UDS-T to a question answering task similar to the target McTACO. We investigate the transferability of duration information by comparing whether the original UDS-T duration classification or the recast UDS-T duration question answering can be transferred to the target task. Our proposed model achieves a 13% Exact Match score improvement over the baseline on the McTACO duration question answering task, showing that the two-stage fine-tuning approach succeeds when the discrepancy between the target and intermediary tasks are resolved.

Overview of the dataset

We recast our data from the UDS-T dataset. We provide the train/dev/test split as specified in the paper under data/. In each file, there are lines of tab-separated data, each line representing an instance of a question-answer pair. Each line contains the following information:

sentence \t question \t answer \t label

  • sentence: a context sentence where the question is based on.
  • question: a question asking about the duration of an event in the sentence.
  • answer: a potential answer to the question.
  • label: whether the answer is a plausible answer. "yes" indicates the answer is plausible, "no" otherwise.

Citation

If you use this dataset, please cite our paper "Felix Giovanni Virgo, Fei Cheng, Sadao Kurohashi. Improving Event Duration Question Answering by Leveraging Existing Temporal Information Extraction Data. Proceedings of the 13th International Conference on Language Resources and Evaluation (LREC), (2022)"

Bibtex for citations:

@InProceedings{virgo-cheng-kurohashi:2022:LREC,
  author    = {Virgo, Felix  and  Cheng, Fei  and  Kurohashi, Sadao},
  title     = {Improving Event Duration Question Answering by Leveraging Existing Temporal Information Extraction Data},
  booktitle      = {Proceedings of the Language Resources and Evaluation Conference},
  month          = {June},
  year           = {2022},
  address        = {Marseille, France},
  publisher      = {European Language Resources Association},
  pages     = {4451--4457},
  abstract  = {Understanding event duration is essential for understanding natural language. However, the amount of training data for tasks like duration question answering, i.e., McTACO, is very limited, suggesting a need for external duration information to improve this task. The duration information can be obtained from existing temporal information extraction tasks, such as UDS-T and TimeBank, where more duration data is available. A straightforward two-stage fine-tuning approach might be less likely to succeed given the discrepancy between the target duration question answering task and the intermediary duration classification task. This paper resolves this discrepancy by automatically recasting an existing event duration classification task from UDS-T to a question answering task similar to the target McTACO. We investigate the transferability of duration information by comparing whether the original UDS-T duration classification or the recast UDS-T duration question answering can be transferred to the target task. Our proposed model achieves a 13\% Exact Match score improvement over the baseline on the McTACO duration question answering task, showing that the two-stage fine-tuning approach succeeds when the discrepancy between the target and intermediary tasks are resolved.},
  url       = {https://aclanthology.org/2022.lrec-1.473}
}

About

Dataset from the paper "Improving Event Duration Question Answering by Leveraging Existing Temporal Information Extraction Data"

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published