Temporal Reasoning in NLI

This repo contains the scripts to build the Temporal NLI dataset and also to run different models on it as described in the following paper:

Vashishtha, Siddharth, Adam Poliak, Yash Kumar Lal, Benjamin Van Durme, Aaron Steven White. Temporal Reasoning in Natural Language Inference. Findings of the Association for Computational Linguistics: EMNLP 2020, November, 2020.

@inproceedings{vashishtha-etal-2020-temporal,
    title = "Temporal Reasoning in Natural Language Inference",
    author = "Vashishtha, Siddharth  and
      Poliak, Adam  and
      Lal, Yash Kumar  and
      Van Durme, Benjamin  and
      White, Aaron Steven",
    booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2020",
    month = nov,
    year = "2020",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2020.findings-emnlp.363",
    pages = "4070--4078",
    abstract = "We introduce five new natural language inference (NLI) datasets focused on temporal reasoning. We recast four existing datasets annotated for event duration{---}how long an event lasts{---}and event ordering{---}how events are temporally arranged{---}into more than one million NLI examples. We use these datasets to investigate how well neural models trained on a popular NLI corpus capture these forms of temporal reasoning.",
}

Python Environment

We use pipenv to run our scripts in a Python virtualenv. You can replicate the environment by cloning this repo and running the following from the root dir of this repo:

pipenv install --ignore-pipfile

If you don't have pipenv, you can install it by running:

pip install pipenv

Dataset Creation

There are two steps to creating our recasted datasets:

Download the original datasets. Instructions here
Run recasting scripts. Instructions here

Train from Scratch or Evaluate best models

To train on our models from scratch or to use our best models, follow instructions here. Our saved roberta models can be downloaded by following instructions here

Updates from the first published version:

We made the following updates to our recasted data from the first published version:

To get the verb inflections, we use English Unimorph. If any inflection is not found in Unimorph, we back-off to LemmInflect
We added copular predicates from TempEval3, TimeBank-Dense, and RED corpus to our recasted data. We parse each corpus through Stanza to get the dependency trees of sentences in the corpus and then generate hypothesis for the NLI pair using rules described in the paper.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
data		data
saved_models		saved_models
src		src
LICENSE		LICENSE
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Temporal Reasoning in NLI

Python Environment

Dataset Creation

Train from Scratch or Evaluate best models

Updates from the first published version:

About

Releases

Packages

Languages

License

sidsvash26/temporal_nli

Folders and files

Latest commit

History

Repository files navigation

Temporal Reasoning in NLI

Python Environment

Dataset Creation

Train from Scratch or Evaluate best models

Updates from the first published version:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages