
Long Range Transformers

Several transformer variants claim to handle long contexts, but most of them have only been tested on synthetic benchmarks such as LRA or on language modeling, so their ability to comprehend long texts remains largely unexplored. The goal of this project is to verify the effectiveness of long-range transformers on more practical NLP tasks: do they really work on NLP tasks that involve long texts? If not, why not, and how can we make them work?

Tasks

  1. Coreference
  2. NLI
  3. Abstractive QA
  4. Extractive QA
  5. Summarization

Datasets

  • OntoNotes for coref
  • DocNLI for NLI
  • Qasper for abstractive QA
  • TriviaQA for extractive QA
  • SummFD and CNN for summarization
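As a concrete starting point, the sketch below shows one way some of these corpora could be loaded with the HuggingFace `datasets` library. The hub identifiers (`trivia_qa`, `allenai/qasper`) are assumptions and not defined by this repository; the splits used in the experiments may differ.

```python
# A minimal sketch, not part of this repository: loading two of the corpora
# above with the HuggingFace `datasets` library. The hub identifiers are
# assumptions and may differ from the data actually used in the experiments.
from datasets import load_dataset

# Extractive QA: TriviaQA, reading-comprehension ("rc") configuration.
trivia = load_dataset("trivia_qa", "rc", split="validation[:100]")
print(trivia[0]["question"])

# Abstractive QA over full scientific papers: Qasper.
qasper = load_dataset("allenai/qasper", split="validation[:10]")
print(qasper[0]["title"])
```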

Model

  • Coarse2fine model for coref (located in this folder)
  • A baseline model for DocNLI (located in this folder)
  • A baseline model for abstractive QA (located in this folder)
  • A baseline model for extractive QA (located in this folder)
  • A baseline model for summarization (located in this folder)
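For illustration, the following sketch shows a generic long-context baseline of the kind studied here: Longformer with an extractive-QA head, via the HuggingFace `transformers` library. This is not the repository's own implementation; the checkpoint name is an assumption, and its QA head is randomly initialized unless a checkpoint fine-tuned for QA is substituted.

```python
# An illustrative baseline sketch, not the repository's implementation:
# Longformer with an extractive-QA head for long documents.
import torch
from transformers import AutoTokenizer, LongformerForQuestionAnswering

name = "allenai/longformer-base-4096"  # assumed checkpoint, QA head untrained
tokenizer = AutoTokenizer.from_pretrained(name)
model = LongformerForQuestionAnswering.from_pretrained(name)
model.eval()

question = "Who proposed the coarse-to-fine model?"
context = "..."  # a long document, up to 4096 tokens after tokenization

# Longformer places global attention on the question tokens automatically
# when no global_attention_mask is passed.
inputs = tokenizer(question, context, return_tensors="pt",
                   truncation=True, max_length=4096)
with torch.no_grad():
    outputs = model(**inputs)

# Decode the most likely answer span from the start/end logits.
start = int(outputs.start_logits.argmax())
end = int(outputs.end_logits.argmax())
answer = tokenizer.decode(inputs["input_ids"][0][start:end + 1])
print(answer)
```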

Experiments

Coref

See this doc

NLI

See this doc
