As part of the course "Computational Semantics for NLP" at ETH Zurich, we explore different methods of enabling transformer-based LMs to handle long text sequences. Our baseline methods are Random selection, Bias towards Start+End, and TF-IDF. Beyond these baselines, we select context using sentence embeddings from SBERT and experiment with different chunk sizes. We evaluate several LMs on question answering over the QuALITY dataset in order to find the model best suited to that task. A sketch of the SBERT-based selection idea is shown below.
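The following is a minimal sketch of the SBERT-based selection (not the project code): it splits a passage into fixed-size word chunks, embeds the chunks and the question with a sentence-transformers model, and keeps the chunks most similar to the question as the LM's context. The model name, chunk size, and function name are illustrative choices, not taken from this repository.

```python
# Minimal sketch (not the project code): pick the passage chunks most
# relevant to a question with SBERT, so a fixed-context LM only sees
# the selected text. Model name and sizes are example choices.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")  # example SBERT model

def select_chunks(passage: str, question: str,
                  chunk_size: int = 64, top_k: int = 5) -> str:
    """Split the passage into word chunks and keep the top_k chunks
    most similar to the question, in their original order."""
    words = passage.split()
    chunks = [" ".join(words[i:i + chunk_size])
              for i in range(0, len(words), chunk_size)]
    chunk_emb = model.encode(chunks, convert_to_tensor=True)
    question_emb = model.encode(question, convert_to_tensor=True)
    scores = util.cos_sim(question_emb, chunk_emb)[0]  # one score per chunk
    keep = sorted(scores.topk(min(top_k, len(chunks))).indices.tolist())
    return " ".join(chunks[i] for i in keep)
```

The TF-IDF baseline follows the same pattern, except chunk relevance is scored by cosine similarity between TF-IDF vectors of the question and the chunks instead of embedding similarity.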
To run the experiments for the different models, run the following notebooks:
- `Experiments_Deberta.ipynb`
- `Experiments_Longformer.ipynb`
- `Experiments_Roberta.ipynb`
Make sure that all `.py` files from the repository are present in the working directory, as the notebooks depend on them.
To reproduce the plots used in our analysis section, run the notebook:
- `Analysis.ipynb`