GitHub - alexmatton/Faster_Transformers: Project for CS224N

Faster Transformers

Faster Transformers is the project Amaury Sabran (@https://github.com/amaurySabran) and I decided to work on for Stanford CS224N 2018/19 class.

The purpose of the project was to apply the Transformer Architecture to summarization. We benchmarked several architectures close to the Transformer and derived new models that make it both faster and more efficient for this specific task. We also analyzed the speed of each component of the Transformer to determine where the overall architecture can be improved for real-life applications.

Our final report can be found here: https://github.com/Nutemm/Faster_Transformers/blob/master/Project_final_report%20CS224N.pdf

Requirements

Install torchvision (https://pytorch.org/)
Install our fairseq fork from sources: https://github.com/amaurySabran/fairseq
Install tensorflow (for dataloading)
Install py-rouge (pip install py-rouge)

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
others		others
scripts		scripts
.gitignore		.gitignore
Project_final_report CS224N.pdf		Project_final_report CS224N.pdf
Readme.md		Readme.md
compute_rouge.py		compute_rouge.py
compute_rouge_from_file.py		compute_rouge_from_file.py
data.py		data.py
dictionary.py		dictionary.py
model_embeddings.py		model_embeddings.py
models.py		models.py
speed_analysis.py		speed_analysis.py
test.py		test.py
train.py		train.py
vocab.py		vocab.py

alexmatton/Faster_Transformers

Folders and files

Latest commit

History

Repository files navigation

Faster Transformers

Requirements

About

Resources

Stars

Watchers

Forks

Languages