finetune-transformer-lm

Code and model for the paper "Improving Language Understanding by Generative Pre-Training"

Currently this code implements the ROCStories Cloze Test result reported in the paper by running: python train.py --dataset rocstories --desc rocstories --submit --analysis --data_dir [path to data here]

Note: The code is currently non-deterministic due to various GPU ops. The median accuracy of 10 runs with this codebase (using default hyperparameters) is 85.8% - slightly lower than the reported single run of 86.5% from the paper.

The ROCStories dataset can be downloaded from the associated website.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
model		model
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
analysis.py		analysis.py
datasets.py		datasets.py
opt.py		opt.py
text_utils.py		text_utils.py
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

model

model

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

analysis.py

analysis.py

datasets.py

datasets.py

opt.py

opt.py

text_utils.py

text_utils.py

train.py

train.py

utils.py

utils.py

Repository files navigation

finetune-transformer-lm

About

Releases

Packages

Languages

License

chenjianshu/finetune-transformer-lm

Folders and files

Latest commit

History

Repository files navigation

finetune-transformer-lm

About

Resources

License

Stars

Watchers

Forks

Languages