GPT-2

Status: Archive (code is provided as-is, no updates expected)

Code for the paper "Language Models are Unsupervised Multitask Learners". License: MIT.

Fine tuning on custom datasets

To retrain the GPT-2 117M model on a custom text dataset:

PYTHONPATH=src ./train.py --dataset <file|directory|glob>
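
The `--dataset` argument accepts a single file, a directory, or a glob pattern. As a rough illustration of how such an argument could be expanded into a list of input files, here is a minimal sketch. The helper name `resolve_dataset_paths` is hypothetical and is not taken from `train.py`; the actual loader may behave differently, for example in how it orders or filters files.

```python
import glob
import os


def resolve_dataset_paths(spec):
    """Expand a file, directory, or glob pattern into a sorted list of files.

    Hypothetical helper for illustration only; not the repository's loader.
    """
    if os.path.isfile(spec):
        return [spec]
    if os.path.isdir(spec):
        # Walk the directory tree and collect every regular file.
        found = []
        for root, _dirs, files in os.walk(spec):
            for name in files:
                found.append(os.path.join(root, name))
        return sorted(found)
    # Otherwise treat the argument as a glob pattern.
    return sorted(p for p in glob.glob(spec) if os.path.isfile(p))
```

Checking the file and directory cases before falling back to glob matching means a plain path is never misread as a pattern.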

If you want to precompute the dataset's encoding for multiple runs, you can instead use:

PYTHONPATH=src ./encode.py <file|directory|glob> /path/to/encoded.npz
PYTHONPATH=src ./train.py --dataset /path/to/encoded.npz
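
Precomputing pays off because tokenization is done once and every later run only loads integer arrays. The sketch below shows the general idea with NumPy's `.npz` format. It is not the code in `encode.py`: `toy_encode` is a stand-in that maps text to UTF-8 byte values (GPT-2 itself uses a byte-pair encoding), and `save_encoded` and `load_encoded` are hypothetical names chosen for this example.

```python
import numpy as np


def toy_encode(text):
    """Stand-in tokenizer: maps text to its UTF-8 byte values.

    Assumption for illustration only; GPT-2 uses a byte-pair encoding instead.
    """
    return np.frombuffer(text.encode("utf-8"), dtype=np.uint8).astype(np.int32)


def save_encoded(texts, out_path):
    """Encode each text once and store the token arrays in a compressed .npz."""
    chunks = [toy_encode(t) for t in texts]
    np.savez_compressed(out_path, *chunks)


def load_encoded(path):
    """Load the token arrays back in their original order."""
    with np.load(path) as npz:
        # Positional arrays are stored under the keys arr_0, arr_1, ...
        return [npz["arr_%d" % i] for i in range(len(npz.files))]
```

For example, `save_encoded(["first document", "second document"], "/path/to/encoded.npz")` followed by `load_encoded("/path/to/encoded.npz")` returns two integer arrays, one per input text.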

To run distributed training on multiple GPUs or machines using Horovod:

mpirun -np 4 \
    -H localhost:4 \
    -bind-to none -map-by slot \
    -x NCCL_DEBUG=INFO -x LD_LIBRARY_PATH -x PATH \
    -x PYTHONPATH=src \
    -mca pml ob1 -mca btl ^openib \
    /home/jovyan/gpt-2/train-horovod.py --dataset encoded.npz

Citation

Please use the following BibTeX entry:

@article{radford2019language,
  title={Language Models are Unsupervised Multitask Learners},
  author={Radford, Alec and Wu, Jeff and Child, Rewon and Luan, David and Amodei, Dario and Sutskever, Ilya},
  year={2019}
}

The FB2_2_txt.xsl conversion file is forked from https://github.com/kmrov/fb2_2_rtf
