GitHub - epsdg/text-classifiers: BERT, OpenAI GPT & GPT-2, and XLNet for classification. TensorFlow and PyTorch

Pretrained Text Classifiers

Jupyter/Colab notebooks with implementations of several pretrained language models for classification.

TensorFlow Models (TPU)

XLNet
BERT

Based on the original Google AI Research (BERT) and CMU/Google Brain (XLNet) implementations. Tested in Colab on a Cloud TPU v2. Both models support binary and multi-label classification.
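As a rough illustration of the difference between the two task types, the sketch below turns raw classifier logits into predictions. It is a minimal sketch, not the code from the notebooks: the function names, the label names, and the 0.5 threshold are all assumptions made for the example. Binary classification is treated here as one sigmoid score thresholded to 0 or 1, and multi-label classification as an independent sigmoid per label, so any number of labels can be active at once.

```python
import math


def sigmoid(x: float) -> float:
    """Map a raw logit to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def predict_binary(logit: float, threshold: float = 0.5) -> int:
    """Binary case: a single score, thresholded to class 0 or 1."""
    return int(sigmoid(logit) >= threshold)


def predict_multi_label(logits, label_names, threshold: float = 0.5):
    """Multi-label case: each label is scored independently,
    so zero, one, or several labels may be returned."""
    return [name for name, z in zip(label_names, logits) if sigmoid(z) >= threshold]


# Hypothetical logits and label names, for illustration only.
binary_prediction = predict_binary(2.0)
multi_label_prediction = predict_multi_label(
    [2.0, -1.0, 0.3], ["sports", "politics", "tech"]
)
print(binary_prediction)
print(multi_label_prediction)
```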

The TPU models must read and write data through Google Cloud Storage rather than the local file system attached to the Colab instance.
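In practice this means that data and output directories are given as `gs://` URIs rather than local paths. The helper below is a small sketch of how such paths might be built and checked before a run; the bucket name `my-bucket`, the directory layout, and both function names are hypothetical and do not come from the notebooks.

```python
from urllib.parse import urlparse


def is_gcs_path(path: str) -> bool:
    """Return True if the path looks like gs://<bucket>/..."""
    parsed = urlparse(path)
    return parsed.scheme == "gs" and parsed.netloc != ""


def gcs_join(bucket: str, *parts: str) -> str:
    """Join a bucket name and path segments into a gs:// URI,
    dropping stray leading or trailing slashes."""
    cleaned = [p.strip("/") for p in parts if p.strip("/")]
    return "gs://" + "/".join([bucket.strip("/")] + cleaned)


# Hypothetical bucket and layout, for illustration only.
DATA_DIR = gcs_join("my-bucket", "data")
OUTPUT_DIR = gcs_join("my-bucket", "models/", "bert")

for path in (DATA_DIR, OUTPUT_DIR, "/content/data"):
    status = "ok" if is_gcs_path(path) else "local path, not reachable from the TPU"
    print(path, "->", status)
```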

PyTorch Models (GPU)

The PyTorch models currently support only binary classification.

Models were tested in Colab on a single NVIDIA Tesla T4 GPU. They will run on other CUDA devices with less memory, but only with a lower maximum sequence length and/or batch size, which significantly affects training time.
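The two constraints can be sketched in plain Python. This is an illustrative example under stated assumptions, not the notebooks' preprocessing: the function names are hypothetical, and reserving two positions for special tokens assumes a BERT-style single-sequence input with `[CLS]` and `[SEP]`. The second helper shows why a smaller batch size lengthens training: the same data set needs more optimizer steps per epoch.

```python
import math


def truncate(token_ids, max_seq_length: int, num_special_tokens: int = 2):
    """Cut a token sequence so that it still fits within max_seq_length
    after the special tokens are added (assumed: 2, as for [CLS] and [SEP])."""
    keep = max(0, max_seq_length - num_special_tokens)
    return token_ids[:keep]


def steps_per_epoch(num_examples: int, batch_size: int) -> int:
    """Number of optimizer steps needed to see every example once."""
    return math.ceil(num_examples / batch_size)


# A hypothetical 300-token document, cut to fit a 128-position input.
tokens = list(range(300))
print(len(truncate(tokens, max_seq_length=128)))

# Hypothetical data set of 10,000 examples at two batch sizes.
print(steps_per_epoch(10_000, batch_size=32))
print(steps_per_epoch(10_000, batch_size=8))
```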

The models use NVIDIA Apex for mixed-precision scaling during training.
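The idea behind loss scaling can be shown without a GPU. The sketch below does not use the Apex API; it only emulates half-precision rounding with the standard library's `struct` module to show why a very small gradient vanishes in fp16 and how multiplying by a scale factor before the cast preserves it. The gradient value and the scale of 65536 are assumptions chosen for the demonstration.

```python
import struct


def to_fp16(x: float) -> float:
    """Round a Python float to IEEE 754 half precision and back."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


LOSS_SCALE = 65536.0  # assumed scale factor (2**16), for illustration
tiny_grad = 1e-8      # hypothetical gradient, below the fp16 range

# Without scaling, the gradient underflows to zero in half precision.
unscaled = to_fp16(tiny_grad)

# With scaling, the value is representable in fp16; dividing by the
# scale afterwards (in full precision) recovers the original magnitude.
scaled = to_fp16(tiny_grad * LOSS_SCALE)
recovered = scaled / LOSS_SCALE

print("fp16 without scaling:", unscaled)
print("recovered after scaling:", recovered)
```

Scaling the loss scales every gradient by the same factor, which is why a single multiplication before the backward pass is enough.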

BERT uses the Hugging Face port of the original models to PyTorch.

The OpenAI models (GPT and GPT-2) use a fork of the above repo, modified to support binary classification.

Files

BERT.ipynb
BERT_TPU.ipynb
GPT.ipynb
GPT2_117M.ipynb
GPT2_345M.ipynb
LICENSE
README.md
XLNet.ipynb
