A tiny LLM based on GPT-2.
The simplest, fastest repository for training/finetuning medium-sized GPTs. It is a rewrite of nanoGPT that prioritizes teeth over education. Still under active development, but currently the file train.py
reproduces GPT-2 (124M) on OpenWebText, running on a single 8xA100 40GB node in about 4 days of training. The code is divided into separate Python modules.
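As a sketch of what launching such a run might look like, assuming the training script follows nanoGPT's torchrun conventions (this repo's exact flags and config handling are not confirmed here):

```bash
# Hypothetical multi-GPU launch, mirroring nanoGPT's conventions.
# Assumes a standard single-node DDP setup across 8 GPUs.
torchrun --standalone --nproc_per_node=8 train.py
```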
Because the code is so simple, it is very easy to hack to your needs, train new models from scratch, or finetune pretrained checkpoints (e.g. the biggest starting point currently available would be OpenAI's GPT-2 1.5B model).
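As one concrete way to obtain such a checkpoint, the OpenAI GPT-2 weights can be pulled through the HuggingFace transformers library (a generic sketch using the standard transformers API, not a loader specific to this repo):

```python
# Sketch: download OpenAI's released GPT-2 XL (1.5B) weights via the
# HuggingFace transformers API as a finetuning starting point.
from transformers import GPT2LMHeadModel

model = GPT2LMHeadModel.from_pretrained("gpt2-xl")  # largest GPT-2, ~1.5B params
n_params = sum(p.numel() for p in model.parameters())
print(f"loaded gpt2-xl with {n_params / 1e9:.2f}B parameters")
```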
Install the requirements using the following command:

```bash
pip install -r requirements.txt
```
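The exact contents of requirements.txt are defined by the repo itself, but for a nanoGPT-style codebase the dependency list typically looks something like this (an assumption, not the file's verified contents):

```
# Assumed typical dependencies for a nanoGPT-style project; check
# requirements.txt for the authoritative list.
torch
numpy
transformers
datasets
tiktoken
wandb
tqdm
```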
This implementation is logically identical to nanoGPT.
The purpose of this project is to apply and expand my knowledge of LLMs.