Generative AI from Scratch

This repository aims to provide minimal code for generative models, for educational purposes. It depends only on PyTorch 2.0, with no Hugging Face Transformers.

To begin with, I have included the code to train a 51M-parameter language model. I will add image generation and more features in the future.
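For orientation, here is a rough sketch of the kind of decoder-only Transformer this repository trains. The layer sizes and module names below are illustrative assumptions, not the actual code in the generative_ai package:

# Illustrative sketch only: the repo's real architecture and hyperparameters
# live in the generative_ai package and may differ.
import torch
import torch.nn as nn

class Block(nn.Module):
    def __init__(self, d_model: int, n_heads: int):
        super().__init__()
        self.ln1 = nn.LayerNorm(d_model)
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.ln2 = nn.LayerNorm(d_model)
        self.mlp = nn.Sequential(
            nn.Linear(d_model, 4 * d_model), nn.GELU(), nn.Linear(4 * d_model, d_model)
        )

    def forward(self, x):
        # Causal mask: each token may attend only to earlier positions.
        t = x.size(1)
        mask = torch.triu(torch.ones(t, t, dtype=torch.bool, device=x.device), diagonal=1)
        h = self.ln1(x)
        h, _ = self.attn(h, h, h, attn_mask=mask, need_weights=False)
        x = x + h
        return x + self.mlp(self.ln2(x))

class TinyLM(nn.Module):
    def __init__(self, vocab_size: int, d_model: int = 512, n_heads: int = 8,
                 n_layers: int = 8, max_len: int = 1024):
        super().__init__()
        self.tok = nn.Embedding(vocab_size, d_model)
        self.pos = nn.Embedding(max_len, d_model)
        self.blocks = nn.ModuleList(Block(d_model, n_heads) for _ in range(n_layers))
        self.ln = nn.LayerNorm(d_model)
        self.head = nn.Linear(d_model, vocab_size, bias=False)

    def forward(self, idx):
        pos = torch.arange(idx.size(1), device=idx.device)
        x = self.tok(idx) + self.pos(pos)
        for block in self.blocks:
            x = block(x)
        return self.head(self.ln(x))  # logits over the vocabulary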

Prerequisites

This repository is tested on:

  • Python 3.10.12
  • Poetry 1.6.1
  • NVIDIA V100 GPU
  • CUDA 11.8

For the Python packages, please refer to pyproject.toml.
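Assuming the standard Poetry workflow, installing them boils down to:

poetry install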

Text Generation

I trained a 51M-parameter language model on 1B tokens from BookCorpus. Training took around 20 hours on a single V100 GPU, which cost around $50. The final model achieved a perplexity of 0.83.
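For reference, perplexity is just the exponential of the mean per-token cross-entropy loss, so it can be read directly off the loss curve. A minimal illustration (the tensor shapes here are assumptions):

import torch
import torch.nn.functional as F

def perplexity(logits: torch.Tensor, targets: torch.Tensor) -> float:
    # logits: (batch, seq_len, vocab), targets: (batch, seq_len) token ids
    loss = F.cross_entropy(logits.transpose(1, 2), targets)  # mean per-token loss
    return torch.exp(loss).item()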

(Figure: training curve)

Training Procedure

To create a tokenizer, run:

poetry run python generative_ai/scripts/create_tokenizer.py
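The script itself is not reproduced here, but as a rough sketch of what tokenizer training typically involves, byte-level BPE with the standalone tokenizers package (an assumption; it is a separate library from transformers) might look like this, with placeholder file paths:

# Illustrative only: create_tokenizer.py may use a different library or setup.
from tokenizers import Tokenizer
from tokenizers.models import BPE
from tokenizers.pre_tokenizers import ByteLevel
from tokenizers.trainers import BpeTrainer

tokenizer = Tokenizer(BPE(unk_token="[UNK]"))
tokenizer.pre_tokenizer = ByteLevel()
trainer = BpeTrainer(vocab_size=16000, special_tokens=["[UNK]", "[EOS]"])
tokenizer.train(files=["corpus.txt"], trainer=trainer)  # corpus.txt is a placeholder
tokenizer.save("tokenizer.json")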

To launch training, run:

poetry run python generative_ai/scripts/train.py
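Under the hood, training reduces to next-token prediction. A minimal sketch of one training step (the model and batch conventions are assumptions, not the script's actual code):

import torch
import torch.nn as nn
import torch.nn.functional as F

def train_step(model: nn.Module, batch: torch.Tensor,
               optimizer: torch.optim.Optimizer) -> float:
    # batch: (batch_size, seq_len + 1) token ids; shift by one for targets.
    inputs, targets = batch[:, :-1], batch[:, 1:]
    logits = model(inputs)                                   # (B, T, vocab)
    loss = F.cross_entropy(logits.transpose(1, 2), targets)  # next-token loss
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()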

To generate sentences with the pretrained model, run:

$ poetry run python generative_ai/scripts/generate.py --model generative_ai/artifacts/model.pt --prompt "life is about"

> number of parameters: 50.98M
life is about romance , and love and adrenaline , at the same time .

The pretrained model.pt can be downloaded from Hugging Face Models.
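Generation itself is a simple autoregressive loop. An illustrative sampler (the actual decoding options in generate.py may differ):

# Sketch of temperature sampling over a causal LM; not the repo's generate.py.
import torch

@torch.no_grad()
def sample(model, idx: torch.Tensor, max_new_tokens: int = 20,
           temperature: float = 1.0) -> torch.Tensor:
    # idx: (1, T) prompt token ids; appends one sampled token per step.
    for _ in range(max_new_tokens):
        logits = model(idx)[:, -1, :] / temperature  # logits for the next token
        probs = torch.softmax(logits, dim=-1)
        next_id = torch.multinomial(probs, num_samples=1)
        idx = torch.cat([idx, next_id], dim=1)
    return idx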
