Skip to content

spather/transformer-experiments

Repository files navigation

transformer-experiments

CI Deploy to GitHub Pages

This repo contains experiments I did on the simple transformer developed in Andrej Karpathy’s excellent video, Let’s build GPT: from scratch, in code, spelled out. The transformer architecture, training, and basic inference code comes from that video. The experiments and analyses are my own.

Dev Environment Setup

  1. Create a virtual environment
python3 -m venv ~/venv/venv-transformer-experiments
source ~/venv/venv-transformer-experiments/bin/activate
  1. Install dependencies
pip install -r requirements.txt
pip install -r requirements.dev.txt
  1. Install this library in editable mode
pip install -e '.[dev]'

Before committing any changes

Run the following to ensure all dependencies are generated from notebooks and notebooks are cleaned:

make all

About

Some experiments on transformer models

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages