Skip to content

HayatoHongo/EveryonesLLM

Repository files navigation

🌐 Select Language / 日本語 🇯🇵 | 中文 🇨🇳

colab-badge.svg) Build LLM on Google Colab from scratch

EveryonesLLM_demo.gif


Table of Contents

WebUI.png

EveryonesLLM

Chapter Estimated Time Notebook
Chapter 00: Start Tutorial 1-2 hours Open in Colab
Chapter 01: Dataloader 1-2 hours Open in Colab
Chapter 02: TokenEmbedding 0.5-1 hour Open in Colab
Chapter 03: PositionEmbedding 0.5-1 hour Open in Colab
Chapter 04: EmbeddingModule 0.5-1 hour Open in Colab
Chapter 05: LayerNorm 1-2 hours Open in Colab
Chapter 06: AttentionHead 3-4 hours Open in Colab
Chapter 07: MultiHeadAttention 1-2 hours Open in Colab
Chapter 08: FeedForward 1-2 hours Open in Colab
Chapter 09: TransformerBlock 0.5-1 hour Open in Colab
Chapter 10: VocabularyLogits 0.5-1 hour Open in Colab
Chapter 11: nanoGPT 1-2 hours Open in Colab
Chapter 12: Trainer 1-2 hours Open in Colab
Chapter 13: Tokens per second(CPU) 1-2 hours Open in Colab
Chapter 14: Tokens per second(T4 GPU) 0.5-1 hour Open in Colab
Chapter 15: Train nanoGPT with GPU 0.5-1 hour Open in Colab
Chapter 16: Make only the model size bigger 0.5-1 hour (+ 1 hour model training) Open in Colab
Chapter 17: Make the dataset bigger 1-2 hours (+ 1 hour model training) Open in Colab
Chapter 18: tiktoken 1-2 hours (+ 1 hour model training) Open in Colab
Chapter 19: Long Train 1-2 hours (+ 6 hours model training) Open in Colab
Chapter 20: Learning rate 0.5-1 hour Open in Colab
Chapter 21: Scaling Law 1-2 hours Open in Colab
Chapter 22: TinyStories(Main) 1-2 hours Open in Colab
Chapter 22: TinyStories(Model Training) 1 hour Open in Colab
Chapter 23: RPE(OverSimplified) 2-3 hours Open in Colab
Chapter 24: RPE(Simplified) 1-2 hours (+ 1 hour model training) Open in Colab
Chapter 25: LR schedule 1 hour Open in Colab
Chapter 26: Checkpoint 1 hour Open in Colab
Chapter 27: Pretraining 0.5 hour (+ 20 hours model training) Open in Colab
Chapter 28: Instruction Tuning 0.5 hour (+ 0.5 hour model training) Open in Colab

Tensor Map (Full Tensor Overview)

Try making the tensor map below by yourself!
Do not worry, I prepared lots of hints for you.
View the full-resolution Tensor Map of the nanoGPT model on Canva

Everyones TensorMap


About the Development Environment

To keep setup easy, please try running all the samples on Google Colab.

However, Google Colab does not save checkmarks in checkboxes.
If you want to track your progress, or if you want to work little by little, say every 30 minutes, I recommend VS Code.
In that case, fork this repository and clone it to your own PC. Just use Google Colab extension for your VS code, then you can use Colab CPU and GPU.


Answers

Chapter Estimated Time Notebook
Chapter 00: Start Tutorial 1-2 hours Open in Colab
Chapter 01: Dataloader 1-2 hours Open in Colab
Chapter 02: TokenEmbedding 0.5-1 hour Open in Colab
Chapter 03: PositionEmbedding 0.5-1 hour Open in Colab
Chapter 04: EmbeddingModule 0.5-1 hour Open in Colab
Chapter 05: LayerNorm 1-2 hours Open in Colab
Chapter 06: AttentionHead 3-4 hours Open in Colab
Chapter 07: MultiHeadAttention 1-2 hours Open in Colab
Chapter 08: FeedForward 1-2 hours Open in Colab
Chapter 09: TransformerBlock 0.5-1 hour Open in Colab
Chapter 10: VocabularyLogits 0.5-1 hour Open in Colab
Chapter 11: nanoGPT 1-2 hours Open in Colab
Chapter 12: Trainer 1-2 hours Open in Colab
Chapter 13: Tokens per second(CPU) 1-2 hours Open in Colab
Chapter 14: Tokens per second(T4 GPU) 0.5-1 hour Open in Colab
Chapter 15: Train nanoGPT with GPU 0.5-1 hour Open in Colab
Chapter 16: Make only the model size bigger 0.5-1 hour (+ 1 hour model training) Open in Colab
Chapter 17: Make the dataset bigger 1-2 hours (+ 1 hour model training) Open in Colab
Chapter 18: tiktoken 1-2 hours (+ 1 hour model training) Open in Colab
Chapter 19: Long Train 1-2 hours (+ 6 hours model training) Open in Colab
Chapter 20: Learning rate 0.5-1 hour Open in Colab
Chapter 21: Scaling Law 1-2 hours Open in Colab
Chapter 22: TinyStories(Main) 1-2 hours Open in Colab
Chapter 22: TinyStories(Model Training) 1 hour Open in Colab
Chapter 23: RPE(OverSimplified) 2-3 hours Open in Colab
Chapter 24: RPE(Simplified) 1-2 hours (+ 1 hour model training) Open in Colab
Chapter 25: LR schedule 1 hour Open in Colab
Chapter 26: Checkpoint 1 hour Open in Colab
Chapter 27: Pretraining 0.5 hour (+ 20 hours model training) Open in Colab
Chapter 28: Instruction Tuning 0.5 hour (+ 1 hour model training) Open in Colab

Notice

This project is a community-based open-source educational project and is not affiliated with Google in any way.

About Project EveryonesLLM

EveryonesLLM Logo
EveryonesLLM Goal
EveryonesLLM Idea
EveryonesLLM Prerequites

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors