ch04

Jun 24, 2025

2f53bf5 · Jun 24, 2025

Name	Name	Last commit message	Last commit date
parent directory ..
01_main-chapter-code	01_main-chapter-code	Add PyPI package (#576 )	Mar 24, 2025
02_performance-analysis	02_performance-analysis	Add PyPI package (#576 )	Mar 24, 2025
03_kv-cache	03_kv-cache	Link the other KV cache sections (#708 )	Jun 24, 2025
README.md	README.md	Add KV cache (#671 )	Jun 15, 2025

README.md

02_performance-analysis contains optional code analyzing the performance of the GPT model(s) implemented in the main chapter
03_kv-cache implements a KV cache to speed up the text generation during inference
ch05/07_gpt_to_llama contains a step-by-step guide for converting a GPT architecture implementation to Llama 3.2 and loads pretrained weights from Meta AI (it might be interesting to look at alternative architectures after completing chapter 4, but you can also save that for after reading chapter 5)

In the video below, I provide a code-along session that covers some of the chapter contents as supplementary material.