This repository contains implementations and exercises for building a Large Language Model (LLM) from the ground up, following Sebastian Raschka's book *Build a Large Language Model (From Scratch)* and its companion repository.
Each chapter covers a critical component of the LLM pipeline, from data preparation to instruction fine-tuning; minimal code sketches illustrating each chapter's core idea follow the list below.
- ch02: Working with Text Data - Tokenization and data sampling.
- ch03: Coding Attention Mechanisms - Self-attention, causal attention, and multi-head attention.
- ch04: Implementing a GPT Model - Building the GPT architecture and various attention optimizations.
- ch05: Training on Unlabeled Data - Loss calculation, training loops, and loading pretrained weights.
- ch06: Fine-Tuning for Classification - Adapting the model for tasks like spam detection.
- ch07: Fine-Tuning to Follow Instructions - Instruction fine-tuning for conversational capabilities.
- Each chapter's folder contains a main notebook and supporting scripts that demonstrate the concepts covered in that chapter.
- For the authoritative source and additional resources, please visit the main repository.
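
To make the chapter list concrete, the sketches below illustrate one core idea per chapter. They are simplified relative to the actual notebook code and assume PyTorch and `tiktoken` are installed. For chapter 2, tokenization with the GPT-2 byte-pair encoder and sliding-window data sampling might look like this:

```python
import tiktoken

# BPE tokenization with the GPT-2 encoding used throughout the book.
tokenizer = tiktoken.get_encoding("gpt2")
text = "Building an LLM from scratch, one chapter at a time."
token_ids = tokenizer.encode(text)
assert tokenizer.decode(token_ids) == text  # encoding round-trips

# Sliding-window sampling: each input chunk is paired with a target
# chunk shifted one token ahead (the next-token-prediction objective).
context_length = 4
pairs = [
    (token_ids[i : i + context_length], token_ids[i + 1 : i + context_length + 1])
    for i in range(len(token_ids) - context_length)
]
```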
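
For chapter 3, a single-head causal self-attention module, stripped down from the multi-head version built in the notebook (dimension names are illustrative):

```python
import torch
import torch.nn as nn

class CausalSelfAttention(nn.Module):
    def __init__(self, d_in, d_out, context_length):
        super().__init__()
        self.W_query = nn.Linear(d_in, d_out, bias=False)
        self.W_key = nn.Linear(d_in, d_out, bias=False)
        self.W_value = nn.Linear(d_in, d_out, bias=False)
        # Upper-triangular mask that hides future positions.
        self.register_buffer(
            "mask",
            torch.triu(torch.ones(context_length, context_length), diagonal=1).bool(),
        )

    def forward(self, x):  # x: (batch, seq_len, d_in)
        seq_len = x.shape[1]
        queries, keys, values = self.W_query(x), self.W_key(x), self.W_value(x)
        # Scaled dot-product scores, with future positions masked out.
        scores = queries @ keys.transpose(1, 2) / keys.shape[-1] ** 0.5
        scores = scores.masked_fill(self.mask[:seq_len, :seq_len], float("-inf"))
        return torch.softmax(scores, dim=-1) @ values  # (batch, seq_len, d_out)
```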
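
For chapter 4, the GPT architecture is driven by a configuration dictionary; the values below match the GPT-2 "small" (124M-parameter) setup the book builds on:

```python
GPT_CONFIG_124M = {
    "vocab_size": 50257,     # GPT-2 BPE vocabulary size
    "context_length": 1024,  # maximum sequence length
    "emb_dim": 768,          # embedding dimension
    "n_heads": 12,           # attention heads per transformer block
    "n_layers": 12,          # number of transformer blocks
    "drop_rate": 0.1,        # dropout probability
    "qkv_bias": False,       # bias terms in query/key/value projections
}
```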
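
For chapter 5, the pretraining loss is plain cross-entropy between the model's logits and the input token IDs shifted one position ahead; a minimal sketch with random tensors standing in for a real model and batch:

```python
import torch
import torch.nn.functional as F

# Logits have shape (batch, seq_len, vocab_size); targets are the
# input token IDs shifted one position to the left.
logits = torch.randn(2, 4, 50257)           # stand-in for model(inputs)
targets = torch.randint(0, 50257, (2, 4))   # stand-in for shifted inputs

# Flatten batch and sequence dimensions so every position is scored.
loss = F.cross_entropy(logits.flatten(0, 1), targets.flatten())
```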
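
For chapter 6, classification fine-tuning amounts to freezing the pretrained weights and replacing the vocabulary-sized output head with a small task head. This sketch assumes the `GPTModel` class and `GPT_CONFIG_124M` dictionary from the chapter 4 code:

```python
import torch.nn as nn

model = GPTModel(GPT_CONFIG_124M)  # assumed from the chapter 4 code

# Freeze the pretrained weights ...
for param in model.parameters():
    param.requires_grad = False

# ... and attach a trainable two-class head (spam vs. not spam);
# its parameters are newly created, so they remain trainable.
num_classes = 2
model.out_head = nn.Linear(GPT_CONFIG_124M["emb_dim"], num_classes)
```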
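
For chapter 7, instruction examples are rendered into an Alpaca-style prompt before training; a sketch of the formatting, assuming entries with `instruction`, `input`, and `output` fields:

```python
def format_input(entry):
    # Alpaca-style prompt: a fixed preamble, the instruction, and an
    # optional input section for tasks that provide extra context.
    instruction_text = (
        "Below is an instruction that describes a task. "
        "Write a response that appropriately completes the request."
        f"\n\n### Instruction:\n{entry['instruction']}"
    )
    input_text = f"\n\n### Input:\n{entry['input']}" if entry["input"] else ""
    return instruction_text + input_text

entry = {
    "instruction": "Convert 10 km to miles.",
    "input": "",
    "output": "10 km is about 6.2 miles.",
}
training_text = format_input(entry) + f"\n\n### Response:\n{entry['output']}"
```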