Rust ML Systems from First Principles

A beginner course for learning machine learning as a translation problem:

plain English <-> algebra <-> Rust

The goal is not to memorize symbols. The goal is to learn how to read formulas as programs, and how to read Rust code as precise mathematical structure.
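To make the translation concrete, here is one formula rendered all three ways. This is an illustrative sketch, not an excerpt from the lessons; the function name `dot` is my own choice.

plain English: multiply matching entries, then add up the products.
algebra: a . b = sum_i a_i * b_i
Rust:

```rust
/// Dot product: multiply matching entries, then sum.
/// Algebra: a . b = sum_i a_i * b_i
fn dot(a: &[f64], b: &[f64]) -> f64 {
    assert_eq!(a.len(), b.len(), "vectors must have the same length");
    a.iter().zip(b.iter()).map(|(x, y)| x * y).sum()
}

fn main() {
    let a = [1.0, 2.0, 3.0];
    let b = [4.0, 5.0, 6.0];
    println!("{}", dot(&a, &b)); // 1*4 + 2*5 + 3*6 = 32
}
```

Reading in either direction works: the `zip`/`map`/`sum` chain is the sigma, and the sigma is the chain.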

Who This Is For

  • Beginners with little or no machine learning background
  • Rust learners who want a concrete reason to use vectors, structs, loops, and functions
  • Self-paced learners who want short lessons and small practice steps

Start Here

  1. Read 01 Foundations.
  2. Continue with 02 Vectors.
  3. Continue with 03 Neuron.
  4. Use Lessons index to see the full course map.

If you specifically want the current Transformer material after the fundamentals, jump to 07 Transformer.

The repo uses sequential folder numbers even though the curriculum starts at Module 0:

  • Course Module 0 -> Repo folder lessons/01-foundations
  • Course Module 1 -> Repo folder lessons/02-vectors

What Exists Now

  • Authored lessons, now including the neuron track and the Transformer track
  • Executable companion code
  • Source material and roadmap

Repo Map

```text
rust-ml/
├── lessons/    # canonical course content
├── references/ # transcripts and papers used as source material
├── code/       # runnable companion crates
├── book/       # future mdBook/site wrapper
└── README.md
```

Working Rules For This Repo

  • lessons/ is the source of truth for written teaching content.
  • code/ follows the lesson progression and now includes a real, tested Transformer crate.
  • book/ is intentionally thin in this pass so the course content does not drift into two competing copies.

Learning Strategy

The course keeps the same translation goal everywhere:

plain English <-> algebra <-> Rust

Module 07 now applies that rule in two complementary ways:

  • narrative lessons that explain the architecture and the implementation choices
  • a chunked encoder lesson where every concept is written as English -> Algebra -> Rust

That repetition is intentional. Repetition is how the translation dictionary becomes automatic.
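As an illustration of the chunked style, the scaled dot-product attention score could be rendered like this. This is a hypothetical sketch in the lesson's spirit, not code copied from Module 07:

```rust
/// English: how strongly a query attends to a key is their dot product,
/// scaled down by the square root of the dimension so scores stay tame.
/// Algebra: score(q, k) = (q . k) / sqrt(d)
fn attention_score(query: &[f64], key: &[f64]) -> f64 {
    assert_eq!(query.len(), key.len(), "query and key must share a dimension");
    let dot: f64 = query.iter().zip(key).map(|(q, k)| q * k).sum();
    dot / (query.len() as f64).sqrt()
}

fn main() {
    let q = [1.0, 0.0, 1.0, 0.0];
    let k = [1.0, 1.0, 1.0, 1.0];
    println!("{}", attention_score(&q, &k)); // (q . k) / sqrt(d) = 2 / 2 = 1
}
```

Each chunk carries all three translations at once: the doc comment holds the English and the algebra, and the body holds the Rust.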

Suggested Study Flow

  1. Read the module README.
  2. Work through the lesson files in order.
  3. Do the module exercises without copying from the solutions first.
  4. Use the solution files to check reasoning, naming, and Rust syntax.
  5. Move to the next module only after you can explain each formula out loud in English.

Running The Code

The current runnable code artifact is the Transformer teaching crate:

```shell
cargo test --manifest-path code/transformer/Cargo.toml
```

That crate covers:

  • dense vectors and matrices
  • semantic model newtypes such as TokenEmbedding, Query, Key, and Value
  • expressive thiserror diagnostics for shape mistakes
  • standard self-attention and multi-head attention
  • a simplified linear-attention comparison point
  • positional encodings, layer norm, feed-forward layers, and an encoder block
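The newtype-plus-diagnostics idea in the list above can be sketched in a few lines. The real implementation lives in `code/transformer` and uses `thiserror`; this dependency-free sketch uses a plain error enum, and every name here except `Query` and `Key` (which the crate does define) is an illustrative assumption:

```rust
// Sketch of semantic newtypes with shape diagnostics; not the crate's API.
#[derive(Debug)]
enum ShapeError {
    // Reported when two vectors that must share a dimension do not.
    DimMismatch { left: usize, right: usize },
}

/// Newtypes make it a compile-time error to pass a Key where a Query is
/// expected, even though both wrap a plain vector of floats.
struct Query(Vec<f64>);
struct Key(Vec<f64>);

fn score(q: &Query, k: &Key) -> Result<f64, ShapeError> {
    if q.0.len() != k.0.len() {
        return Err(ShapeError::DimMismatch { left: q.0.len(), right: k.0.len() });
    }
    let dot: f64 = q.0.iter().zip(&k.0).map(|(a, b)| a * b).sum();
    Ok(dot / (q.0.len() as f64).sqrt())
}

fn main() {
    let q = Query(vec![1.0, 2.0]);
    let k = Key(vec![3.0, 4.0]);
    println!("{:?}", score(&q, &k));
}
```

The payoff is that shape mistakes surface as descriptive `Err` values instead of silent wrong answers, and type mix-ups never compile at all.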

Quality Automation

The repo now includes two GitHub Actions workflows for quality control:

  • CI runs deterministic checks for lesson structure, local Markdown links, and authored-section contracts.
  • CI also compile-checks Rust snippets embedded in lessons and runs cargo fmt, cargo clippy, and cargo test for the Transformer teaching crate.
  • Gemini Writing Review assesses Markdown content on pull requests for English clarity, technical-teaching quality, structural discipline, and beginner-friendliness.

The Gemini review is advisory, not a replacement for human judgment. It is designed to catch weak phrasing, excess cognitive load, mismatches between English and code, and places where the teaching flow violates common technical-writing or technical-instruction best practices.

To enable Gemini review in GitHub Actions, configure:

  • repository secret GEMINI_API_KEY
  • optional repository variable GEMINI_MODEL if you want a model other than the default gemini-2.0-flash

The workflow writes a review artifact named gemini-writing-review so the writing assessment can be read directly from the workflow run.

References

The repo keeps supporting source material in references/, including:

  • a Transformer explainer transcript
  • Bahdanau et al. (2014)
  • Luong et al. (2015)
  • Vaswani et al. (2017)
  • Sebastian Raschka's LLMs From Scratch repository as an external inspiration source for attention, GPT, and educational sequencing
