LLM research repo for building, training, and evaluating modern language models on local hardware.
Status: Current phase and progress live in .github/MEMORY.md (working session), .github/SESSION_LOG.md (recent history), .github/SESSION_LOG_ARCHIVE.md (older history), and docs/PLAN.md.
Repository: github.com/maxjbarfuss/max_llm
Set up your environment first, then use the entrypoint docs for the workflow you need:
- scripts/setup/README.md: environment setup and platform requirements
- scripts/build/README.md: C++ build wrapper and modes
- src/config/README.md: config system (adding fields, versioning, test fixtures)
- src/data/README.md: data prep workflow
- src/training/README.md: training configurations
- src/inference/README.md: inference configurations
- outputs/README.md: milestone vs ephemeral artifact layout
- tests/README.md: test execution
- .github/MEMORY.md: current session working state
- .github/SESSION_LOG.md: recent completed-session history
- .github/SESSION_LOG_ARCHIVE.md: archived older session history
- docs/PLAN.md: phased roadmap, task checklists, and exit criteria
- docs/DESIGN.md: architecture and data design
- docs/OPTIMIZATION.md: validated optimization learnings and current defaults
- docs/PHASE_1_CLOSEOUT.md | docs/PHASE_2_CLOSEOUT.md | docs/PHASE_3_CLOSEOUT.md | docs/PHASE_4_CLOSEOUT.md: completed-phase historical reports
- config/README.md: TOML config field reference for all sections
- CONTRIBUTING.md: workflow and contributor authorization
- .github/AGENTS.md: AI agent standard (Claude, o1, custom models)
- .github/SKILLS.md: AI agent instructions
- .github/LESSONS.md: agent mistake patterns
- .github/CODEOWNERS: code ownership
Architecture, phase progression, and data strategy live in docs/DESIGN.md. Active execution detail lives in docs/PLAN.md. Historical outcomes live in the phase closeouts.
```mermaid
graph TD
In[Text Input]:::io --> Tok[Unigram Tokenizer 8K]:::p3 --> Emb[Token Embedding]:::p2 --> N1
Emb --> RGRU[GRU Reasoning Stream]:::p7
subgraph Block[Transformer Stream x N]
N1[RMSNorm]:::p4 --> ATT[MLA]:::p6 --> R1[+ Residual]:::p3
RoPE[RoPE]:::p4 -.-> ATT
R1 --> N2[RMSNorm]:::p4 --> MOE[MoE Sparse SwiGLU]:::p6 --> R2[+ Residual]:::p3
end
LoRA:::p5 -.-> Block
RewardModel:::p5 -.-> Block
R2 --> COMB[GRU Combiner]:::p7
RGRU --> COMB
COMB --> Head[LM Head]:::p3 --> Logits[Logits]:::io
Logits -->|training| Loss[CE Loss + DPO]:::p5
Logits -->|inference| Samp[Sampler + KV-cache]:::p5 --> GenOut[Generated Text]:::io
classDef io fill:#212121,stroke:#FFFFFF,color:#FFFFFF,stroke-width:2px
classDef p2 fill:#C8E6C9,stroke:#2E7D32,color:#1B5E20
classDef p3 fill:#BBDEFB,stroke:#1565C0,color:#0D47A1
classDef p4 fill:#FFE0B2,stroke:#E65100,color:#BF360C
classDef p5 fill:#E1BEE7,stroke:#6A1B9A,color:#4A148C
classDef p6 fill:#FFCDD2,stroke:#C62828,color:#B71C1C
classDef p7 fill:#FFF9C4,stroke:#F57F17,color:#F57F17
```
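The per-block residual layout in the diagram is pre-norm: RMSNorm before each sublayer, with the residual add after it. A minimal NumPy sketch of that ordering is below; `attn_fn` and `moe_fn` stand in for the MLA attention and MoE sparse SwiGLU sublayers, and all names here are illustrative, not the repo's actual API (see docs/DESIGN.md for the real architecture).

```python
import numpy as np

def rms_norm(x, weight, eps=1e-6):
    # Normalize by root-mean-square over the last axis, then apply a learned scale.
    rms = np.sqrt(np.mean(x * x, axis=-1, keepdims=True) + eps)
    return x / rms * weight

def transformer_block(x, params, attn_fn, moe_fn):
    # Pre-norm residual layout from the diagram:
    #   x -> RMSNorm -> attention -> + residual -> RMSNorm -> MoE -> + residual
    x = x + attn_fn(rms_norm(x, params["norm1_w"]))
    x = x + moe_fn(rms_norm(x, params["norm2_w"]))
    return x
```

With identity sublayers and unit weights, each residual add contributes one normalized copy of the stream, which makes the add-after-norm ordering easy to verify in isolation.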
Apache License 2.0. See LICENSE.