BareGPT is my attempt to build a GPT-style large language model entirely from scratch using PyTorch. The goal is to understand the inner workings of LLMs by manually implementing every core component, from tokenization and attention mechanisms to the full model architecture, without relying on high-level abstractions.
Large language models (LLMs) like GPT have reshaped natural language processing by leveraging transformers and self-attention. BareGPT aims to reconstruct this architecture from the ground up, focusing on clarity, correctness, and learning.
BareGPT is a hands-on initiative to break down the complexity of modern LLMs and gain practical insight into how generative models work at a fundamental level. It’s an exploration in learning by building.
- Tokenization and embeddings (see the embedding sketch after this list)
- Self-attention and multi-head attention
- Implemented scaled dot-product attention as the core self-attention mechanism (see the attention sketch after this list).
- Built multi-head attention modules.
- Added a causal attention mask to prevent tokens from attending to future positions.
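
To give a feel for the embedding stage, here is a minimal PyTorch sketch of token plus learned positional embeddings. It is not BareGPT's actual code: the class name `TokenAndPositionEmbedding` and the hyperparameters `VOCAB_SIZE`, `CONTEXT_LEN`, and `D_MODEL` are illustrative assumptions, not names taken from the repository.

```python
import torch
import torch.nn as nn

# Hypothetical hyperparameters, for illustration only.
VOCAB_SIZE = 50257   # e.g. a BPE vocabulary of GPT-2 size
CONTEXT_LEN = 128    # maximum sequence length
D_MODEL = 256        # embedding dimension

class TokenAndPositionEmbedding(nn.Module):
    """Maps token ids to vectors and adds learned positional embeddings."""

    def __init__(self, vocab_size: int, context_len: int, d_model: int):
        super().__init__()
        self.tok_emb = nn.Embedding(vocab_size, d_model)   # one vector per token id
        self.pos_emb = nn.Embedding(context_len, d_model)  # one vector per position

    def forward(self, idx: torch.Tensor) -> torch.Tensor:
        # idx: (batch, seq_len) tensor of integer token ids
        batch, seq_len = idx.shape
        positions = torch.arange(seq_len, device=idx.device)  # (seq_len,)
        return self.tok_emb(idx) + self.pos_emb(positions)    # (batch, seq_len, d_model)

# Usage: embed a dummy batch of token ids
emb = TokenAndPositionEmbedding(VOCAB_SIZE, CONTEXT_LEN, D_MODEL)
ids = torch.randint(0, VOCAB_SIZE, (2, 16))
print(emb(ids).shape)  # torch.Size([2, 16, 256])
```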
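
The sketch below shows how scaled dot-product attention, multi-head splitting, and the causal mask fit together in PyTorch. It is a minimal illustration under assumed hyperparameter names (`d_model`, `n_heads`, `context_len`) and is not a copy of the project's modules.

```python
import math
import torch
import torch.nn as nn

class CausalMultiHeadAttention(nn.Module):
    """Multi-head scaled dot-product attention with a causal mask."""

    def __init__(self, d_model: int, n_heads: int, context_len: int):
        super().__init__()
        assert d_model % n_heads == 0
        self.n_heads = n_heads
        self.head_dim = d_model // n_heads
        self.qkv = nn.Linear(d_model, 3 * d_model)  # joint projection to queries, keys, values
        self.proj = nn.Linear(d_model, d_model)     # final output projection
        # Upper-triangular mask: position i may not attend to positions j > i
        mask = torch.triu(torch.ones(context_len, context_len, dtype=torch.bool), diagonal=1)
        self.register_buffer("mask", mask)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        batch, seq_len, d_model = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        # Split into heads: (batch, n_heads, seq_len, head_dim)
        shape = (batch, seq_len, self.n_heads, self.head_dim)
        q, k, v = (t.view(shape).transpose(1, 2) for t in (q, k, v))
        # Scaled dot-product attention scores
        scores = q @ k.transpose(-2, -1) / math.sqrt(self.head_dim)
        # Causal mask: future positions are set to -inf before the softmax
        scores = scores.masked_fill(self.mask[:seq_len, :seq_len], float("-inf"))
        weights = torch.softmax(scores, dim=-1)
        out = weights @ v                               # (batch, n_heads, seq_len, head_dim)
        out = out.transpose(1, 2).contiguous().view(batch, seq_len, d_model)
        return self.proj(out)

# Usage with dummy activations
attn = CausalMultiHeadAttention(d_model=256, n_heads=4, context_len=128)
x = torch.randn(2, 16, 256)
print(attn(x).shape)  # torch.Size([2, 16, 256])
```

Registering the mask as a buffer keeps it on the module's device without treating it as a learnable parameter, which is a common way to handle causal masking.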