decoder-only-transformer

Here are 3 public repositories matching this topic...

shaheennabi / Decoder-Only-Transformer-Architecture-Implementation-in-Pytorch

A complete implementation of a Decoder-Only Transformer (GPT-style) built using PyTorch, without relying on high-level abstractions. This implementation includes all core components: token embeddings, positional embeddings, multi-head self-attention, feedforward networks, causal masking, and output logits generation.

attention-is-all-you-need positional-encoding masked-attention decoder-only-transformer

Updated Feb 18, 2026
Python
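
As a rough illustration of the causal masking and self-attention components named in the description above, here is a minimal single-head sketch in plain Python. It is not taken from the repository, which uses PyTorch. The function name `causal_self_attention` and the list-of-lists input layout are assumptions made for this example. Rather than adding a negative-infinity mask to a full score matrix, it restricts each position to earlier positions, which gives the same result.

```python
import math

def causal_self_attention(q, k, v):
    """Single-head scaled dot-product attention with a causal constraint.

    q, k, v are lists of T vectors (lists of floats). Hypothetical helper
    for illustration only; not the repository's API.
    """
    d = len(q[0])
    out = []
    for t in range(len(q)):
        # Position t may attend only to positions 0..t (no future tokens).
        scores = [
            sum(a * b for a, b in zip(q[t], k[s])) / math.sqrt(d)
            for s in range(t + 1)
        ]
        # Numerically stable softmax over the visible positions.
        m = max(scores)
        weights = [math.exp(x - m) for x in scores]
        z = sum(weights)
        weights = [w / z for w in weights]
        # Weighted sum of the visible value vectors.
        out.append([
            sum(weights[s] * v[s][j] for s in range(t + 1))
            for j in range(len(v[0]))
        ])
    return out
```

Because the first position can only see itself, its output equals its own value vector, and changing a later token never changes an earlier output.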

MostafaK2 / GPT-Style_LM

Decoder-only GPT-style Transformer for autoregressive language modeling with BPE tokenization, supporting greedy, temperature, top-k, and nucleus (top-p) sampling.

nlp pytorch transformer gpt language-model bpe decoder-only-transformer

Updated Mar 7, 2026
Python
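
The decoding strategies listed in the description above (greedy, temperature, top-k, and nucleus sampling) can be sketched in a few lines of standard-library Python. This is a minimal sketch and not the repository's code. All function names here (`softmax`, `greedy`, `top_k_filter`, `nucleus_filter`, `sample`) are hypothetical and chosen for this example.

```python
import math
import random

def softmax(logits, temperature=1.0):
    """Convert logits to probabilities; temperature < 1 sharpens, > 1 flattens."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def greedy(logits):
    """Pick the index of the largest logit."""
    return max(range(len(logits)), key=lambda i: logits[i])

def _renormalize(probs, keep):
    """Zero out every index not in `keep` and rescale the rest to sum to 1."""
    total = sum(probs[i] for i in keep)
    out = [0.0] * len(probs)
    for i in keep:
        out[i] = probs[i] / total
    return out

def top_k_filter(probs, k):
    """Keep only the k most probable tokens."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    return _renormalize(probs, order[:k])

def nucleus_filter(probs, p):
    """Keep the smallest set of top tokens whose cumulative probability reaches p."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    keep, cumulative = [], 0.0
    for i in order:
        keep.append(i)
        cumulative += probs[i]
        if cumulative >= p:
            break
    return _renormalize(probs, keep)

def sample(probs, rng=random):
    """Draw one token index according to the given probabilities."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]
```

A typical decoding step under these assumptions would compute `softmax(logits, temperature)`, optionally pass the result through `top_k_filter` or `nucleus_filter`, and then call `sample`.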

paytonison / tater-tot

Small educational C++20 character-level language model with a tiny automatic-differentiation engine, training/generation CLIs, and checkpointing.

python c cli machine-learning cmake neural-network cpp transformer educational standard-library c11 language-model from-scratch autodiff cpp20 character-level-model decoder-only-transformer

Updated May 20, 2026
C
