Building and training autoregressive language models from scratch, following Andrej Karpathy's Neural Networks: Zero to Hero series.
Word2vec is a technique in natural language processing (NLP) for obtaining vector representations of words. These vectors capture information about a word's meaning based on its surrounding (context) words.
Word2Vec SkipGram with Negative Sampling
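A minimal sketch of how the skip-gram objective with negative sampling can be set up in PyTorch; the class name, vocabulary size, and number of negative samples below are illustrative assumptions, not the repo's actual code.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SkipGramNegSampling(nn.Module):
    """Skip-gram with negative sampling: two embedding tables scored by dot
    product, trained with a binary logistic loss (real vs. sampled contexts)."""
    def __init__(self, vocab_size, embed_dim):
        super().__init__()
        self.in_embed = nn.Embedding(vocab_size, embed_dim)   # center-word vectors
        self.out_embed = nn.Embedding(vocab_size, embed_dim)  # context-word vectors

    def forward(self, center, context, negatives):
        # center: (B,), context: (B,), negatives: (B, K) word indices
        v = self.in_embed(center)                               # (B, D)
        u_pos = self.out_embed(context)                         # (B, D)
        u_neg = self.out_embed(negatives)                       # (B, K, D)
        pos_score = (v * u_pos).sum(-1)                         # (B,)
        neg_score = torch.bmm(u_neg, v.unsqueeze(-1)).squeeze(-1)  # (B, K)
        # maximise log sigmoid(pos) + sum log sigmoid(-neg)
        return -(F.logsigmoid(pos_score) + F.logsigmoid(-neg_score).sum(-1)).mean()

# toy usage with random indices (hypothetical vocab of 100 words, 5 negatives)
model = SkipGramNegSampling(vocab_size=100, embed_dim=16)
center = torch.randint(0, 100, (8,))
context = torch.randint(0, 100, (8,))
negatives = torch.randint(0, 100, (8, 5))
loss = model(center, context, negatives)
loss.backward()
```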
Tokenization splits text into smaller units, called tokens, that can be fed into the language model.
- character level (units too small: long sequences, little meaning per token)
- word level (units too big: huge vocabulary, no handling of unseen words)
- subword level (balanced trade-off between the two)
- BPE (algorithm which merges on argmax P(A,B), i.e. the most frequent adjacent pair; good for whitespaced languages; see the merge-loop sketch after this list)
- WordPiece (Algorithm which merges on argmax P(A,B)/[P(A)*P(B)], good for whitespaced languages)
- SentencePiece (library with optimized BPE and Unigram training that works directly on raw text, good for non-whitespaced languages)
- Unigram (start from a large set of candidate substrings, then iteratively remove the tokens whose removal least reduces the corpus likelihood)
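A toy sketch of the BPE merge loop described above, assuming a whitespace-split corpus represented as word counts; the function name and example frequencies are illustrative, not taken from the repo.

```python
from collections import Counter

def bpe_train(words, num_merges):
    """Toy BPE trainer: words is a dict {word: count}; each word starts as a
    tuple of characters, and we repeatedly merge the most frequent adjacent pair."""
    vocab = {tuple(w): c for w, c in words.items()}
    merges = []
    for _ in range(num_merges):
        pairs = Counter()
        for symbols, count in vocab.items():
            for a, b in zip(symbols, symbols[1:]):
                pairs[(a, b)] += count
        if not pairs:
            break
        best = max(pairs, key=pairs.get)  # argmax over pair frequency ~ P(A,B)
        merges.append(best)
        new_vocab = {}
        for symbols, count in vocab.items():
            out, i = [], 0
            while i < len(symbols):
                if i < len(symbols) - 1 and (symbols[i], symbols[i + 1]) == best:
                    out.append(symbols[i] + symbols[i + 1])  # apply the merge
                    i += 2
                else:
                    out.append(symbols[i])
                    i += 1
            new_vocab[tuple(out)] = count
        vocab = new_vocab
    return merges

# toy corpus: word frequencies after whitespace splitting (hypothetical)
print(bpe_train({"low": 5, "lower": 2, "newest": 6, "widest": 3}, num_merges=10))
```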
Transformer decoder for autoregressive sequence modelling (next-token prediction).
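A minimal sketch of the causal (masked) self-attention that makes a transformer decoder autoregressive; this is a single-head version, and the embedding and block sizes are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CausalSelfAttention(nn.Module):
    """Single-head masked self-attention: each position may attend only to
    itself and earlier positions, which is what makes generation autoregressive."""
    def __init__(self, embed_dim, block_size):
        super().__init__()
        self.qkv = nn.Linear(embed_dim, 3 * embed_dim)
        self.proj = nn.Linear(embed_dim, embed_dim)
        # lower-triangular mask hides future tokens
        self.register_buffer("mask", torch.tril(torch.ones(block_size, block_size)))

    def forward(self, x):
        B, T, D = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        att = (q @ k.transpose(-2, -1)) / D ** 0.5            # (B, T, T) scaled dot products
        att = att.masked_fill(self.mask[:T, :T] == 0, float("-inf"))
        att = F.softmax(att, dim=-1)
        return self.proj(att @ v)                             # (B, T, D)

# toy usage: batch of 2 sequences, 8 tokens, 32-dim embeddings (hypothetical sizes)
x = torch.randn(2, 8, 32)
print(CausalSelfAttention(embed_dim=32, block_size=16)(x).shape)  # torch.Size([2, 8, 32])
```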
- LeNet (early convolutional network for digit classification)
- AlexNet (deep CNN that popularised ReLU and dropout on ImageNet)
- ResNet (residual skip connections for very deep networks; see the block sketch after this list)
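A small sketch of the residual block ResNet is built from, assuming the basic two-convolution variant with an identity skip connection; the channel count and feature-map size are illustrative.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ResidualBlock(nn.Module):
    """Basic ResNet block: two 3x3 convolutions whose output is added back to
    the input (skip connection), so the block only has to learn a residual."""
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(channels)

    def forward(self, x):
        out = F.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return F.relu(out + x)  # add the input back before the final ReLU

# toy usage with a random 64-channel feature map (hypothetical sizes)
x = torch.randn(1, 64, 8, 8)
print(ResidualBlock(64)(x).shape)  # torch.Size([1, 64, 8, 8])
```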
DCGAN, or Deep Convolutional GAN, is a generative adversarial network architecture that uses strided convolutions in the discriminator and transposed convolutions in the generator.
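A minimal sketch of a DCGAN-style generator that upsamples a latent vector to an image with transposed convolutions; the layer widths, latent size, and 32x32 output resolution are illustrative assumptions.

```python
import torch
import torch.nn as nn

class DCGANGenerator(nn.Module):
    """DCGAN-style generator: a latent vector is progressively upsampled to an
    image with strided transposed convolutions, BatchNorm, and ReLU (tanh output)."""
    def __init__(self, latent_dim=100, feat=64, channels=3):
        super().__init__()
        self.net = nn.Sequential(
            nn.ConvTranspose2d(latent_dim, feat * 4, 4, 1, 0, bias=False),  # 1x1 -> 4x4
            nn.BatchNorm2d(feat * 4), nn.ReLU(True),
            nn.ConvTranspose2d(feat * 4, feat * 2, 4, 2, 1, bias=False),    # 4x4 -> 8x8
            nn.BatchNorm2d(feat * 2), nn.ReLU(True),
            nn.ConvTranspose2d(feat * 2, feat, 4, 2, 1, bias=False),        # 8x8 -> 16x16
            nn.BatchNorm2d(feat), nn.ReLU(True),
            nn.ConvTranspose2d(feat, channels, 4, 2, 1, bias=False),        # 16x16 -> 32x32
            nn.Tanh(),
        )

    def forward(self, z):
        return self.net(z)

# toy usage: a batch of 8 latent vectors -> 8 RGB images of size 32x32
z = torch.randn(8, 100, 1, 1)
print(DCGANGenerator()(z).shape)  # torch.Size([8, 3, 32, 32])
```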