Skip to content

Activity

add fine-tuning an embedding model for domain knowledge

Aisukopushed 1 commit to main • bfc2c40…e37baa5 • 
7 hours ago

add confusion matrix and decision boundary

Aisukopushed 1 commit to main • 65e913f…bfc2c40 • 
15 days ago

Kaggle Notebook | Implement GPT2-124M and Load HF GPT2 | Version 20

Aisukopushed 1 commit to main • 61b274d…65e913f • 
15 days ago

Add speech recognition via Whisper

Aisukopushed 1 commit to main • 3f50e7b…61b274d • 
23 days ago

Add visualisation for a simple multiple perceptron

Aisukopushed 1 commit to main • 9556083…3f50e7b • 
28 days ago

add back propagation and training functions

Aisukopushed 1 commit to main • ebe64a3…9556083 • 
29 days ago

Init commit for multilayer perceptron with numpy

Aisukopushed 1 commit to main • 61b03e8…ebe64a3 • 
29 days ago

Kaggle Notebook | Tokenize the Data for Pre-training Neural Network |…

Aisukopushed 1 commit to main • 2f5bf2a…61b03e8 • 
on Feb 2

add DistributedDataParallel training

Aisukopushed 1 commit to main • ebaac25…2f5bf2a • 
on Feb 2

Add the gradient accumulation minimal example

Aisukopushed 1 commit to main • 5022839…ebaac25 • 
on Feb 1

hidden code comment

Aisukopushed 1 commit to main • 19430e9…5022839 • 
on Feb 1

add acknowledgement

Aisukopushed 1 commit to main • 71c0fac…19430e9 • 
on Feb 1

add temporal fusion transformer with Pytorch Forecasting

Aisukopushed 1 commit to main • 51944fc…71c0fac • 
on Feb 1

show the code cell in the notebook

Aisukopushed 1 commit to main • 0e69843…51944fc • 
on Jan 31

reduce the micro batch number to fit lower memory GPU

Aisukopushed 1 commit to main • 7e62769…0e69843 • 
on Jan 31

Kaggle Notebook | Implement GPT2-124M and Load HF GPT2 | Version 15

Aisukopushed 1 commit to main • 99665b0…7e62769 • 
on Jan 31

add weight decay, only for 2D params, and add fused AdamW

Aisukopushed 1 commit to main • b004e3b…99665b0 • 
on Jan 31

Add learning rate scheduler

Aisukopushed 1 commit to main • dc0d493…b004e3b • 
on Jan 30

add torch compile and bfloat16

Aisukopushed 1 commit to main • e555b17…dc0d493 • 
on Jan 30

Support gpt2 initialisation with std=0.2 parameter

Aisukopushed 1 commit to main • 50199fb…e555b17 • 
on Jan 29

fix the issue weight tie the embedding and unembedding matrix

Aisukopushed 1 commit to main • e004406…50199fb • 
on Jan 29

Add data loader

Aisukopushed 1 commit to main • 4ba9840…e004406 • 
on Jan 29

Add 50 training loop with a little batch

Aisukopushed 1 commit to main • ea0bff4…4ba9840 • 
on Jan 29

calculate the loss function: cross entropy loss

Aisukopushed 1 commit to main • 8149b08…ea0bff4 • 
on Jan 29

Do sampling from the customize model

Aisukopushed 1 commit to main • 92d5064…8149b08 • 
on Jan 29

add forward() function of GPT2 nn.Module

Aisukopushed 1 commit to main • 9c0f183…92d5064 • 
on Jan 29

Kaggle Notebook | Implement GPT2-124M and Load HF GPT2 | Version 4

Aisukopushed 1 commit to main • 573d89d…9c0f183 • 
on Jan 29

Kaggle Notebook | Implement GPT2-124M | Version 2

Aisukopushed 1 commit to main • 85f5807…573d89d • 
on Jan 29

Kaggle Notebook | Build a Tokenizer | Version 3

Aisukopushed 1 commit to main • fef15bb…85f5807 • 
on Jan 27

Kaggle Notebook | Anomaly Detection on Bank Card Fraud | Version 2

Aisukopushed 1 commit to main • d8985f3…fef15bb • 
on Jan 21