Activity

add fine-tuning an embedding model for domain knowledge

Aisukopushed 1 commit to main • bfc2c40…e37baa5 •

7 hours ago

add confusion matrix and decision boundary

Aisukopushed 1 commit to main • 65e913f…bfc2c40 •

15 days ago

Kaggle Notebook | Implement GPT2-124M and Load HF GPT2 | Version 20

Aisukopushed 1 commit to main • 61b274d…65e913f •

15 days ago

Add speech recognition via Whisper

Aisukopushed 1 commit to main • 3f50e7b…61b274d •

23 days ago

Add visualisation for a simple multiple perceptron

Aisukopushed 1 commit to main • 9556083…3f50e7b •

28 days ago

add back propagation and training functions

Aisukopushed 1 commit to main • ebe64a3…9556083 •

29 days ago

Init commit for multilayer perceptron with numpy

Aisukopushed 1 commit to main • 61b03e8…ebe64a3 •

29 days ago

Kaggle Notebook | Tokenize the Data for Pre-training Neural Network |…

Aisukopushed 1 commit to main • 2f5bf2a…61b03e8 •

on Feb 2

add DistributedDataParallel training

Aisukopushed 1 commit to main • ebaac25…2f5bf2a •

on Feb 2

Add the gradient accumulation minimal example

Aisukopushed 1 commit to main • 5022839…ebaac25 •

on Feb 1

hidden code comment

Aisukopushed 1 commit to main • 19430e9…5022839 •

on Feb 1

add acknowledgement

Aisukopushed 1 commit to main • 71c0fac…19430e9 •

on Feb 1

add temporal fusion transformer with Pytorch Forecasting

Aisukopushed 1 commit to main • 51944fc…71c0fac •

on Feb 1

show the code cell in the notebook

Aisukopushed 1 commit to main • 0e69843…51944fc •

on Jan 31

reduce the micro batch number to fit lower memory GPU

Aisukopushed 1 commit to main • 7e62769…0e69843 •

on Jan 31

Kaggle Notebook | Implement GPT2-124M and Load HF GPT2 | Version 15

Aisukopushed 1 commit to main • 99665b0…7e62769 •

on Jan 31

add weight decay, only for 2D params, and add fused AdamW

Aisukopushed 1 commit to main • b004e3b…99665b0 •

on Jan 31

Add learning rate scheduler

Aisukopushed 1 commit to main • dc0d493…b004e3b •

on Jan 30

add torch compile and bfloat16

Aisukopushed 1 commit to main • e555b17…dc0d493 •

on Jan 30

Support gpt2 initialisation with std=0.2 parameter

Aisukopushed 1 commit to main • 50199fb…e555b17 •

on Jan 29

fix the issue weight tie the embedding and unembedding matrix

Aisukopushed 1 commit to main • e004406…50199fb •

on Jan 29

Add data loader

Aisukopushed 1 commit to main • 4ba9840…e004406 •

on Jan 29

Add 50 training loop with a little batch

Aisukopushed 1 commit to main • ea0bff4…4ba9840 •

on Jan 29

calculate the loss function: cross entropy loss

Aisukopushed 1 commit to main • 8149b08…ea0bff4 •

on Jan 29

Do sampling from the customize model

Aisukopushed 1 commit to main • 92d5064…8149b08 •

on Jan 29

add forward() function of GPT2 nn.Module

Aisukopushed 1 commit to main • 9c0f183…92d5064 •

on Jan 29

Kaggle Notebook | Implement GPT2-124M and Load HF GPT2 | Version 4

Aisukopushed 1 commit to main • 573d89d…9c0f183 •

on Jan 29

Kaggle Notebook | Implement GPT2-124M | Version 2

Aisukopushed 1 commit to main • 85f5807…573d89d •

on Jan 29

Kaggle Notebook | Build a Tokenizer | Version 3

Aisukopushed 1 commit to main • fef15bb…85f5807 •

on Jan 27

Kaggle Notebook | Anomaly Detection on Bank Card Fraud | Version 2

Aisukopushed 1 commit to main • d8985f3…fef15bb •

on Jan 21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add fine-tuning an embedding model for domain knowledge

add confusion matrix and decision boundary

Kaggle Notebook | Implement GPT2-124M and Load HF GPT2 | Version 20

Add speech recognition via Whisper

Add visualisation for a simple multiple perceptron

add back propagation and training functions

Init commit for multilayer perceptron with numpy

Kaggle Notebook | Tokenize the Data for Pre-training Neural Network |…

add DistributedDataParallel training

Add the gradient accumulation minimal example

hidden code comment

add acknowledgement

add temporal fusion transformer with Pytorch Forecasting

show the code cell in the notebook

reduce the micro batch number to fit lower memory GPU

Kaggle Notebook | Implement GPT2-124M and Load HF GPT2 | Version 15

add weight decay, only for 2D params, and add fused AdamW

Add learning rate scheduler

add torch compile and bfloat16

Support gpt2 initialisation with std=0.2 parameter

fix the issue weight tie the embedding and unembedding matrix

Add data loader

Add 50 training loop with a little batch

calculate the loss function: cross entropy loss

Do sampling from the customize model

add forward() function of GPT2 nn.Module

Kaggle Notebook | Implement GPT2-124M and Load HF GPT2 | Version 4

Kaggle Notebook | Implement GPT2-124M | Version 2

Kaggle Notebook | Build a Tokenizer | Version 3

Kaggle Notebook | Anomaly Detection on Bank Card Fraud | Version 2