Hands on Large(?) Language model from scratch

tutorial: LLM basics from scratch provide step by step explanation.

how to run

prepare data

Download from Google drive: https://drive.google.com/drive/folders/1IaD_SIIB-K3Sij_-JjWoPy_UrWqQRdjx

download the file named openwebtext.tar.xz from google drive link and extract all the .xz files in folder openWebTextCorpus.

The files should look like:

after you have downloaded and extracted files above, in terminal:

python convert_data.py

The program automatically convert all the .xz files you have extracted in folder openWebTextCorpus and put the converted .txt files in folder data. Since we are using [neetbox][neetbox] for monitoring, open localhost:20202 (neetbox's default port) in your browser and you can check the progresses:

train

python train.py --config gptv1_s.toml

Since we are using neetbox for monitoring, open localhost:20202 (neetbox's default port) in your browser and you can check the progresses:

predict

python inference.py --config gptv1_s.toml

Open localhost:20202 (neetbox's default port) in your browser and feed text to your model via action button.

further

more information see also LLM basics from scratch

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
config		config
data		data
imgs/readme		imgs/readme
openWebTextCorpus		openWebTextCorpus
.gitignore		.gitignore
convert_data.py		convert_data.py
inference.py		inference.py
model.py		model.py
readme.md		readme.md
requirements.txt		requirements.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

config

config

data

data

imgs/readme

imgs/readme

openWebTextCorpus

openWebTextCorpus

.gitignore

.gitignore

convert_data.py

convert_data.py

inference.py

inference.py

model.py

model.py

readme.md

readme.md

requirements.txt

requirements.txt

train.py

train.py

Repository files navigation

Hands on Large(?) Language model from scratch

how to run

prepare data

train

predict

further

About

Languages

visualDust/naive-llm-from-scratch

Folders and files

Latest commit

History

Repository files navigation

Hands on Large(?) Language model from scratch

how to run

prepare data

train

predict

further

About

Topics

Resources

Stars

Watchers

Forks

Languages