thecodingmage/Diffusion-SLM

Download Dataset

Dataset: FineWeb

Download script:

from huggingface_hub import snapshot_download

folder = snapshot_download(
    "HuggingFaceFW/fineweb",
    repo_type="dataset",
    local_dir="./fineweb/",
    # replace "sample/10BT/*" with "sample/100BT/*" to use the 100BT sample
    allow_patterns="sample/10BT/*",
)
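The allow_patterns argument restricts the download to files matching a shell-style glob, which is why only the 10BT sample is fetched instead of the full dataset. A minimal sketch of that matching behavior (the shard names below are illustrative, not real FineWeb file names):

```python
from fnmatch import fnmatch

# Illustrative repo paths; actual FineWeb shard names differ.
repo_files = [
    "data/CC-MAIN-2023-50/000_00000.parquet",
    "sample/10BT/000_00000.parquet",
    "sample/100BT/000_00000.parquet",
]

pattern = "sample/10BT/*"
selected = [f for f in repo_files if fnmatch(f, pattern)]
print(selected)  # only the 10BT sample shard matches
```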

References

  1. Llada-from-Scratch

Utils

  1. Avoid fragmentation: export PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True
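The same setting can be applied from inside a training script instead of the shell; it must take effect before the first CUDA allocation, so place it at the very top, before importing torch:

```python
import os

# Equivalent to the shell export above; set before torch is imported so the
# allocator picks it up at its first CUDA allocation.
os.environ["PYTORCH_CUDA_ALLOC_CONF"] = "expandable_segments:True"
```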

Working Guide

  1. Create a venv: python -m venv venv, then activate it (venv\Scripts\activate on Windows, source venv/bin/activate on Linux/macOS)
  2. Run the setup script: cd setup && ./setup.sh
  3. Place the training data, then run python prepareData.py
  4. Train the tokenizer: python tokenizer.py
  5. Train the LLaDA model: CUDA_VISIBLE_DEVICES=1 python train2.py
  6. Eval 1: python sample.py
  7. Eval 2: python eval.py
  8. Launch the app: python app.py
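Step 5 trains a LLaDA-style masked-diffusion model. The core of that training objective can be sketched without any framework: sample a masking ratio t uniformly, mask each token independently with probability t, and train the model to recover the masked positions from bidirectional context. A minimal sketch (MASK_ID is a hypothetical placeholder; the real tokenizer defines its own mask token):

```python
import random

MASK_ID = 0  # hypothetical mask-token id; the trained tokenizer defines the real one

def forward_mask(token_ids, t, rng=random):
    """LLaDA-style forward process: mask each token independently with prob. t.

    Returns the corrupted sequence and, for each masked position, the
    original token the model should predict (None elsewhere).
    """
    masked = [MASK_ID if rng.random() < t else tok for tok in token_ids]
    targets = [tok if m == MASK_ID else None for tok, m in zip(token_ids, masked)]
    return masked, targets

# Each training step draws its own masking ratio t ~ Uniform(0, 1).
t = random.random()
```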

About

A Diffusion-based Small Language Model (SLM) built using the LLaDA framework. Currently featuring a 1B-parameter architecture trained on FineWeb 10BT with a focus on bidirectional context and denoising efficiency.
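The denoising direction mentioned above can be sketched as iterative unmasking: generation starts from an all-mask sequence, the model fills in every masked position in parallel, and the lowest-confidence predictions are remasked for the next step until the sequence is fully resolved. A simplified sketch with a stand-in predictor, not the repo's sample.py:

```python
def diffusion_sample(predict, length, steps, mask_id=-1):
    """Iteratively denoise an all-mask sequence.

    predict(seq) must return a (token, confidence) pair for every position.
    Uses a linear unmasking schedule: about length // steps more tokens are
    kept fixed after each step; everything else is remasked and retried.
    """
    seq = [mask_id] * length
    for step in range(steps, 0, -1):
        preds = predict(seq)
        # Fill every masked position with the model's current prediction.
        for i, tok in enumerate(seq):
            if tok == mask_id:
                seq[i] = preds[i][0]
        # Keep only the highest-confidence tokens; remask the rest.
        keep = length - (step - 1) * (length // steps)
        order = sorted(range(length), key=lambda i: -preds[i][1])
        for i in order[keep:]:
            seq[i] = mask_id
    return seq

# Stand-in predictor that always proposes token i+10 at position i.
def toy_predict(seq):
    return [(i + 10, 1.0) for i in range(len(seq))]

print(diffusion_sample(toy_predict, 4, 2))  # fully unmasked after 2 steps
```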
