Lolicore: Simple, Playable and Flexible Language Model

Lolicore is an experimental language model written with JAX and Flax.

You can train a mixture-of-experts model that supports sharding and parallel training on both TPU and GPU devices by running the following commands:

# clone repo
git clone https://github.com/Psykura/Lolicore
cd Lolicore
# install deps
uv sync
# start training
uv run train.py

Or fetch the repository's train.sh script and run it in one step:

curl -LsSf https://raw.githubusercontent.com/Psykura/Lolicore/refs/heads/main/train.sh | sh
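
For readers new to the term, here is a minimal sketch of what a mixture-of-experts layer with top-k routing can look like in JAX. It is not taken from transformer.py: the function names (`init_moe_params`, `moe_forward`), the parameter layout, and the default `top_k=2` are all assumptions made for illustration, and the sketch runs every expert on every token for simplicity rather than dispatching tokens or sharding experts across devices.

```python
import jax
import jax.numpy as jnp


def init_moe_params(key, d_model, d_hidden, num_experts):
    """Create router and per-expert feed-forward weights (hypothetical layout)."""
    k_router, k_in, k_out = jax.random.split(key, 3)
    scale = 0.02
    return {
        "router": scale * jax.random.normal(k_router, (d_model, num_experts)),
        "w_in": scale * jax.random.normal(k_in, (num_experts, d_model, d_hidden)),
        "w_out": scale * jax.random.normal(k_out, (num_experts, d_hidden, d_model)),
    }


def moe_forward(params, x, top_k=2):
    """Apply a top-k routed mixture of experts to x of shape (tokens, d_model)."""
    logits = x @ params["router"]                        # (tokens, experts)
    top_logits, top_idx = jax.lax.top_k(logits, top_k)   # (tokens, top_k) each
    gates = jax.nn.softmax(top_logits, axis=-1)          # normalise over chosen experts

    # Scatter the gates into a dense (tokens, experts) matrix;
    # experts that were not selected get weight zero.
    num_experts = logits.shape[-1]
    one_hot = jax.nn.one_hot(top_idx, num_experts)       # (tokens, top_k, experts)
    weights = jnp.einsum("tk,tke->te", gates, one_hot)

    # Run every expert on every token (simple, not efficient), then mix.
    hidden = jax.nn.gelu(jnp.einsum("td,edh->teh", x, params["w_in"]))
    expert_out = jnp.einsum("teh,ehd->ted", hidden, params["w_out"])
    output = jnp.einsum("te,ted->td", weights, expert_out)
    return output, weights


if __name__ == "__main__":
    params = init_moe_params(jax.random.PRNGKey(0), d_model=8, d_hidden=16, num_experts=4)
    tokens = jax.random.normal(jax.random.PRNGKey(1), (5, 8))
    out, routing = moe_forward(params, tokens)
    print(out.shape, routing.shape)
```

Each row of `routing` sums to 1 and has at most `top_k` non-zero entries, so every token is processed by a small subset of the experts. A real training setup would normally add a load-balancing loss and avoid computing the unselected experts.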
