Tired of spending hours setting up your LLM fine-tuning environment? Kolo automates the entire process, getting you up and running in just 5 minutes with zero hassle. Get started instantly—whether you're an AI researcher, developer, or just experimenting with fine-tuning, Kolo makes it effortless.
Kolo is built using a powerful stack of LLM tools:
- Unsloth – Open-source LLM fine-tuning with faster training and lower VRAM usage.
- Torchtune – Native PyTorch library for LLM fine-tuning, with support for AMD GPU and CPU training.
- Llama.cpp – C/C++ tooling for converting and quantizing LLMs into GGUFs for easy testing and deployment.
- Ollama – Portable, user-friendly LLM model management and deployment software.
- Docker – Containerized environment that automatically sets up the entire LLM development stack, with the necessary tools and dependencies pre-installed and scripts that make fine-tuning and testing easy.
- Open WebUI – Self-hosted web interface for LLM testing.
- Operating System: Windows 10 or later, or Linux
- Graphics Card: NVIDIA GPU with CUDA 12.1 support and at least 8GB of VRAM
- AMD GPU Users: Linux is required; Windows WSL2 does not support ROCm.
- Memory: 16GB or more of system RAM
Kolo may work on other configurations, but your results may vary. Let us know!
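Before building, you can sanity-check the RAM and GPU-driver requirements with a short stdlib-only Python sketch (the helper names here are illustrative, not part of Kolo; the RAM check assumes Linux or WSL):

```python
import os
import shutil

def bytes_to_gib(n: int) -> float:
    """Convert a byte count to GiB."""
    return n / (1024 ** 3)

def total_ram_gib() -> float:
    """Total physical RAM in GiB (POSIX sysconf; works on Linux/WSL)."""
    return bytes_to_gib(os.sysconf("SC_PAGE_SIZE") * os.sysconf("SC_PHYS_PAGES"))

def has_nvidia_driver() -> bool:
    """True if the NVIDIA driver tools (nvidia-smi) are on PATH."""
    return shutil.which("nvidia-smi") is not None

if __name__ == "__main__":
    print(f"RAM: {total_ram_gib():.1f} GiB (16+ recommended)")
    print(f"NVIDIA driver detected: {has_nvidia_driver()}")
```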
Join our Discord group!
Ensure Hyper-V is installed.
Ensure WSL 2 is installed; alternative guide.
Ensure Docker Desktop is installed.
Ensure Docker Desktop or the Docker CLI is installed.
Install ROCm on Linux.
To build the image, run:
./build_image.ps1
If you are using an AMD GPU, use the following command instead:
./build_image_amd.ps1
Note: Only Torchtune supports AMD GPUs for fine-tuning.
If running for the first time:
./create_and_run_container.ps1
If you are using an AMD GPU, use the following command instead:
./create_and_run_container_amd.ps1
For subsequent runs:
./run_container.ps1
./copy_training_data.ps1 -f examples/God.jsonl -d data.jsonl
Don't have training data? Check out our synthetic QA data generation guide!
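As a rough illustration of the JSONL format, each line is a standalone JSON object. The exact schema depends on the chat template you train with, so treat the keys below as an assumed question/answer shape, not Kolo's required format:

```python
import json

# Assumed QA-style records; adapt the keys to your chat template.
examples = [
    {"question": "Who created the universe?", "answer": "According to this dataset, God."},
    {"question": "What is Kolo?", "answer": "A Dockerized LLM fine-tuning environment."},
]

# One JSON object per line, no enclosing array.
with open("data.jsonl", "w", encoding="utf-8") as f:
    for record in examples:
        f.write(json.dumps(record) + "\n")

# Read it back to verify every line parses on its own.
with open("data.jsonl", encoding="utf-8") as f:
    parsed = [json.loads(line) for line in f]
```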
./train_model_unsloth.ps1 -OutputDir "GodOutput" -Quantization "Q4_K_M" -TrainData "data.jsonl"
All available parameters:
./train_model_unsloth.ps1 -Epochs 3 -LearningRate 1e-4 -TrainData "data.jsonl" -BaseModel "unsloth/Llama-3.2-1B-Instruct-bnb-4bit" -ChatTemplate "llama-3.1" -LoraRank 16 -LoraAlpha 16 -LoraDropout 0 -MaxSeqLength 1024 -WarmupSteps 10 -SaveSteps 500 -SaveTotalLimit 5 -Seed 1337 -SchedulerType "linear" -BatchSize 2 -OutputDir "GodOutput" -Quantization "Q4_K_M" -WeightDecay 0
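To build intuition for -LoraRank and -LoraAlpha: LoRA freezes the base weights and trains two small low-rank matrices per adapted layer, so the trainable parameter count for a d_out x d_in weight is rank * (d_in + d_out). A quick sketch (the dimensions are illustrative, not taken from the model above):

```python
def lora_trainable_params(d_in: int, d_out: int, rank: int) -> int:
    """Parameters LoRA adds to one d_out x d_in weight matrix:
    a (d_out x rank) matrix B plus a (rank x d_in) matrix A."""
    return rank * (d_in + d_out)

# Example: a 2048x2048 projection with -LoraRank 16
full = 2048 * 2048                              # frozen base weights
lora = lora_trainable_params(2048, 2048, 16)    # trainable adapter weights
print(f"LoRA trains {lora / full:.2%} of this matrix's parameters")
```

In the standard LoRA convention the adapter update is scaled by alpha/rank, so -LoraAlpha 16 with -LoraRank 16 gives a scaling factor of 1.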
Requirements: Create a Hugging Face account and generate an access token. You will also need permission from Meta to use their models: search for the base model name on the Hugging Face website and request access before training.
./train_model_torchtune.ps1 -OutputDir "GodOutput" -Quantization "Q4_K_M" -TrainData "data.json" -HfToken "your_token"
If you are using an AMD GPU, use the following command instead:
./train_model_torchtune.ps1 -GpuArch "gfx90a" -OutputDir "GodOutput" -Quantization "Q4_K_M" -TrainData "data.json" -HfToken "your_token"
All available parameters:
./train_model_torchtune.ps1 -HfToken "your_token" -Epochs 3 -LearningRate 1e-4 -TrainData "data.json" -BaseModel "Meta-llama/Llama-3.2-1B-Instruct" -LoraRank 16 -LoraAlpha 16 -LoraDropout 0 -MaxSeqLength 1024 -WarmupSteps 10 -Seed 1337 -SchedulerType "cosine" -BatchSize 2 -OutputDir "GodOutput" -Quantization "Q4_K_M" -WeightDecay 0
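-SchedulerType "cosine" with -WarmupSteps 10 typically means a linear warmup followed by cosine decay to zero. A minimal sketch of that shape (an assumed, common convention, not Kolo's exact implementation):

```python
import math

def lr_at(step: int, warmup: int, total: int, base_lr: float) -> float:
    """Linear warmup to base_lr over `warmup` steps, then cosine decay to 0."""
    if step < warmup:
        return base_lr * step / warmup
    progress = (step - warmup) / (total - warmup)
    return base_lr * 0.5 * (1.0 + math.cos(math.pi * progress))

# With -LearningRate 1e-4 and -WarmupSteps 10 over 100 total steps:
schedule = [lr_at(s, warmup=10, total=100, base_lr=1e-4) for s in range(101)]
```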
Note: If re-training with the same OutputDir, delete the existing directory first:
./delete_model.ps1 "GodOutput" -Tool "unsloth|torchtune"
For more information about fine-tuning parameters, please refer to the Fine Tune Training Guide.
./install_model.ps1 "God" -Tool "unsloth" -OutputDir "GodOutput" -Quantization "Q4_K_M"
./install_model.ps1 "God" -Tool "torchtune" -OutputDir "GodOutput" -Quantization "Q4_K_M"
Open your browser and navigate to localhost:8080
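Besides Open WebUI, you can also query the installed model through Ollama's HTTP API (it listens on port 11434 by default). A stdlib-only sketch; only run the network call once the container is up:

```python
import json
import urllib.request

def build_generate_request(model: str, prompt: str) -> dict:
    """Payload for Ollama's POST /api/generate endpoint."""
    return {"model": model, "prompt": prompt, "stream": False}

def ask(model: str, prompt: str, host: str = "http://localhost:11434") -> str:
    """Send a prompt to a locally installed Ollama model and return its reply."""
    payload = json.dumps(build_generate_request(model, prompt)).encode()
    req = urllib.request.Request(
        f"{host}/api/generate", data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]

# With the container running and the "God" model installed:
#   print(ask("God", "Hello!"))
```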
Uninstalls the model from Ollama.
./uninstall_model.ps1 "God"
Lists all models installed in Ollama, along with the training model directories for both Torchtune and Unsloth.
./list_models.ps1
Copies all the scripts and files inside /scripts into the Kolo container at /app/
./copy_scripts.ps1
Copies all the Torchtune config files inside /torchtune into the Kolo container at /app/torchtune
./copy_configs.ps1
To quickly SSH into the Kolo container for installing additional tools or running scripts directly:
./connect.ps1
If prompted for a password, use:
123
Alternatively, you can connect manually via SSH:
ssh root@localhost -p 2222
You can use WinSCP or any other SFTP file manager to access the Kolo container’s file system. This allows you to manage, modify, add, or remove scripts and files easily.
Connection Details:
- Host: localhost
- Port: 2222
- Username: root
- Password: 123
This setup ensures you can easily transfer files between your local machine and the container.