This repository has moved to Here
Azuki.ai is a fully customizable AI.
Everything can be changed, starting from the dataset stage.
This is a solo build, so I'll probably run out of steam soon.
Someone please help: extend the dataset, suggest new features, implement new features.
The name comes from the Japanese word "あずき" (azuki).
"あずき" means "red beans" in English.
- Load GPT-2 and its tokenizer.
- Add a dataset for fine-tuning and train the model on it.
- Export the new model.
- Done! That is all you need.
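The steps above can be sketched roughly as follows. This is only an illustration, not the project's actual `training.py`: it assumes the Hugging Face `transformers` library and PyTorch are installed, and the function names (`batches`, `fine_tune`), the `texts` argument, the output directory, and the hyperparameters are all hypothetical.

```python
def batches(items, size):
    """Yield consecutive slices of `items` with at most `size` elements each."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def fine_tune(texts, out_dir="azuki-model", epochs=1, batch_size=4, lr=5e-5):
    """Fine-tune GPT-2 on a list of strings and save the result to `out_dir`.

    Assumption: Hugging Face `transformers` and PyTorch are available.
    """
    # Imported here so the helper above stays usable without these packages.
    import torch
    from transformers import GPT2LMHeadModel, GPT2TokenizerFast

    # 1. Load GPT-2 and its tokenizer.
    tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
    tokenizer.pad_token = tokenizer.eos_token  # GPT-2 ships without a pad token
    model = GPT2LMHeadModel.from_pretrained("gpt2")

    # 2. Train the model on the dataset.
    optimizer = torch.optim.AdamW(model.parameters(), lr=lr)
    model.train()
    for _ in range(epochs):
        for batch_texts in batches(texts, batch_size):
            enc = tokenizer(batch_texts, return_tensors="pt",
                            padding=True, truncation=True, max_length=128)
            labels = enc["input_ids"].clone()
            labels[enc["attention_mask"] == 0] = -100  # ignore padding in the loss
            loss = model(**enc, labels=labels).loss
            loss.backward()
            optimizer.step()
            optimizer.zero_grad()

    # 3. Export the new model (and its tokenizer) to disk.
    model.save_pretrained(out_dir)
    tokenizer.save_pretrained(out_dir)
```

Calling `fine_tune(["some text", "more text"])` would download the base GPT-2 weights on first use, so it needs network access and a fair amount of memory.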
- Base Script and Structure
- Essential Plugin (lets the LLM use external compute, e.g., calculating with Python, assisting with your coding, and more!)
- Chat Model (for Chat Plugin)
- CLI Optimization
- GUI Mode
- Chat Server Mode (e.g., coding assistance in VS Code through an Azuki.ai server)
- Import of Your Own Documents
- Image Plugin (lets the model see images)
- Chat Plugin (prompt continuation)
- SM Model (Former name "XS")
- JP Model
- MD Model
- LF (Life) Model (Main Model)
- LG Model
- XL Model
- CD Model
And more!
- SM Model: Can be run on some smartphones, and almost all PCs from 2015 onwards
- CD Model: Unknown
- MD Model: Unknown
- LG Model: Incomplete
- XL Model: Incomplete
- JP Model: Incomplete
- LF Model: Azuki.ai all-in-one model (requires a high-spec PC)
If you like this project, please give it a star.
It will help us spread our project to more people.
Please download it from this repo.
To make this project bigger, we need to make the dataset bigger. Please help out.
The dataset is divided into the following seven categories:
- Life (lf) : The next generation of the AI, for high-spec PCs. (Parts of my life and of contributors' lives are added to the Life model; this is the biggest undertaking.)
- Small (sm) : A small, highly efficient dataset for mobile devices (e.g., generating sentence continuations)
- Code (cd) : Python Knowledge (Small Model for Coding Assistant)
- Medium (md) : A medium-sized, slightly smart dataset for low-spec PCs (e.g., solving common sense problems)
- Large (lg) : A large, smart dataset for medium-spec PCs (e.g., solving general problems)
- Extra Large (xl) : An extra-large, high-spec dataset for a Mac M1 or so (e.g., solving math problems for high school students)
- Japanese (jp) : Japanese Model
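For scripts that need to pick a dataset by its short code, the categories above could be kept in a small lookup table. This is a sketch, not code from the repository: `DATASET_SIZES` and `dataset_filename` are hypothetical names, and it assumes the short codes are what fills in `{size}` in the `data-{size}.json` file name.

```python
# Short code -> category name, as listed above.
DATASET_SIZES = {
    "lf": "Life",
    "sm": "Small",
    "cd": "Code",
    "md": "Medium",
    "lg": "Large",
    "xl": "Extra Large",
    "jp": "Japanese",
}


def dataset_filename(size):
    """Return the dataset file name for a short code (assumed naming pattern)."""
    if size not in DATASET_SIZES:
        raise ValueError(f"unknown dataset size: {size!r}")
    return f"data-{size}.json"
```

Rejecting unknown codes early keeps a typo such as `"xs"` from silently pointing at a file that does not exist.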
- execute.py: Runner
- training.py: Training Script
- Download Latest Default Dataset
- Edit data-{size}.json
- Execute Training Script
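The editing step can also be scripted. The sketch below appends entries to a dataset file; it assumes, without confirmation from the repository, that `data-{size}.json` holds a top-level JSON array of text samples. `append_samples` is a hypothetical helper, so check the real file layout before using anything like it.

```python
import json
from pathlib import Path


def append_samples(path, samples):
    """Append `samples` to the JSON array stored at `path`; return its new length."""
    path = Path(path)
    # Start from an empty list if the file does not exist yet.
    data = json.loads(path.read_text(encoding="utf-8")) if path.exists() else []
    if not isinstance(data, list):
        raise TypeError("expected a top-level JSON array")
    data.extend(samples)
    # ensure_ascii=False keeps Japanese text readable in the file.
    path.write_text(json.dumps(data, ensure_ascii=False, indent=2), encoding="utf-8")
    return len(data)
```

For example, `append_samples("data-sm.json", ["A new training sentence."])` would add one entry, after which the training script can be run as usual.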