GitHub - DiamondGotCat/Azuki.ai: Azuki.ai - A community-driven generative AI project

This repository has moved here.

Azuki.ai is a fully customizable AI.

Everything can be changed, starting from the dataset.

We need your help

This is a solo build, so I'll probably run out of steam soon.

Please help by extending the dataset, suggesting new features, or implementing them.

What's with the name!?

The name comes from the Japanese word "あずき" (azuki).

"あずき" means "red beans" in English.

How does Azuki.ai work?

Load GPT-2 and its tokenizer.
Add a dataset for fine-tuning, and train the model on it.
Export the new model.
Done! That is all you need.
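
The steps above could look roughly like the following. This is a minimal sketch, not the contents of training.py: it assumes the Hugging Face `transformers` and `torch` packages, assumes the dataset is a JSON list of plain strings, and uses hypothetical names (`load_texts`, `fine_tune`) and illustrative hyperparameters.

```python
import json
from pathlib import Path


def load_texts(dataset_path):
    """Read training texts from a JSON file.

    Assumption: the file holds a JSON list of strings. The real
    dataset layout may differ.
    """
    with open(dataset_path, encoding="utf-8") as f:
        data = json.load(f)
    return [t for t in data if isinstance(t, str) and t.strip()]


def fine_tune(dataset_path, output_dir="trained_model", epochs=1, lr=5e-5):
    """Fine-tune GPT-2 on the texts and save the result (hypothetical helper)."""
    # Heavy dependencies are imported lazily so load_texts works without them.
    import torch
    from transformers import GPT2LMHeadModel, GPT2TokenizerFast

    texts = load_texts(dataset_path)
    if not texts:
        raise ValueError("dataset contains no usable texts")

    # Step 1: load GPT-2 and its tokenizer.
    tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
    tokenizer.pad_token = tokenizer.eos_token  # GPT-2 has no pad token by default
    model = GPT2LMHeadModel.from_pretrained("gpt2")

    # Step 2: tokenize the dataset and train on it (one batch, for brevity).
    batch = tokenizer(texts, return_tensors="pt", padding=True,
                      truncation=True, max_length=128)
    labels = batch["input_ids"].clone()
    labels[batch["attention_mask"] == 0] = -100  # ignore padding in the loss

    optimizer = torch.optim.AdamW(model.parameters(), lr=lr)
    model.train()
    for _ in range(epochs):
        optimizer.zero_grad()
        loss = model(input_ids=batch["input_ids"],
                     attention_mask=batch["attention_mask"],
                     labels=labels).loss
        loss.backward()
        optimizer.step()

    # Step 3: export the new model together with its tokenizer.
    Path(output_dir).mkdir(parents=True, exist_ok=True)
    model.save_pretrained(output_dir)
    tokenizer.save_pretrained(output_dir)
    return output_dir
```

Calling `fine_tune("data-sm.json")` would download the base GPT-2 weights on first use and write the fine-tuned model to the chosen output directory. A real run would normally split the data into mini-batches instead of training on one large batch.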

Roadmap

And more!

Required Specs

SM Model: Runs on some smartphones and on almost all PCs from 2015 onward
CD Model: Unknown
MD Model: Unknown
LG Model: Incomplete
XL Model: Incomplete
JP Model: Incomplete
LF Model: Azuki.ai all-in-one model (needs a high-spec PC)

We need stars!

If you like it, please click the star right away.

It will help us spread our project to more people.

Latest default dataset for Azuki.ai

Please download it from this repo.

Dataset Contribution

To grow this project, we need a bigger dataset. Please help out.

NOTE

The dataset is divided into the following seven categories:

Life (lf) : The next generation of the AI, for high-spec PCs. (It adds parts of my life and of contributors' lives to the Life model; this is the biggest undertaking.)
Small (sm) : A small, highly efficient dataset for mobile devices (e.g., generating sentence continuations)
Code (cd) : Python knowledge (a small model for a coding assistant)
Medium (md) : A medium-sized, slightly smart dataset for low-spec PCs (e.g., solving common-sense problems)
Large (lg) : A large, smart dataset for medium-spec PCs (e.g., solving general problems)
Extra Large (xl) : An extra-large dataset for high-spec machines such as an M1 Mac (e.g., solving high-school math problems)
Japanese (jp) : Japanese model
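
As a small illustration, the category codes can be kept in a lookup table and turned into dataset file names. This sketch assumes that `{size}` in `data-{size}.json` (see "Customize Output") is the two-letter code in parentheses; `CATEGORIES` and `dataset_filename` are hypothetical names, not part of the repository.

```python
# Codes and names are taken from the category list in this README.
CATEGORIES = {
    "lf": "Life",
    "sm": "Small",
    "cd": "Code",
    "md": "Medium",
    "lg": "Large",
    "xl": "Extra Large",
    "jp": "Japanese",
}


def dataset_filename(code):
    """Return the dataset file name for a category code.

    Assumption: {size} in data-{size}.json is the two-letter code.
    """
    code = code.lower()
    if code not in CATEGORIES:
        raise ValueError(f"unknown category code: {code!r}")
    return f"data-{code}.json"
```

For example, `dataset_filename("sm")` gives `data-sm.json`, and an unknown code raises `ValueError` instead of silently producing a file name that does not exist.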

Files

execute.py: Runner
training.py: Training Script

Customize Output

Download Latest Default Dataset
Edit data-{size}.json
Execute Training Script
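
The edit-and-train steps can also be scripted. The sketch below assumes the dataset file is a JSON list of strings (the actual layout may differ, so check the downloaded file first) and that training.py runs without arguments; `add_examples` and `run_training` are hypothetical helpers.

```python
import json
import subprocess
import sys
from pathlib import Path


def add_examples(dataset_path, new_texts):
    """Append new texts to a dataset file and return the new entry count.

    Assumption: the file is a JSON list of strings.
    """
    path = Path(dataset_path)
    data = json.loads(path.read_text(encoding="utf-8")) if path.exists() else []
    if not isinstance(data, list):
        raise ValueError("expected a JSON list; adapt this sketch to the real layout")
    for text in new_texts:
        if text not in data:  # skip entries that are already present
            data.append(text)
    # ensure_ascii=False keeps Japanese text readable in the file.
    path.write_text(json.dumps(data, ensure_ascii=False, indent=2), encoding="utf-8")
    return len(data)


def run_training(script="training.py"):
    """Run the training script with the current Python interpreter.

    No command-line options are passed, since none are documented here.
    """
    return subprocess.run([sys.executable, script], check=True)
```

A typical session would call `add_examples("data-sm.json", ["My new training sentence."])` and then `run_training()` from the repository root.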

