LLM720 is a second-generation Large Language Model that aims to be:
- Open
- Interpretable
- Energy efficient
All three goals follow from the same guiding design: make the model as sparse as possible, so that you can see exactly which weights contribute to each output while using no more compute than necessary.
We aim to accomplish this with a fine-grained Mixture-of-Experts architecture (He, 2024), combined with the efficient attention mechanism pioneered by DeepSeek's Multi-Head Latent Attention (DeepSeek-AI, 2024). The project is also exploring expanding model weights into system memory while maintaining performance, building on He's Mixture of a Million Experts architecture, with the goal of making frontier models less bound by GPU memory capacity.
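To make the routing idea concrete, here is a minimal, illustrative PyTorch sketch of fine-grained top-k routing over many tiny experts whose weights stay in system memory. The class names, sizes, and the `FineGrainedMoE`/`TinyExpert` interfaces are assumptions for illustration only, not the project's actual implementation.

```python
# Illustrative sketch only: fine-grained MoE routing with CPU-resident experts.
# Names, shapes, and hyperparameters are assumptions, not the LLM720 codebase.
import torch
import torch.nn as nn
import torch.nn.functional as F


class TinyExpert(nn.Module):
    """A very small feed-forward expert; fine-grained MoE uses many of these."""
    def __init__(self, d_model: int, d_hidden: int):
        super().__init__()
        self.up = nn.Linear(d_model, d_hidden, bias=False)
        self.down = nn.Linear(d_hidden, d_model, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.down(F.silu(self.up(x)))


class FineGrainedMoE(nn.Module):
    """Routes each token to top_k of num_experts tiny experts.

    Experts live on `expert_device` (e.g. CPU / system memory); only the
    experts selected for the current batch are evaluated, so compute scales
    with top_k rather than with the total number of experts.
    """
    def __init__(self, d_model=512, d_hidden=64, num_experts=1024,
                 top_k=8, expert_device="cpu"):
        super().__init__()
        self.top_k = top_k
        self.expert_device = expert_device
        self.router = nn.Linear(d_model, num_experts, bias=False)
        self.experts = nn.ModuleList(
            TinyExpert(d_model, d_hidden).to(expert_device)
            for _ in range(num_experts)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (tokens, d_model)
        scores = self.router(x)                             # (tokens, num_experts)
        weights, idx = torch.topk(scores, self.top_k, dim=-1)
        weights = F.softmax(weights, dim=-1)                 # normalize over selected experts
        out = torch.zeros_like(x)
        for e in idx.unique().tolist():                      # touch only the selected experts
            token_rows, slots = (idx == e).nonzero(as_tuple=True)
            inp = x[token_rows].to(self.expert_device)
            y = self.experts[e](inp).to(x.device)
            out[token_rows] += weights[token_rows, slots].unsqueeze(-1) * y
        return out


if __name__ == "__main__":
    moe = FineGrainedMoE()
    tokens = torch.randn(16, 512)
    print(moe(tokens).shape)  # torch.Size([16, 512])
```

The key property the sketch illustrates is that the router, not the expert pool, determines per-token compute: growing `num_experts` increases the parameters held in system memory without increasing the work done per token.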
LLM720 takes its name from LLM360, whose torch of completely open-sourced model development we intend to carry (Liu et al., 2023).
- Installation: See docs/INSTALLATION.md
- Architecture: See docs/ARCHITECTURE.md
- Configuration: See docs/CONFIGURATION.md
- Full Documentation: Browse the docs/ directory
Thank you to @lambdal for providing compute for this project.