davzoku/cria

🍼🦙 CRIA

💡 Article | 💻 HuggingFace | 📔 Colab 1,2

Welcome to CRIA, an LLM series based on Llama 2-7B.

What is CRIA?

Hint: krē-ə plural crias; a baby llama, alpaca, vicuña, or guanaco.

With ChatGPT's help, CRIA also stands for "Crafting a Rapid prototype of an Intelligent llm App using open source resources". This encapsulates the objective of this project perfectly.

Additionally, akin to a baby llama in nature, CRIA pays homage to its foundational model, Meta's Llama-2 7B Large Language Model.

Features

  • Demonstrates instruction-tuning of a recent open-source LLM on a custom dataset, using a free Colab instance.
  • Uses FastAPI for model serving and inference.
    • Supports real-time streaming with Server-Sent Events (SSE) for a seamless chat experience.
  • Provides a modern front end built with Next.js and Chakra UI.
  • Supports both local and cloud deployment. (Coming Soon!)
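
To illustrate the SSE streaming feature, here is a minimal sketch of how generated tokens might be framed as Server-Sent Events and read back on the client side. It uses only the Python standard library. The function names, the `token` payload key, and the `done` event are assumptions made for illustration; they are not taken from the CRIA API server.

```python
import json
from typing import Iterable, Iterator, List


def sse_events(tokens: Iterable[str]) -> Iterator[str]:
    """Wrap each generated token in a Server-Sent Events frame."""
    for token in tokens:
        # JSON-encode the payload so a newline inside a token cannot break the frame.
        payload = json.dumps({"token": token})
        yield f"data: {payload}\n\n"
    # Hypothetical end-of-stream marker; the real server may signal completion differently.
    yield "event: done\ndata: {}\n\n"


def parse_sse(stream: str) -> List[dict]:
    """Client-side counterpart: recover the token payloads from a raw SSE stream."""
    payloads = []
    for frame in stream.split("\n\n"):
        lines = frame.splitlines()
        if any(line.startswith("event: done") for line in lines):
            break
        for line in lines:
            if line.startswith("data: "):
                payloads.append(json.loads(line[len("data: "):]))
    return payloads


if __name__ == "__main__":
    raw = "".join(sse_events(["Hello", ",", " world"]))
    print("".join(p["token"] for p in parse_sse(raw)))
```

In a FastAPI app, a generator like `sse_events` could be returned through `StreamingResponse(..., media_type="text/event-stream")`, so the browser receives each token as soon as the model produces it.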

Frontend
Demo: Built with open-source resources such as the Horizon AI Template.

What You'll Find Here

In this repository, you'll find:

Code: Dive into the technical details of our chatbot implementation, including the training process, API server implementation, the integration of Next.js for the user interface, and more.

Documentation: Detailed documentation to help you understand and replicate the CRIA setup, from model selection to deployment considerations.

Demo: Access a live demo showcasing CRIA in action.

Model History

| HuggingFace Model | Model Type | Base Model | Dataset | Colab | Status |
| --- | --- | --- | --- | --- | --- |
| cria-llama2-7b-v1.3, cria-llama2-7b-v1.3_peft | Merged / PEFT | NousResearch/Llama-2-7b-chat-hf | mlabonne/CodeLlama-2-20k | Open In Colab (two notebooks) | Latest |
| cria-llama2-7b-v1.1, cria-llama2-7b-v1.2 | Merged / PEFT | TinyPixel/Llama-2-7B-bf16-sharded | n3rd0/DreamBook_Guanaco_Format | N.A. | Experimental |
| cria-llama2-7b-v1.0 | PEFT | TinyPixel/Llama-2-7B-bf16-sharded | Elliot4AI/dolly-15k-chinese-guanacoformat | N.A. | Experimental |

Documentation

Setup

The instructions to run the various components, such as the API server and frontend interface, can be found at /docs/setup.md.

Deployment

The instructions to deploy the API server and frontend on the cloud can be found at /docs/deployment.md.

Slides

CRIA v1.3 was first presented in a private session on 18 Aug 2023. The slides are publicly available here.

Architectural Overview

The overview of the project can be found at /docs/architecture.md.

Architectural Decision Records (ADR)

Please refer to the /docs/adr/ folder for details on the design decisions made so far.

Model Evaluation

The preliminary model evaluation can be found in the /docs/model-eval/ folder.

References

Guides / Tutorials / Discussions

Datasets

Models
