List of Open-Source Finetuned Large Language Models

This repository contains a curated (incomplete) list of open-source and finetuned Large Language Models.

LLaMA (Meta)

LLaMA (Large Language Model Meta AI), a state-of-the-art foundational large language model designed to help researchers advance their work in this subfield of AI. Smaller, more performant models such as LLaMA enable others in the research community who don’t have access to large amounts of infrastructure to study these models, further democratizing access in this important, fast-changing field.

LLaMA Website: Introducing LLaMA: A foundational, 65-billion-parameter language model (facebook.com)

Alpaca (Stanford)

We are releasing our findings about an instruction-following language model, dubbed Alpaca, which is fine-tuned from Meta’s LLaMA 7B model. We train the Alpaca model on 52K instruction-following demonstrations generated in the style of self-instruct using text-davinci-003. On the self-instruct evaluation set, Alpaca shows many behaviors similar to OpenAI’s text-davinci-003, but is also surprisingly small and easy/cheap to reproduce.

Website: https://crfm.stanford.edu/2023/03/13/alpaca.html
GitHub: https://github.com/tatsu-lab/stanford_alpaca

Alpaca-LoRA

This repository contains code for reproducing the Stanford Alpaca results using low-rank adaptation (LoRA). We provide an Instruct model of similar quality to text-davinci-003 that can run on a Raspberry Pi (for research), and the code is easily extended to the 13b, 30b, and 65b models.

Baize

Koala

Vicuna (FastChat)

We introduce Vicuna-13B, an open-source chatbot trained by fine-tuning LLaMA on user-shared conversations collected from ShareGPT. Preliminary evaluation using GPT-4 as a judge shows Vicuna-13B achieves more than 90%* quality of OpenAI ChatGPT and Google Bard while outperforming other models like LLaMA and Stanford Alpaca in more than 90%* of cases. The cost of training Vicuna-13B is around $300. The code and weights, along with an online demo, are publicly available for non-commercial use.

llama.cpp

LLama.cpp, allows users to run the LLaMA model on their local computers using C/C++. According to the documentation, llama.cpp supports the following models and runs on moderately speed PCs:

LLaMA | Alpaca | GPT4All | Vicuna | Koala | OpenBuddy (Multilingual) | Pygmalion 7B / Metharme 7B

GitHub: ggerganov/llama.cpp: Port of Facebook’s LLaMA model in C/C++ (github.com)

LLaMA-Adapter V2

GitHub: ZrrSkywalker/LLaMA-Adapter: Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters (github.com)

Lit-LLaMA ️

GitHub: Lightning-AI/lit-llama: Implementation of the LLaMA language model based on nanoGPT. Supports quantization, LoRA fine-tuning, pre-training. Apache 2.0-licensed. (github.com)

StableVicuna

StackLLaMA

Website: https://huggingface.co/blog/stackllama

StableLM (StabilityAI)

GPT4All

GTP4All is an ecosystem to train and deploy powerful and customized large language models that run locally on consumer grade CPUs. The goal is simple - be the best instruction tuned assistant-style language model that any person or enterprise can freely use, distribute and build on.

GPT-J (EleutherAI)

GitHub: https://github.com/kingoflolz/mesh-transformer-jax/#gpt-j-6b

GPT4All-J

GitHub: nomic-ai/gpt4all: gpt4all: an ecosystem of open-source chatbots trained on a massive collections of clean assistant data including code, stories and dialogue (github.com)

GPT-NeoX (EleutherAI)

GitHub: EleutherAI/gpt-neox: An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library. (github.com)

Pythia (EleutherAI)

GitHub: EleutherAI/pythia (github.com)

Dolly 2.0 (Databricks)

Databricks’ Dolly is an instruction-following large language model trained on the Databricks machine learning platform that is licensed for commercial use. Based on pythia-12b, Dolly is trained on ~15k instruction/response fine tuning records databricks-dolly-15k generated by Databricks employees in capability domains from the InstructGPT paper, including brainstorming, classification, closed QA, generation, information extraction, open QA and summarization.

GutHub: dolly/data at master · databrickslabs/dolly (github.com)
Blog post: Free Dolly: Introducing the World's First Truly Open Instruction-Tuned LLM
Hugging Face: databricks (Databricks) (huggingface.co)

OpenAssistant Models

Open Assistant is a project meant to give everyone access to a great chat based large language model. We believe that by doing this we will create a revolution in innovation in language. In the same way that stable-diffusion helped the world make art and images in new ways we hope Open Assistant can help improve the world by improving language itself.

Replit-Code (Replit)

Hugging Face: https://huggingface.co/replit/replit-code-v1-3b

Segment Anything (Meta)

We aim to democratize segmentation by introducing the Segment Anything project: a new task, dataset, and model for image segmentation, as we explain in our research paper. We are releasing both our general Segment Anything Model (SAM) and our Segment Anything 1-Billion mask dataset (SA-1B), the largest ever segmentation dataset, to enable a broad set of applications and foster further research into foundation models for computer vision.

StartCoder (BigCode)

Website: https://huggingface.co/bigcode
Hugging Face: https://huggingface.co/spaces/bigcode/bigcode-editor and https://huggingface.co/spaces/bigcode/bigcode-playground

BLOOM (BigScience)

Hugging Face: bigscience/bloom · Hugging Face

Flamingo (Google/Deepmind)

Website: Tackling multiple tasks with a single visual language model
GitHub: https://github.com/lucidrains/flamingo-pytorch

FLAN (Google)

GitHub: google-research/FLAN (github.com)

FastChat-T5

GitHub: lm-sys/FastChat: The release repo for “Vicuna: An Open Chatbot Impressing GPT-4” (github.com)

Flan-Alpaca

GitHub: declare-lab/flan-alpaca: This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as Flan-T5. (github.com)

Commercial Use LLMs

Bloom | StableLM-Alpha | FastChat-T5 |

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

List of Open-Source Finetuned Large Language Models

About

Uh oh!

Releases

Packages

IntellectsAI/open-source-llms

Folders and files

Latest commit

History

Repository files navigation

List of Open-Source Finetuned Large Language Models

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages