LLMs
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
✨✨Latest Advances on Multimodal Large Language Models
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
SGLang is a high-performance serving framework for large language models and multimodal models.
【TMM 2025🔥】 Mixture-of-Experts for Large Vision-Language Models
Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)
Modeling, training, eval, and inference code for OLMo
MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Large World Model -- Modeling Text and Video with Millions Context
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
lightweight, standalone C++ inference engine for Google's Gemma models.
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
A series of large language models trained from scratch by developers @01-ai
Interact with your documents using the power of GPT, 100% privately, no data leaks
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
