Skip to content
View ZZfive's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report ZZfive

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

LLMs

142 repositories

Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"

1,076 54 Updated Sep 27, 2025

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,693 293 Updated Aug 14, 2024

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

Python 864 52 Updated May 8, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,210 2,703 Updated Nov 3, 2025

Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…

Python 1,489 144 Updated Mar 7, 2025

Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI

Python 1,407 69 Updated Apr 11, 2024

Llama 2 Everywhere (L2E)

C 1,527 45 Updated Aug 27, 2025

✨✨Latest Advances on Multimodal Large Language Models

17,350 1,112 Updated Feb 23, 2026

LLM&VLM Tutorial

Python 1,934 1,512 Updated May 5, 2025

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python 7,157 507 Updated Oct 30, 2025

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 28,303 2,824 Updated Feb 10, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 23,658 4,524 Updated Feb 23, 2026

【TMM 2025🔥】 Mixture-of-Experts for Large Vision-Language Models

Python 2,303 142 Updated Jul 15, 2025

Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)

Python 629 43 Updated Dec 30, 2024

Modeling, training, eval, and inference code for OLMo

Python 6,318 701 Updated Nov 24, 2025

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,653 546 Updated Feb 11, 2026

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,326 1,011 Updated Jul 1, 2024

Large World Model -- Modeling Text and Video with Millions Context

Python 7,395 560 Updated Oct 19, 2024

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

Python 37,310 6,142 Updated Nov 10, 2025

LLM inference in C/C++

C++ 95,651 15,034 Updated Feb 23, 2026

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 64,370 8,094 Updated Jan 21, 2026

lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 6,734 598 Updated Feb 19, 2026

Mamba SSM architecture

Python 17,222 1,596 Updated Feb 18, 2026

Home of StarCoder2!

Python 2,039 193 Updated Mar 21, 2024

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python 1,676 166 Updated Oct 28, 2024

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,840 487 Updated Nov 27, 2024

Interact with your documents using the power of GPT, 100% privately, no data leaks

Python 57,118 7,609 Updated Nov 13, 2024

Grok open release

Python 51,506 8,487 Updated Aug 30, 2024

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…

Python 36,606 5,923 Updated Feb 23, 2026