-
Kuaishou Technology (Kwai Inc.)
- Beijing, China
- https://hymeric.github.io
Stars
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
A high-throughput and memory-efficient inference and serving engine for LLMs
verl: Volcano Engine Reinforcement Learning for LLMs
Witness the aha moment of VLM with less than $3.
Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)
Fast inference engine for Transformer models
FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion
Official inference library for Mistral models
从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)
From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)
pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用
Simple tutorials on Pytorch DDP training
《大模型白盒子构建指南》:一个全手搓的Tiny-Universe
Odyssey: Empowering Minecraft Agents with Open-World Skills
Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
Chinese version of GPT2 training code, using BERT tokenizer.
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
ai副业赚钱大集合,教你如何利用ai做一些副业项目,赚取更多额外收益。The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English versi…