Skip to content
View HymEric's full-sized avatar

Block or report HymEric

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 15,776 1,836 Updated Mar 2, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,607 365 Updated Mar 26, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 42,859 6,501 Updated Mar 27, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 5,756 571 Updated Mar 27, 2025

Witness the aha moment of VLM with less than $3.

Python 3,409 265 Updated Mar 1, 2025

Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)

Python 9,890 1,394 Updated Jul 31, 2023

Fast inference engine for Transformer models

C++ 3,706 341 Updated Mar 20, 2025

FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion

Python 118 12 Updated Mar 18, 2025
Python 185 Updated Sep 11, 2024

Official inference library for Mistral models

Jupyter Notebook 10,135 908 Updated Mar 20, 2025

从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)

Python 400 52 Updated Mar 23, 2025

Thsis-vocab_32k_gpt2_moe

Python 4 1 Updated Jul 5, 2024

From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)

Jupyter Notebook 684 73 Updated Oct 30, 2024

pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用

Python 94 15 Updated Mar 16, 2024

Simple tutorials on Pytorch DDP training

Python 275 49 Updated Aug 19, 2022

《大模型白盒子构建指南》:一个全手搓的Tiny-Universe

Python 2,591 275 Updated Feb 12, 2025

Odyssey: Empowering Minecraft Agents with Open-World Skills

Python 301 19 Updated Mar 6, 2025

Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料

946 81 Updated Oct 17, 2022

Chinese version of GPT2 training code, using BERT tokenizer.

Python 7,548 1,705 Updated Apr 25, 2024

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

HTML 8,096 765 Updated Oct 16, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,346 519 Updated May 3, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 16,395 1,146 Updated Mar 14, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,994 2,411 Updated Aug 12, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 45,468 5,552 Updated Mar 27, 2025

Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用

Python 14,510 1,299 Updated Sep 5, 2024

Inference code for Llama models

Python 57,946 9,720 Updated Jan 26, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 16,543 2,394 Updated Mar 26, 2025

汇总各大互联网公司容易考察的高频leetcode题🔥

19,079 2,718 Updated Mar 13, 2024

该仓库主要记录 NLP 算法工程师相关的面试题

2,523 513 Updated Oct 10, 2023

ai副业赚钱大集合,教你如何利用ai做一些副业项目,赚取更多额外收益。The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English versi…

14,477 1,329 Updated Dec 21, 2024
Next
Showing results