Starred repositories
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Research and development (R&D) is crucial for improving industrial productivity, especially in the AI era, where the core of R&D is focused mainly on data and models. We are commi…
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
Fully open data curation for reasoning models
This is a replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
Fully open reproduction of DeepSeek-R1
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Train a 1B-parameter LLM on 1T tokens from scratch as an individual
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Financial portfolio optimisation in Python, including the classical efficient frontier, Black-Litterman, and Hierarchical Risk Parity
📈 Currently the largest collection of industrial defect detection datasets and papers. Continuously summarizing open-source datasets and key papers in the field of surface defect research.
Use PEFT or full-parameter training to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…
Deep learning deployment framework: supports tf/torch/trt/trtllm/vllm and other NN frameworks, with dynamic batching and streaming modes. It is dual-language compatible with Python and C++, off…
TrustRAG: the RAG framework with reliable input and trusted output
Building a quick conversation-based search demo with Lepton AI.
This project aims to share the technical principles behind large language models and hands-on experience (LLM engineering and real-world LLM application deployment)
Use ChatGPT and other LLMs with the Xiaomi AI Speaker
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
The paper list for the 86-page survey "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
LLM API management & distribution system supporting OpenAI, Azure, Anthropic Claude, Google Gemini, DeepSeek, ByteDance Doubao, ChatGLM, ERNIE Bot, iFlytek Spark, Tongyi Qianwen, 360 Zhinao, Tencent Hunyuan, and other mainstream models, with unified API adaptation; usable for key management and redistribution. Ships as a single executable with a Docker image for one-click, out-of-the-box deployment. LLM API management & k…
The official code for "Aurora: Activating Chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models