
Organizations

@cubenlp

Starred repositories

Minimal hackable GRPO implementation

Python 166 21 Updated Jan 31, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 5,373 531 Updated Mar 7, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 1,202 62 Updated Mar 7, 2025

Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…

Python 1,758 156 Updated Mar 7, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

6,658 193 Updated Mar 4, 2025

Fully open data curation for reasoning models

Python 1,460 124 Updated Feb 23, 2025

A replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,085 227 Updated Feb 19, 2025

An AI Hedge Fund Team

Python 13,481 2,417 Updated Mar 7, 2025

Fully open reproduction of DeepSeek-R1

Python 22,323 2,000 Updated Mar 7, 2025

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 33,789 2,400 Updated Mar 7, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,016 1,403 Updated Feb 1, 2025

Train a 1B LLM on 1T tokens from scratch as an individual

Jupyter Notebook 559 59 Updated Feb 24, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,475 242 Updated Feb 20, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,090 623 Updated Feb 10, 2025

Financial portfolio optimisation in python, including classical efficient frontier, Black-Litterman, Hierarchical Risk Parity

Jupyter Notebook 4,814 989 Updated Mar 6, 2025

📈 The largest collection of industrial defect detection datasets and papers to date. Continuously summarizes the open-source datasets and key papers that matter most in surface defect research.

Python 3,426 554 Updated May 27, 2024
Python 17 Updated Jul 7, 2023

Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…

Python 6,081 521 Updated Mar 7, 2025

Deep Learning Deployment Framework: supports tf/torch/trt/trtllm/vllm and other NN frameworks. Supports dynamic batching and streaming modes. It is dual-language compatible with Python and C++, off…

C++ 156 13 Updated Feb 28, 2025

LLM101n: Let's build a Storyteller

32,260 1,744 Updated Aug 1, 2024

TrustRAG: The RAG Framework with Reliable Input, Trusted Output

Python 716 78 Updated Mar 7, 2025

Building a quick conversation-based search demo with Lepton AI.

TypeScript 8,027 1,024 Updated Jan 14, 2025

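This project shares the technical principles behind large models along with hands-on experience (large-model engineering and deploying large-model applications)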

HTML 15,007 1,738 Updated Mar 2, 2025

Play with ChatGPT and other LLMs on the Xiaomi AI Speaker

Python 6,486 901 Updated Oct 30, 2024

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 11,860 1,061 Updated Mar 2, 2025

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

7,302 430 Updated Jul 28, 2024

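LLM API management and distribution system. Supports mainstream models including OpenAI, Azure, Anthropic Claude, Google Gemini, DeepSeek, ByteDance Doubao, ChatGLM, ERNIE Bot, iFlytek Spark, Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan, with unified API adaptation; can be used for key management and redistribution. Single executable, Docker image provided, one-click deployment, works out of the box. LLM API management & k…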

JavaScript 23,402 4,895 Updated Feb 21, 2025

The official code for "Aurora: Activating Chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"

Python 260 21 Updated May 9, 2024

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 1,562 271 Updated Jan 16, 2024