Starred repositories
verl: Volcano Engine Reinforcement Learning for LLMs
Fully open reproduction of DeepSeek-R1
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
DeepEP: an efficient expert-parallel communication library
Accessible large language models via k-bit quantization for PyTorch.
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…
Zero Bubble Pipeline Parallelism
Use PEFT or full-parameter training to finetune 500+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Prometheus exporter that mines /proc to report on selected processes
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Beijing Telecom IPTV playlist bj-telecom-iptv.m3u
Official implementation of "Towards Efficient Visual Adaption via Structural Re-parameterization".
A tool for bandwidth measurements on NVIDIA GPUs.
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Optimized primitives for collective multi-GPU communication
A GPU performance profiling tool for PyTorch models
Example models using DeepSpeed
Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pre-training / instruction fine-tuning datasets
Code and documentation to train Stanford's Alpaca models, and generate the data.
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Ongoing research training transformer language models at scale, including: BERT & GPT-2