Skip to content
View zrbcool's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report zrbcool

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

verl: Volcano Engine Reinforcement Learning for LLMs

Python 5,763 577 Updated Mar 27, 2025

Fully open reproduction of DeepSeek-R1

Python 23,398 2,127 Updated Mar 27, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 5,928 582 Updated Mar 27, 2025

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 809 50 Updated Mar 19, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,320 679 Updated Mar 27, 2025

Accessible large language models via k-bit quantization for PyTorch.

Python 6,862 676 Updated Mar 27, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 2,305 388 Updated Mar 27, 2025

Zero Bubble Pipeline Parallelism

Python 375 22 Updated Mar 4, 2025

Pipeline Parallelism for PyTorch

Python 760 87 Updated Aug 21, 2024

Multi-GPU CUDA stress test

C++ 1,617 317 Updated Aug 20, 2024

Use PEFT or Full-parameter to finetune 500+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…

Python 6,585 565 Updated Mar 27, 2025

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 13,949 988 Updated Mar 17, 2025

Prometheus exporter that mines /proc to report on selected processes

Go 1,853 287 Updated Jan 10, 2025

科学上网🕸️之跑路机场名单收集(2020-2025),欢迎投稿。Ad🔗🈲🙅❌

3,101 55 Updated Mar 2, 2025

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 959 142 Updated Mar 27, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 45,476 5,553 Updated Mar 27, 2025

北京联通IPTV相关脚本

Python 21 6 Updated Jun 1, 2020

北京电信IPTV播放列表 Beijing Telecom IPTV playlist bj-telecom-iptv.m3u

53 12 Updated Dec 31, 2021

Official implementation of "Towards Efficient Visual Adaption via Structural Re-parameterization".

Python 180 24 Updated Apr 18, 2024

A tool for bandwidth measurements on NVIDIA GPUs.

C++ 393 34 Updated Feb 7, 2025

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 11,605 730 Updated Dec 17, 2024

Optimized primitives for collective multi-GPU communication

C++ 3,603 886 Updated Mar 24, 2025

A GPU performance profiling tool for PyTorch models

Python 506 51 Updated Jul 13, 2021

中国大模型

6,015 511 Updated Nov 30, 2024

Example models using DeepSpeed

Python 6,390 1,074 Updated Mar 27, 2025

A userspace out-of-memory killer

C++ 1,863 148 Updated Mar 21, 2025

Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集

Python 3,049 234 Updated Apr 14, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,900 4,053 Updated Jul 17, 2024

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

C++ 72,924 7,947 Updated Mar 19, 2025

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,380 223 Updated Mar 20, 2024
Next
Showing results