Peking University
Beijing, China
Stars
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
verl: Volcano Engine Reinforcement Learning for LLMs
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization…
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
A tiny yet powerful LLM inference system tailored for research purposes. vLLM-equivalent performance with only 2k lines of code (2% of vLLM).
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.
LaTeX template for dissertations in Peking University
zhiyunyao / pkuthss
Forked from CasperVector/pkuthss
LaTeX template for dissertations in Peking University
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A curated list for Efficient Large Language Models
Zero Bubble Pipeline Parallelism
Development repository for the Triton language and compiler
Large World Model -- Modeling Text and Video with Million-Length Context
Automatic resource configuration for serverless workflows.
Survey Paper List - Efficient LLM and Foundation Models
SGLang is a fast serving framework for large language models and vision language models.
[TMLR 2024] Efficient Large Language Models: A Survey
Branch Prediction Pin tool, implementing 2-bit saturating counter and perceptron branch predictors.
The official repository for the gem5 computer-system architecture simulator.
A C version of a branch predictor simulator
A repository for research on medium sized language models.
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
Codebase for Merging Language Models (ICML 2024)