Python 1 Updated Mar 29, 2024

A minimum demo for PyTorch distributed extension functionality for collectives.

C++ 11 2 Updated Jul 29, 2024

Reproduction of DeepSeek-R1

Python 174 17 Updated Mar 24, 2025

NVIDIA Inference Xfer Library (NIXL)

C++ 208 20 Updated Mar 26, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 3,357 229 Updated Mar 28, 2025
Python 1 1 Updated Aug 11, 2024

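⭐ [Open-source book] An in-depth guide to kernel networking, Kubernetes, ServiceMesh, containers, and other cloud-native technologies. A practice-tested DevOps and SRE guide. If you find an error, please open an issue.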

JavaScript 8,001 546 Updated Mar 19, 2025
C++ 25 2 Updated Feb 22, 2025

A lightweight, powerful framework for multi-agent workflows

Python 7,602 873 Updated Mar 28, 2025

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Python 1,048 70 Updated Mar 28, 2025

A Rust RDMA library.

Rust 10 3 Updated Mar 18, 2025

Redis for LLMs

Python 659 73 Updated Mar 28, 2025

[TBD] "m4: A Learned Flow-level Network Simulator" by Chenning Li, Anton A. Zabreyko, Arash Nasr-Esfahany, Kevin Zhao, Prateesh Goyal, Mohammad Alizadeh, Thomas Anderson.

C++ 6 Updated Mar 9, 2025

InferX is an Inference Function as a Service Platform

Rust 3 Updated Mar 11, 2025

PerFlow-AI is a programmable performance analysis, modeling, and prediction tool for AI systems.

Python 18 2 Updated Mar 20, 2025
C++ 2 Updated Dec 8, 2024

Postgres-Native Data Warehouse

C++ 1,208 32 Updated Mar 28, 2025

Knowledge management for the impatient

Rust 23 3 Updated Mar 12, 2025

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 924 121 Updated Mar 28, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 1,454 103 Updated Mar 27, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 1,761 114 Updated Mar 27, 2025

REPETITA: Repeatable Experiments for Performance Evaluation of Traffic-Engineering Algorithms

Scala 32 17 Updated Sep 12, 2023

A lightweight data processing framework built on DuckDB and 3FS.

Python 4,434 388 Updated Mar 5, 2025

Main source code repository of the Tamarin prover for security protocol verification.

Haskell 452 137 Updated Mar 26, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 8,389 814 Updated Mar 27, 2025
2 Updated Feb 10, 2025

Analyze computation-communication overlap in V3/R1.

970 130 Updated Mar 21, 2025

Expert Parallelism Load Balancer

Python 1,108 177 Updated Mar 24, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,675 281 Updated Mar 10, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,107 536 Updated Mar 28, 2025