
AntGroup
- Beijing
- https://www.antfin.com/
Starred repositories
Ongoing research training transformer models at scale
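This project aims to share the technical principles and hands-on experience behind large models (large-model engineering and real-world deployment of large-model applications)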
Cost-efficient and pluggable Infrastructure components for GenAI inference
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
My learning notes and code for ML systems.
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.
verl: Volcano Engine Reinforcement Learning for LLMs
Interactive roadmaps, guides and other educational content to help developers grow in their careers.
Model Context Protocol Servers
Simple, unified interface to multiple Generative AI providers
NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
GPUd automates monitoring, diagnostics, and issue identification for GPUs
🐶 Command-line DNS Client for Humans. Written in Golang
Slim(toolkit): Don't change anything in your container image and minify it by up to 30x (and for compiled languages even more) making it secure too! (free and open source)
vCluster - Create fully functional virtual Kubernetes clusters - Each vcluster runs inside a namespace of the underlying k8s cluster. It's cheaper than creating separate full-blown clusters and it …
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
📐 Convert Golang's http.Request to CURL command line
eBPF-based Networking, Security, and Observability
Simplify Kubernetes application operations with one-stop observability services, including resource delivery SLOs, root cause diagnosis, container lifecycle tracing, and more.
ModelScope: bring the notion of Model-as-a-Service to life.