Skip to content
View kannon92's full-sized avatar
  • Red Hat
  • Cleveland, Ohio

Block or report kannon92

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Kubernetes-native Job Queueing

Go 2 1 Updated Mar 1, 2025

KJob: Tool for CLI-loving ML researchers

Go 20 5 Updated Feb 26, 2025
JavaScript 3 Updated Jan 24, 2025

Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)

Go 461 225 Updated Jan 16, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 1,032 91 Updated Mar 1, 2025

A tool to detect infrastructure issues on cloud native AI systems

Python 23 15 Updated Feb 27, 2025

A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.

Python 10,954 416 Updated Mar 1, 2025

Blazingly fast LLM inference.

Rust 5,111 361 Updated Mar 2, 2025

xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.

Python 105 31 Updated Mar 1, 2025

Distribute and run LLMs with a single file.

C++ 21,850 1,146 Updated Jan 30, 2025

Tensor library for machine learning

C++ 11,995 1,150 Updated Feb 28, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 39,864 5,972 Updated Mar 2, 2025

Heterogeneous AI Computing Virtualization Middleware

Go 1,325 263 Updated Feb 27, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 130,413 10,672 Updated Mar 2, 2025

LLM inference in C/C++

C++ 75,628 10,929 Updated Mar 1, 2025

Example DRA driver that developers can fork and modify to get them started writing their own.

Go 61 41 Updated Feb 28, 2025

batch-simulator is a Golang CLI tool that simulates the lifecycle of Kubernetes API resources, such as Nodes, Pods, etc. using KWOK

Go 4 Updated May 15, 2024

LeaderWorkerSet: An API for deploying a group of pods as a unit of replication

Go 296 51 Updated Feb 24, 2025

Configuration data used to build OCP images

40 218 Updated Feb 28, 2025

A unified tool for collecting system logs and other debug information

Python 522 556 Updated Feb 28, 2025

This is the public roadmap for AWS container services (ECS, ECR, Fargate, and EKS).

Shell 5,248 323 Updated Aug 4, 2023

JobSet: a k8s native API for distributed ML training and HPC workloads

Go 192 61 Updated Mar 1, 2025

A multi-cluster batch queuing system for high-throughput workloads on Kubernetes.

Go 502 139 Updated Feb 28, 2025

📕 Clarity in the current fast-paced mess of Open Source innovation

TeX 1,546 89 Updated Jan 20, 2025

A multi-sandbox container runtime that provides cloud-native, all-scenario multiple sandbox container solutions.

Rust 1,294 96 Updated Feb 28, 2025

CLI and validation tools for Kubelet Container Runtime Interface (CRI) .

Go 1,751 460 Updated Feb 28, 2025

An OCI container runtime monitor.

C 431 128 Updated Feb 25, 2025

Core components in the OCM project. Report here if you found any issues in OCM.

Go 834 100 Updated Feb 28, 2025
Next
Showing results