Skip to content
View kannon92's full-sized avatar
  • Red Hat
  • Cleveland, Ohio

Block or report kannon92

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Kubernetes-native Job Queueing

Go 2 1 Updated Mar 4, 2025

KJob: Tool for CLI-loving ML researchers

Go 20 5 Updated Mar 4, 2025
JavaScript 3 Updated Jan 24, 2025

Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)

Go 464 225 Updated Jan 16, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 1,045 92 Updated Mar 4, 2025

A tool to detect infrastructure issues on cloud native AI systems

Python 24 15 Updated Feb 27, 2025

A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. All in a modern, AI-native editor.

Python 11,014 419 Updated Mar 4, 2025

Blazingly fast LLM inference.

Rust 5,128 362 Updated Mar 3, 2025

xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.

Python 105 32 Updated Mar 4, 2025

Distribute and run LLMs with a single file.

C++ 21,872 1,149 Updated Jan 30, 2025

Tensor library for machine learning

C++ 12,009 1,154 Updated Mar 4, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 40,202 6,021 Updated Mar 4, 2025

Heterogeneous AI Computing Virtualization Middleware

Go 1,336 266 Updated Mar 4, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 130,947 10,737 Updated Mar 4, 2025

LLM inference in C/C++

C++ 75,802 10,961 Updated Mar 4, 2025

Example DRA driver that developers can fork and modify to get them started writing their own.

Go 61 42 Updated Mar 4, 2025

batch-simulator is a Golang CLI tool that simulates the lifecycle of Kubernetes API resources, such as Nodes, Pods, etc. using KWOK

Go 4 Updated May 15, 2024

LeaderWorkerSet: An API for deploying a group of pods as a unit of replication

Go 304 51 Updated Feb 24, 2025

Configuration data used to build OCP images

40 218 Updated Mar 4, 2025

A unified tool for collecting system logs and other debug information

Python 523 556 Updated Mar 4, 2025

This is the public roadmap for AWS container services (ECS, ECR, Fargate, and EKS).

Shell 5,249 324 Updated Aug 4, 2023

JobSet: a k8s native API for distributed ML training and HPC workloads

Go 192 63 Updated Mar 3, 2025

A multi-cluster batch queuing system for high-throughput workloads on Kubernetes.

Go 502 139 Updated Mar 4, 2025

📕 Clarity in the current fast-paced mess of Open Source innovation

TeX 1,547 89 Updated Jan 20, 2025

A multi-sandbox container runtime that provides cloud-native, all-scenario multiple sandbox container solutions.

Rust 1,297 95 Updated Feb 28, 2025

CLI and validation tools for Kubelet Container Runtime Interface (CRI) .

Go 1,752 460 Updated Mar 3, 2025

An OCI container runtime monitor.

C 431 128 Updated Mar 3, 2025

Core components in the OCM project. Report here if you found any issues in OCM.

Go 836 100 Updated Mar 4, 2025
Next
Showing results