  • Red Hat
  • Cleveland, Ohio

13 stars written in Python

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 41,229 6,214 Updated Mar 12, 2025

A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. All in a modern, AI-native editor.

Python 11,173 428 Updated Mar 12, 2025

Distributed ML Training and Fine-Tuning on Kubernetes

Python 1,707 745 Updated Mar 11, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 1,078 96 Updated Mar 12, 2025

Backend.AI is a streamlined, container-based computing cluster platform that hosts popular computing/ML frameworks and diverse programming languages, with pluggable heterogeneous accelerator suppor…

Python 544 159 Updated Mar 12, 2025

A unified tool for collecting system logs and other debug information

Python 523 557 Updated Mar 10, 2025

Native Kubernetes integration for Dask

Python 316 151 Updated Feb 24, 2025

JobSet: a k8s native API for distributed ML training and HPC workloads

Python 194 65 Updated Mar 11, 2025

xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool that helps Cloud developers orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.

Python 106 32 Updated Mar 12, 2025

An Airflow operator that executes a task in a Kubernetes cluster, given a Kubernetes YAML configuration or an image reference.

Python 58 9 Updated Dec 2, 2023

Deploy a Flux MiniCluster to Kubernetes with the operator

Python 31 8 Updated Mar 4, 2025

A tool to detect infrastructure issues on cloud native AI systems

Python 26 16 Updated Feb 27, 2025

Create and manage your Notebooks on Kubernetes with ease.

Python 21 Updated Mar 10, 2023