hustshawn
🎯 Focusing

Organizations

@cloudnativeto

VS Code in the browser

TypeScript 70,215 5,806 Updated Mar 12, 2025

Incomplete list of macOS `defaults` commands with demos ✨

HTML 1,209 64 Updated Mar 13, 2025

Serverless DeepSeek R1 Inference with FastAPI and Lambda SnapStart

Python 2 Updated Mar 11, 2025

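This project aims to share the technical principles behind large language models along with hands-on experience (LLM engineering and deploying LLM applications in production)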

HTML 15,231 1,757 Updated Mar 2, 2025

A network filesystem client to connect to SSH servers

C 6,539 506 Updated Mar 11, 2025

Cost-efficient and pluggable Infrastructure components for GenAI inference

Jupyter Notebook 3,132 287 Updated Mar 12, 2025

Public repo for DeepLearning.AI MLEP Specialization

Jupyter Notebook 1,878 2,367 Updated Oct 28, 2024

A Java wrapper to run Spring, Spring Boot, Jersey, and other apps inside AWS Lambda.

Java 1,512 563 Updated Mar 10, 2025

Windows for ARM in a Docker container.

Shell 1,440 134 Updated Mar 13, 2025

This is a plugin that lets EC2 developers use libfabric as a network provider while running NCCL applications.

C 167 60 Updated Mar 13, 2025

The open-source AIOps and alert management platform

Python 9,676 891 Updated Mar 13, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 4,940 492 Updated Mar 11, 2025

Analyze computation-communication overlap in V3/R1.

923 118 Updated Mar 3, 2025

🚀 PR-Agent (Qodo Merge open-source): An AI-Powered 🤖 Tool for Automated Pull Request Analysis, Feedback, Suggestions and More! 💻🔍

Python 7,167 740 Updated Mar 12, 2025

Run any AWS Lambda function as a Large Language Model (LLM) tool without code changes using Anthropic's Model Context Protocol (MCP).

Python 40 7 Updated Mar 10, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 2,370 247 Updated Mar 13, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,281 792 Updated Mar 1, 2025

This is a concept showing how you can use zonal autoshift's integration with EventBridge to automatically remove zones from a Karpenter node pool when an autoshift is underway

Go 1 1 Updated Feb 28, 2025

Advanced Quantization Algorithm for LLMs/VLMs.

Python 389 30 Updated Mar 13, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 17,397 1,442 Updated Feb 25, 2025

AWS Workshop for Learning EKS for Greater China

Makefile 145 97 Updated Dec 4, 2023

Example AWS Resource control policies to get started or mature your usage of AWS RCPs.

171 20 Updated Feb 21, 2025

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 760 99 Updated Mar 13, 2025

Common Expression Language -- specification and binary representation

Starlark 3,074 240 Updated Mar 7, 2025

AWS plugins for Backstage

TypeScript 88 17 Updated Mar 13, 2025

LeaderWorkerSet: An API for deploying a group of pods as a unit of replication

Go 329 53 Updated Mar 13, 2025

Amazon EMR on EKS Custom Image CLI

Python 28 10 Updated Sep 26, 2024

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Python 69,022 8,491 Updated Feb 25, 2025