
Starred repositories
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
Calculate token/s & GPU memory requirements for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
🐶 Kubernetes CLI To Manage Your Clusters In Style!
FUSE-based file system backed by Amazon S3
NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs
Asynchronous HTTP client/server framework for asyncio and Python
KuntaiDu / vllm
Forked from vllm-project/vllm. A high-throughput and memory-efficient inference and serving engine for LLMs
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
An easy-to-use PyTorch to TensorRT converter
A Kubernetes media gateway for WebRTC. Contact: info@l7mp.io
Open-Source Low-Latency Accelerated Linux WebRTC HTML5 Remote Desktop Streaming Platform for Self-Hosting, Containers, Kubernetes, or Cloud/HPC
**Official** 李宏毅 (Hung-yi Lee) Machine Learning 2022 Spring
Model Compression Toolbox for Large Language Models and Diffusion Models
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
OneDiff: An out-of-the-box acceleration library for diffusion models.
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
PyTorch native quantization and sparsity for training and inference
FlashInfer: Kernel Library for LLM Serving
A modern replacement for Redis and Memcached
Open, Multi-Cloud, Multi-Cluster Kubernetes Orchestration
Hackable and optimized Transformers building blocks, supporting a composable construction.