Boosting 4-bit inference kernels with 2:4 Sparsity

Cuda 71 5 Updated Sep 4, 2024

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.

Python 771 62 Updated Sep 4, 2024

CUDA on non-NVIDIA GPUs

Rust 11,005 705 Updated Mar 17, 2025

A general-purpose CV-based framework for extracting precise subtitle timelines from videos with embedded subtitles, from video to .ass file.

Python 54 2 Updated Mar 16, 2025

🍒 Cherry Studio is a desktop client that supports multiple LLM providers. Supports deepseek-r1.

TypeScript 20,387 1,709 Updated Mar 23, 2025

LLM inference in C/C++

C++ 77,034 11,170 Updated Mar 23, 2025

An Xposed module to intercept applist detections

Kotlin 3,636 310 Updated Mar 9, 2025

The Magic Mask for Android

C++ 51,615 13,467 Updated Mar 22, 2025

A Magic Mask to Alter Android System Systemless-ly

C++ 664 65 Updated Mar 19, 2025

Automatic screenshots for Magisk on Android

Shell 4 Updated Dec 13, 2024

DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought

210 9 Updated Dec 31, 2024

Open-source calculator for LLM system requirements.

Python 142 23 Updated Dec 18, 2024

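An OpenAI API management and distribution system, adapted from songquanpeng/one-api. Supports more models, adds a statistics page, and improves function calling for non-OpenAI models.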

Go 1,863 350 Updated Mar 20, 2025

An extensible, general-purpose novel downloader.

TypeScript 1,133 104 Updated Mar 14, 2025

Epson ESC/P-R Driver (generic driver)

C 12 3 Updated Nov 1, 2024

Unitree (宇树科技, Yushu Technology) Go1 development notes

TeX 342 77 Updated Dec 2, 2023

A userscript for downloading artworks from Pixiv and other websites.

TypeScript 80 1 Updated Mar 22, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 13,050 883 Updated Mar 22, 2025
Python 112 13 Updated Dec 28, 2024

Adds 3D acceleration support for P106-090 / P106-100 / P104-100 / P104-101 / P102-100 / CMP 30HX / CMP 40HX / CMP 50HX / CMP 70HX / CMP 90HX / CMP 170HX mining cards as well as RTX 3060 3840SP, RTX…

PowerShell 578 49 Updated Mar 13, 2025

📝 A simple and elegant markdown editor, available for Linux, macOS and Windows.

JavaScript 49,006 3,602 Updated Aug 18, 2024

An app that brings language models directly to your phone.

TypeScript 2,888 277 Updated Mar 14, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 37,580 4,311 Updated Mar 23, 2025
Python 350 110 Updated Oct 19, 2023

NVIDIA Linux open GPU with P2P support

C 1,057 105 Updated Dec 18, 2024

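An offline OCR command-line program for Windows that recognizes text in images and outputs the results as JSON strings, making it easy for other programs to call. Provides APIs for various languages. Compiled from PaddleOCR C++.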

C++ 1,108 142 Updated Oct 15, 2024

A proxy setup adapted for AutoDL platform servers, using Clash as the proxy tool.

Shell 238 28 Updated Nov 16, 2024

Yuan 2.0 Large Language Model

Python 685 87 Updated Jul 11, 2024

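🚀 Next Generation AI One-Stop Internationalization Solution. 🚀 A next-generation, one-stop AI solution for business (B-end) and consumer (C-end) use, supporting OpenAI, Midjourney, Claude, iFlytek Spark, Stable Diffusion, DALL·E, ChatGLM, Tongyi Qianwen, Tencent Hunyuan, 360 Zhinao, Baichuan AI, Volcano Ark, New Bing, Gemini, Moonshot …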

TypeScript 8,098 1,091 Updated Mar 19, 2025

Open source eGPU dock for ROG Ally and ROG Flow

C 460 42 Updated Feb 2, 2025