Skip to content
View inisis's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report inisis

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Open-source high-performance RISC-V processor

Scala 6,157 744 Updated Mar 7, 2025

A Toolkit to Help Optimize Onnx Model

Python 121 13 Updated Feb 24, 2025

mnn tts demo.

C++ 9 Updated Feb 25, 2025

mnn asr demo.

C++ 13 Updated Dec 31, 2024

nndeploy is an end-to-end model inference and deployment framework. It aims to provide users with a powerful, easy-to-use, high-performance, and mainstream framework compatible model inference and …

C++ 706 104 Updated Mar 6, 2025

llm deploy project based onnx.

C++ 31 7 Updated Oct 9, 2024

用于学习GOT/Qwen/OnnxLLm

Python 47 2 Updated Oct 8, 2024

caffe model to onnx

Python 33 12 Updated Nov 16, 2022

real time face swap and one-click video deepfake with only a single image

Python 44,488 6,551 Updated Mar 6, 2025

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 10,445 1,044 Updated Sep 26, 2024

Verifile

Rust 1 Updated Aug 10, 2024

Large Language Model Onnx Inference Framework

Python 31 1 Updated Jan 12, 2025

TypeScript 43 9 Updated Feb 13, 2025

State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!

JavaScript 13,148 870 Updated Mar 7, 2025

FlagGems is an operator library for large language models implemented in Triton Language.

Python 441 73 Updated Mar 7, 2025

[ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.

Python 105 15 Updated May 16, 2024

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…

C++ 9,970 1,772 Updated Mar 7, 2025

An efficient GPU support for LLM inference with x-bit quantization (e.g. FP6,FP5).

Cuda 237 17 Updated Oct 28, 2024

Detect CPU features with single-file

C 357 40 Updated Mar 7, 2025

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 9,180 1,501 Updated Aug 9, 2024

File consistency proofreader

Vue 4 Updated Aug 19, 2024

llm deploy project based mnn. This project has merged into MNN.

C++ 1,559 173 Updated Jan 20, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 40,663 6,121 Updated Mar 7, 2025
Shell 4 Updated Feb 7, 2025

llm-export can export llm model to onnx.

Python 270 30 Updated Jan 17, 2025

PyTorch implementation of AlphaZero Chess from scratch

Python 145 28 Updated Aug 7, 2024

A Toolkit to Help Optimize Large Onnx Model

Python 153 9 Updated May 16, 2024

Everything in Torch Fx

Python 342 65 Updated Jun 7, 2024
Next
Showing results