Skip to content
View king0980692's full-sized avatar
👓
Generated by DALLE-3 ..
👓
Generated by DALLE-3 ..
  • Taipei, Taiwan
  • 08:38 - 8h ahead

Highlights

  • Pro

Block or report king0980692

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.

TypeScript 7,405 547 Updated Mar 22, 2025

Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.

Python 5,926 425 Updated Mar 22, 2025

A lightweight, powerful framework for multi-agent workflows

Python 6,998 767 Updated Mar 24, 2025

OpenAI Assistants API quickstart with Next.js.

TypeScript 1,793 501 Updated Mar 7, 2025

A framework for few-shot evaluation of language models.

Python 8,352 2,234 Updated Mar 23, 2025

Truly independent web browser

C++ 36,280 1,518 Updated Mar 24, 2025

OpenUI let's you describe UI using your imagination, then see it rendered live.

TypeScript 20,148 1,890 Updated Oct 21, 2024

A guidance language for controlling large language models.

Jupyter Notebook 19,926 1,096 Updated Mar 19, 2025

[NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personali…

Python 2,057 165 Updated Mar 6, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 12,254 1,230 Updated Mar 21, 2025

Awesome Reasoning LLM Tutorial/Survey/Guide

Python 1,143 73 Updated Mar 17, 2025

Send push notifications to your phone or desktop using PUT/POST

Go 21,934 871 Updated Sep 29, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 2,455 258 Updated Mar 22, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 10,329 695 Updated Mar 21, 2025

Lightweight yet powerful formatter plugin for Neovim

Lua 3,802 196 Updated Mar 20, 2025

Detect file content types with deep learning

Rust 8,490 439 Updated Mar 19, 2025

Watches files and records, or triggers actions, when they change.

C++ 12,988 1,018 Updated Mar 23, 2025

CLIP model deploy in plain C/C++ using ggml machine learning library

C++ 21 1 Updated Mar 17, 2025

Inference of Mamba models in pure C

C 186 10 Updated Feb 26, 2024

structured outputs for llms

Python 9,869 761 Updated Mar 21, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,360 807 Updated Mar 1, 2025

Optimizing inference proxy for LLMs

Python 2,112 165 Updated Mar 19, 2025

Event notification library

C 11,402 3,413 Updated Mar 1, 2025

Perf monitoring CLI tool for Apple Silicon

Python 3,934 164 Updated Apr 18, 2024

ggml implementation of BERT

C++ 486 66 Updated Feb 23, 2024

Up to 10x faster strings for C, C++, Python, Rust, Swift & Go, leveraging NEON, AVX2, AVX-512, SVE, & SWAR to accelerate search, hashing, sort, edit distances, and memory ops 🦖

C 2,466 87 Updated Mar 23, 2025

Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, …

C 1,291 74 Updated Feb 26, 2025

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

C++ 1,490 102 Updated Mar 23, 2025

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 21,162 4,222 Updated Mar 18, 2025

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 5,003 488 Updated Aug 6, 2024
Next
Showing results