Stars
Browse starred repositories and topics
Sort: Recently starred
Sort options
Starred Repositories
-
To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
-
ChatGPT の 日本語の Prompt のサンプル
-
MonotaRO社内で利用されているChatGPTのSlackbot
-
Inference code for Llama models
-
-
A lightning-fast search API that fits effortlessly into your apps, websites, and workflow
-
Next.js Commerce
-
Firebase Firestore Queue System
-
A React component for Instagram like stories
-