Closed
Labels
MVP (Killer feature sprint), enhancement (New feature or request)
Description
Problem
Every query hits the LLM API, which costs money and requires an internet connection. GPTCache enables a 70% cost reduction.
Solution
- Cache responses with semantic similarity matching
- cortex cache stats - show hit rate, savings
- cortex --offline flag for cached-only mode
- SQLite backend, LRU eviction
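
To make the proposal concrete, here is a rough sketch of how the pieces listed above could fit together. It is not GPTCache's API and not the project's implementation: the `SemanticCache` class, the `answer` helper, the table layout, the 0.8 threshold, and the toy bag-of-words `embed` function (a stand-in for a real embedding model) are all assumptions made for illustration. It uses only the Python standard library.

```python
import json
import math
import sqlite3
from collections import Counter


def embed(text):
    """Toy bag-of-words embedding; a real build would use an embedding model."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


class SemanticCache:
    """Hypothetical SQLite-backed response cache with LRU eviction."""

    def __init__(self, path=":memory:", max_entries=1000, threshold=0.8):
        self.db = sqlite3.connect(path)
        self.db.execute(
            "CREATE TABLE IF NOT EXISTS cache ("
            "id INTEGER PRIMARY KEY, prompt TEXT, embedding TEXT, "
            "response TEXT, last_used INTEGER)"
        )
        self.max_entries = max_entries
        self.threshold = threshold  # assumed similarity cutoff for a hit
        self.hits = 0
        self.misses = 0

    def _tick(self):
        # Logical clock: one more than the most recent use recorded so far.
        return self.db.execute("SELECT COALESCE(MAX(last_used), 0) + 1 FROM cache").fetchone()[0]

    def get(self, prompt):
        """Return the response of the most similar cached prompt, or None."""
        query = embed(prompt)
        best_id, best_score, best_response = None, 0.0, None
        rows = self.db.execute("SELECT id, embedding, response FROM cache").fetchall()
        for row_id, emb_json, response in rows:
            score = cosine(query, Counter(json.loads(emb_json)))
            if score > best_score:
                best_id, best_score, best_response = row_id, score, response
        if best_id is not None and best_score >= self.threshold:
            # Mark the entry as most recently used so LRU eviction spares it.
            self.db.execute("UPDATE cache SET last_used = ? WHERE id = ?", (self._tick(), best_id))
            self.db.commit()
            self.hits += 1
            return best_response
        self.misses += 1
        return None

    def put(self, prompt, response):
        """Store a response, then evict least recently used rows over the limit."""
        self.db.execute(
            "INSERT INTO cache (prompt, embedding, response, last_used) VALUES (?, ?, ?, ?)",
            (prompt, json.dumps(embed(prompt)), response, self._tick()),
        )
        self.db.execute(
            "DELETE FROM cache WHERE id NOT IN "
            "(SELECT id FROM cache ORDER BY last_used DESC LIMIT ?)",
            (self.max_entries,),
        )
        self.db.commit()

    def stats(self):
        """Hit/miss counters for this process, as a stats command might report."""
        total = self.hits + self.misses
        return {"hits": self.hits, "misses": self.misses,
                "hit_rate": self.hits / total if total else 0.0}


def answer(cache, prompt, call_llm, offline=False):
    """Serve from cache; call the LLM only on a miss and only when online."""
    cached = cache.get(prompt)
    if cached is not None:
        return cached
    if offline:
        return None  # cached-only mode: never touch the network
    response = call_llm(prompt)
    cache.put(prompt, response)
    return response
```

In this sketch the linear scan over all rows is the simplest thing that works; a real backend would likely need a vector index once the cache grows. The savings figure could be derived from `stats()` by multiplying hits by an assumed per-call cost.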
Bounty: $65 (+ $65 bonus after funding)
Paid on merge to main.
Skills: Python, GPTCache, SQLite, Embeddings