Skip to content

docs: LLM caching strategy guide#3

Open
OfficialAbhinavSingh wants to merge 3 commits into
mainfrom
feat/docs/llm-caching-guide
Open

docs: LLM caching strategy guide#3
OfficialAbhinavSingh wants to merge 3 commits into
mainfrom
feat/docs/llm-caching-guide

Conversation

@OfficialAbhinavSingh

Copy link
Copy Markdown
Collaborator

Adds comprehensive LLM caching reference covering prompt caching (Anthropic/OpenAI), semantic caching, KV cache (vLLM/SGLang), and cache-aware
RAG. Includes implementation code for all 4 layers, decision flowchart, verified pricing, and prerequisites. Relevant to our Groq→Anthropic→OpenAI routing stack and Redis caching layer.

@OfficialAbhinavSingh OfficialAbhinavSingh mentioned this pull request Jun 2, 2026
Merged
@OfficialAbhinavSingh OfficialAbhinavSingh force-pushed the feat/docs/llm-caching-guide branch from 40e8ce5 to ff9dc7a Compare June 3, 2026 08:30
@Viscous106

Copy link
Copy Markdown
Contributor

@OfficialAbhinavSingh , this is way too much for a planner we want just a planner not a research paper please be concise with what we have to implement and why we have to implement it.

@Viscous106 Viscous106 added the documentation Improvements or additions to documentation label Jun 9, 2026
@OfficialAbhinavSingh

Copy link
Copy Markdown
Collaborator Author

@OfficialAbhinavSingh , this is way too much for a planner we want just a planner not a research paper please be concise with what we have to implement and why we have to implement it.

@Viscous106 Revised the planner for a simpler review and implementation process

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants