Skip to content

docs: free tier cost management guide#4

Open
OfficialAbhinavSingh wants to merge 2 commits into
mainfrom
feat/docs/free-tier-cost-management
Open

docs: free tier cost management guide#4
OfficialAbhinavSingh wants to merge 2 commits into
mainfrom
feat/docs/free-tier-cost-management

Conversation

@OfficialAbhinavSingh

Copy link
Copy Markdown
Collaborator

Adds production-level free tier cost management reference covering 5-layer cost defense stack, model pricing (Groq/Haiku/Sonnet/Opus),
Claude.ai-style UX pattern (4 states, upgrade prompts, at-capacity handling), token budget ledger, shared cross-org semantic cache, context compression, Batch API 50% discount, and full implementation code.
Target: under .50/org/month for free tier.

@OfficialAbhinavSingh OfficialAbhinavSingh force-pushed the feat/docs/free-tier-cost-management branch from 9134e59 to 002c09a Compare June 3, 2026 13:04
@Viscous106 Viscous106 added the documentation Improvements or additions to documentation label Jun 9, 2026
@Viscous106 Viscous106 self-assigned this Jun 9, 2026
@Viscous106

Copy link
Copy Markdown
Contributor

@OfficialAbhinavSingh , Kindly have a research over graphify tool it help in token reduction as well.

@@ -0,0 +1,177 @@
# Free Tier Cost Management — Mergit

> Target: **< $0.50/org/month** · Without optimization: ~$27/org/month · Required reduction: **98%**

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What is this data based on ?

Comment on lines +7 to +19
## Model Pricing (verified June 2026)

| Model | Input $/M | Output $/M | Use for |
|-------|----------|-----------|---------|
| groq/llama-4-maverick | $0.20 | $0.60 | Free tier primary (Researcher/Writer) |
| groq/llama-3.1-8b | $0.05 | $0.08 | Summarization only |
| claude-haiku-4-5 | $0.80 | $4.00 | Free tier Coder role |
| claude-sonnet-4-6 | $3.00 | $15.00 | Startup tier |
| claude-opus-4-8 | $15.00 | $75.00 | Enterprise only |

**Anthropic prompt cache:** read = **0.1×** · write 5-min = 1.25× · write 1-hour = 2×
**Anthropic Batch API:** **50% off** all tokens (free tier always uses this)
**Groq:** no prompt caching support

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This planning is good for the free tier users but what would we do for the pro users that are paying for the service. As they will have access to change between models depending on their needs.


## Tier Limits

| | Free | Startup | Enterprise |

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

not startup we are to name it pro plan

## 5-Layer Cost Stack

```
Layer 0 Anthropic prefix cache cache_control: ephemeral on system prompt

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

avoid usnig caching of other provider that way we wont be able to use the data that our customer is giving for our improvement purpose.

Layer 0 Anthropic prefix cache cache_control: ephemeral on system prompt
All free orgs share one prefix → 0.1× input cost after first write

Layer 1 Model routing by role Groq for Researcher/Writer, Haiku for Coder

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How the hell is haiku better in coding part ??

@Viscous106 Viscous106 left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@OfficialAbhinavSingh There is lot of ai slop in this pr kindly refrain from using ai for all things try to filter out things that are not true by reviewing your own changes before pushing or opening a pr .

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants