You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Evaluate routing simple queries (single-chunk paraphrase, meta-FAQ lookup, word definitions) to Apple Foundation Models on supported iOS devices (iOS 18+, Apple Silicon). Formally deferred from v1 — this is a cost-optimization pass.
Trigger for this work
Cloud AI spend > $5K/month (Moderate Y3 territory) OR material Anthropic pricing change.
Do not start this until trigger is hit — premature optimization before usage data is real.
Scope
1. Feasibility audit
iOS version distribution in CS user base (need ≥40% on iOS 18+ for meaningful offload)
Apple Foundation Models API surface stability (verify at time of start)
Parent: #1446 (Epic: AI Study Partner)
Phase: 5 (Polish & Cost Optimization)
Evaluate routing simple queries (single-chunk paraphrase, meta-FAQ lookup, word definitions) to Apple Foundation Models on supported iOS devices (iOS 18+, Apple Silicon). Formally deferred from v1 — this is a cost-optimization pass.
Trigger for this work
Cloud AI spend > $5K/month (Moderate Y3 territory) OR material Anthropic pricing change.
Do not start this until trigger is hit — premature optimization before usage data is real.
Scope
1. Feasibility audit
2. Routing logic update
Client-side reasoning router gains a new branch:
Query classification:
3. Quality parity gate
Run shadow comparison for 2 weeks:
Pass criteria:
4. Gradual rollout
Passing shadow → ship to 10% of eligible iOS users → monitor → expand to 100% if quality metrics hold.
5. Success metrics
Non-goals
Acceptance criteria
Size: M
Labels: ai-partner, phase-5
Blocked by: #1468 (accuracy audit pipeline must exist first)