Cache surplus practice blocks for instant delivery #72
Merged
Conversation
When the LLM generates multiple practice_problem blocks, only the first is delivered to the learner. Extra blocks are cached and served instantly on the next continue request, eliminating wait time while the next eager generation runs in the background. Surplus entries piggyback on the existing predictive-warming invalidation via the is_predictive_warm flag.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
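A minimal sketch of how this flow could fit together, assuming a Python service. `SurplusPracticeCache`, `GeneratedContent`, and `is_predictive_warm` are named in the PR, but every field, method, and helper shown here (`put_surplus`, `pop`, `handle_continue`, `generate_practice_block`, and so on) is an illustrative assumption, not the actual implementation:

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class GeneratedContent:
    """One cached practice_problem block. Field names are assumptions."""
    lesson_id: str
    block: dict                       # the practice_problem payload
    is_predictive_warm: bool = False  # reuses the predictive-warming invalidation path


class SurplusPracticeCache:
    """Holds extra practice_problem blocks beyond the one delivered immediately.

    Sketch only: the real store, keys, and invalidation hooks are assumptions.
    """

    def __init__(self) -> None:
        self._entries: dict[str, list[GeneratedContent]] = {}

    def put_surplus(self, lesson_id: str, blocks: list[dict]) -> None:
        # Mark surplus entries as predictive-warm so the existing
        # mastery-change invalidation sweeps them up without new infrastructure.
        self._entries.setdefault(lesson_id, []).extend(
            GeneratedContent(lesson_id, b, is_predictive_warm=True) for b in blocks
        )

    def pop(self, lesson_id: str) -> Optional[GeneratedContent]:
        queue = self._entries.get(lesson_id)
        return queue.pop(0) if queue else None

    def invalidate(self, lesson_id: str) -> None:
        # Called from the same hook that clears predictive-warm content on mastery change.
        self._entries.pop(lesson_id, None)


def handle_generation(cache: SurplusPracticeCache, lesson_id: str, blocks: list[dict]) -> dict:
    """Deliver the first practice_problem block and cache the rest (assumes blocks is non-empty)."""
    first, surplus = blocks[0], blocks[1:]
    if surplus:
        cache.put_surplus(lesson_id, surplus)
    return first


def handle_continue(cache: SurplusPracticeCache, lesson_id: str) -> dict:
    """Serve a cached surplus block instantly if one exists; otherwise generate fresh."""
    cached = cache.pop(lesson_id)
    if cached is not None:
        # The next eager generation would still be kicked off in the background here.
        return cached.block
    return generate_practice_block(lesson_id)  # fall back to a fresh (slower) generation


def generate_practice_block(lesson_id: str) -> dict:
    raise NotImplementedError("placeholder for the real LLM generation call")
```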
Summary
- Surplus practice blocks are cached as `GeneratedContent` entries via `SurplusPracticeCache`
- Entries are marked `is_predictive_warm: true` to piggyback on the existing mastery-change invalidation; no changes to invalidation infrastructure needed

Test plan
🤖 Generated with Claude Code