Fix fair-use enforcement for free-tier users with exhausted credits by beastoin · Pull Request #6084 · BasedHardware/omi

beastoin · 2026-03-27T14:42:02Z

Summary

Fixes #6083: Free-tier users who exhaust their 1200 min/month credits now follow the same graduated enforcement pipeline as abuse-detected users.

Architecture: In trigger_classifier_if_needed, free-exhausted users get a synthetic score of 1.0 (instead of the LLM classifier) and proceed through the normal none → warning → throttle → restrict escalation path. No separate enforcement path — everything flows through the existing escalate_enforcement function.

Changes:

utils/fair_use.py: Add is_free_credits_exhausted() check → synthetic score 1.0 for free-exhausted users, skipping LLM classifier. Uses deferred import pattern for classify_user_purpose (ChatOpenAI constructed at import time) and send_notification (Firebase init required).
routers/sync.py: Check DG budget before processing, record usage after. Return 429 when budget exhausted.
routers/transcribe.py: Start DG tracking proactively when soft caps trigger (before classifier result returns).

Deployment Steps

Deploy backend to production:

gh workflow run gcp_backend.yml -f environment=prod -f branch=main --repo BasedHardware/omi

Verify deployment:
- Check GKE pod rollout: kubectl rollout status deployment/backend -n prod
- Verify logs show no import errors: kubectl logs -l app=backend -n prod --tail=50
Enable fair-use for free-tier (if not already enabled):
- Env var FAIR_USE_ENABLED=true must be set on the backend deployment
- Env var FAIR_USE_RESTRICT_DAILY_DG_MS=1800000 (30 min daily DG budget for restrict-stage users)
Post-deploy verification:
- Monitor logs for fair_use: free-exhausted entries confirming synthetic score path is active
- Monitor for 429 responses from sync endpoint for restrict-stage users
- Check no errors in fair_use: error checking free credits log lines
Rollback (if needed):
- Set FAIR_USE_ENABLED=false to disable all fair-use enforcement
- Or set FAIR_USE_KILL_SWITCH=true as emergency kill switch

Test plan

147 fair-use unit tests pass
70 integration tests pass (real Redis)
L1/L2 live tests complete

Closes #6083

🤖 Generated with Claude Code

by AI for @beastoin

Free users with exhausted credits now bypass the LLM classifier score requirement and escalate on violation count alone. Adds is_free_credits_exhausted() helper and synthetic classifier path that skips the expensive LLM call. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

When a free-tier user has exhausted monthly credits, skip Deepgram transcription entirely in the sync endpoint. Audio stays on device and can be re-synced after upgrade. Saves cloud STT cost on locked conversations that would have been inaccessible anyway. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Add free_credits_dg_blocked flag to the live transcription WebSocket that gates all cloud STT sends (Deepgram, Soniox, Speechmatics, multi-channel) when a free-tier user's monthly credits are exhausted. Re-checked periodically so mid-session upgrades resume cloud STT. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

14 tests covering: escalation bypass for free exhausted users, paid user score gate preservation, is_free_credits_exhausted helper, LLM classifier skip, shorter cooldown, and synthetic event recording. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

greptile-apps · 2026-03-27T14:48:25Z

Greptile Summary

This PR fixes a gap in the fair-use enforcement system where free-tier users who exhaust their 1200 min/month credits could never be escalated because the LLM classifier scores personal conversations at 0.10\u20130.35, far below the 0.7 threshold. The fix introduces a is_free_credits_exhausted bypass that lets exhausted free users escalate on violation count alone, skips the expensive LLM call in favour of a synthetic result, and blocks cloud STT sends (Deepgram, Soniox, Speechmatics, multi-channel) at both the WebSocket and sync endpoints.\n\n- fair_use.py: New is_free_credits_exhausted() helper; passes_score_gate short-circuit in escalate_enforcement; trigger_classifier_if_needed skips LLM and uses a synthetic result for free-exhausted users with a 1h cooldown.\n- transcribe.py: free_credits_dg_blocked flag checked at session start and re-evaluated every FAIR_USE_CHECK_INTERVAL_SECONDS; gates all four STT send sites correctly.\n- sync.py: Early return with credits_exhausted: true when should_lock=True, avoiding cloud STT cost entirely.\n- P1: is_free_credits_exhausted(uid) is called unconditionally before the Redis deduplication lock is checked in trigger_classifier_if_needed, introducing ~11 extra Firestore reads per hour per active free-exhausted user and defeating the lock\u2019s cost-saving purpose.\n- P2: Escalation notifications for free_exhausted users reuse misuse-language templates, which is confusing for legitimate users who simply hit their monthly quota.

Confidence Score: 4/5

Safe to merge after fixing the lock-ordering issue; the P1 is a cost/efficiency regression, not a data-loss or security risk.

The feature logic is correct and well-tested (14 new unit tests, 121 existing passing). The P1 issue defeats the Redis deduplication optimisation and introduces unbounded extra Firestore reads per active session, which will increase costs at scale and warrants a fix before merging. The P2 notification messaging issue is not blocking but should be tracked.

backend/utils/fair_use.py — specifically the ordering of the is_free_credits_exhausted call relative to the Redis lock acquisition in trigger_classifier_if_needed.

Important Files Changed

Filename	Overview
backend/utils/fair_use.py	Adds `is_free_credits_exhausted` helper and `passes_score_gate` bypass for free-tier users; `trigger_classifier_if_needed` now calls `is_free_credits_exhausted` before the Redis deduplication lock, introducing unnecessary Firestore reads on every invocation when the lock is already held.
backend/routers/sync.py	Skips Deepgram transcription and returns early when `should_lock=True`; fair-use speech recording and classifier triggering still runs before the early return, which may escalate credit-exhausted users through the enforcement pipeline unintentionally.
backend/routers/transcribe.py	Adds `free_credits_dg_blocked` gate that blocks all cloud STT providers for credit-exhausted free users; checked at session start and re-evaluated periodically; correctly guarded by `FAIR_USE_ENABLED` and `not use_custom_stt`.
backend/tests/unit/test_fair_use_free_tier.py	14 new unit tests covering escalation bypass, score-gate preservation for paid users, `is_free_credits_exhausted` cases, LLM skip, shorter cooldown, and synthetic event recording; thorough coverage of new code paths.
backend/test.sh	Adds new test file to the CI test runner script.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[Cap triggered] --> B{FAIR_USE_ENABLED?}
    B -- No --> Z[No-op]
    B -- Yes --> C[trigger_classifier_if_needed]
    C --> D[is_free_credits_exhausted ⚠️ Firestore read BEFORE lock]
    D --> E{Redis lock acquired?}
    E -- No --> F[Return — cooldown active]
    E -- Yes --> G{free_exhausted?}
    G -- Yes --> H[Synthetic result\nusage_type=free_exhausted\ncooldown=1h]
    G -- No --> I[LLM classifier\ncooldown=12h]
    H --> J[escalate_enforcement]
    I --> J
    J --> K{passes_score_gate?}
    K -- free_exhausted=True --> L[Bypass score — escalate on violation count]
    K -- misuse_score >= 0.7 --> L
    K -- Neither --> M[action=none]
    L --> N{stage transition}
    N --> O[Update Firestore + notify user]
    style D fill:#ffcccc,stroke:#cc0000

Comments Outside Diff (2)

backend/utils/fair_use.py, line 607-641 (link)

Misleading abuse-language notifications sent to legitimate credit-exhausted users

When action == 'warning' is triggered via the free_exhausted path, _send_fair_use_notification sends:

"Your speech usage is unusually high. Omi is designed for personal conversations. If this continues, transcription quality may be reduced."

This messaging was designed for content-abuse (audiobooks, podcasts) and is factually wrong for a legitimate user who has simply consumed their free 1200 min/month quota. Since the classifier result now carries usage_type='free_exhausted', the notification helper can branch on it to send a credits/upgrade message instead. Without this, users may contact support confused about why they’re being told their usage is “unusually high” when they were just having normal conversations.
backend/routers/sync.py, line 817-845 (link)

Speech tracked and classifier triggered even on the credits_exhausted early-return path

record_speech_ms, check_soft_caps, and trigger_classifier_if_needed all run before the if should_lock: early return. This means a free-exhausted user who syncs audio will have the classifier fired as an async task and then receive a credits_exhausted: true response with no transcription.

The consequence is that these users can be marched through the enforcement pipeline (warning → throttle → restrict) even though their audio is already blocked by the simpler credit-exhaustion gate. Reaching restrict (30-day block) would survive a paid-plan upgrade because the stage is stored in Firestore.

If this is intentional, a comment explaining it would help. If not, moving the fair-use block inside a not should_lock guard would prevent unintended escalation:
```
if FAIR_USE_ENABLED and total_speech_ms > 0 and not should_lock:
    record_speech_ms(uid, total_speech_ms, source='sync')
    ...
```

_{Reviews (1): Last reviewed commit: "chore: add test_fair_use_free_tier.py to..." | Re-trigger Greptile}

greptile-apps · 2026-03-27T14:48:29Z

@@ -513,7 +564,17 @@ async def trigger_classifier_if_needed(uid: str, triggered_caps: list, session_i
        return


is_free_credits_exhausted called before Redis lock — defeats deduplication

is_free_credits_exhausted(uid) (which triggers a Firestore read via users_db.get_user_valid_subscription plus a call to has_transcription_credits) is executed unconditionally before the Redis lock is ever checked. The entire point of the Redis lock was to rate-limit expensive operations — pay the DB cost at most once per cooldown window.

With this ordering, every call to trigger_classifier_if_needed (fired every FAIR_USE_CHECK_INTERVAL_SECONDS = 5 min when caps are triggered) performs the DB read regardless of whether the lock is held. For a free-exhausted user in an active session under the 1-hour cooldown, that’s ~11 wasted Firestore reads per hour instead of one — directly reversing the cost-saving intent.

The fix is to move the free_exhausted check to after the lock is acquired. If the lock is not acquired, exit immediately at no cost as before. Once the lock is held, call is_free_credits_exhausted to choose the synthetic vs. LLM path, and optionally use redis_client.expire(lock_key, 3600) to shorten the TTL for the free-exhausted case.

…orcement Reviewer feedback: free_credits_exhausted in classifier_result dict is privilege-bearing. Now derived from is_free_credits_exhausted(uid) call instead of trusting the dict payload. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Reviewer feedback: deepgram_profile_socket.send(), soniox_profile_socket.send(), and send_initial_file_path() were not gated by free_credits_dg_blocked. Free users with exhausted credits could still send audio during speech profile phase. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Reviewer feedback: tests now mock is_free_credits_exhausted() directly since escalate_enforcement no longer reads free_credits_exhausted from the classifier_result dict. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

beastoin · 2026-03-27T14:49:45Z

Review fixes applied

Addressed all 3 reviewer items:

Trust issue fixed: escalate_enforcement() now calls is_free_credits_exhausted(uid) directly instead of reading free_credits_exhausted from the classifier_result dict. The bypass is derived from trusted account state. (d12b963)
Profile-phase sends gated: Added free_credits_dg_blocked guard to deepgram_profile_socket.send(), soniox_profile_socket.send(), and send_initial_file_path(). All STT send paths are now covered. (b6b6748)
Tests updated: Escalation tests now mock is_free_credits_exhausted() directly. All 14 tests pass. (7827834)

by AI for @beastoin

…DB reads The Firestore subscription check was running before the cooldown lock, causing unnecessary reads on every 5-minute cap trigger even when the function would return immediately due to an active lock. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

beastoin · 2026-03-27T14:56:30Z

Review iteration 2 fix

Firestore read optimization — is_free_credits_exhausted(uid) was called before the Redis cooldown lock in trigger_classifier_if_needed(). This caused unnecessary Firestore subscription reads every 5 minutes (on each cap-trigger) even when the function would immediately return due to an active lock.

Fix: Moved the call after the lock is acquired. The lock is first set with the default 12h TTL, then if the user is free-exhausted, shortened to 1h via redis.expire(). Test updated to verify the expire() call instead of the set() TTL.

Commit: 6d913a4b7

by AI for @beastoin

The free_credits_dg_blocked recheck was gated on user_has_credits, which is refreshed later in the loop. This created a 1-tick delay (up to 5min) where audio would still be sent to DG after credits crossed zero. Now calls is_free_credits_exhausted() directly on every fair-use check interval, independent of the local credits cache timing. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

beastoin · 2026-03-27T15:00:29Z

Review iteration 3 fix

Mid-session credit exhaustion timing gap — free_credits_dg_blocked was gated on user_has_credits, which refreshes later in the loop. This created a 1-tick delay (up to 5 minutes) where audio would still be sent to DG after credits crossed zero.

Fix: Removed the user_has_credits dependency. Now calls is_free_credits_exhausted() directly on every fair-use check interval, independent of local credits cache timing. Also added a log message for when credits are restored (DG unblocked).

Commit: b537fb073

by AI for @beastoin

…on tests Addresses tester feedback: - Add classifier spoof negative test (classifier dict cannot bypass score gate) - Add sync credits-exhausted response structure test - Add is_free_credits_exhausted/should_lock equivalence test - Strengthen non-free classifier test with proper assertions - Fix mid-session DG gate to not depend on user_has_credits timing Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

beastoin · 2026-03-27T15:17:27Z

PR ready for merge

All checkpoints passed:

CP7: Codex reviewer approved (PR_APPROVED_LGTM, iteration 5)
CP8: Codex tester approved (TESTS_APPROVED) — 17 unit tests covering escalation pipeline, credit exhaustion detection, classifier bypass, spoof protection, sync early-return, and trigger cooldown

Awaiting explicit merge approval.

by AI for @beastoin

…throttle record_dg_usage_ms now accepts both restrict-stage and free-tier budgets. is_dg_budget_exhausted accepts limit_ms override for free-tier daily cap. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…ribe.py Remove separate free_credits_dg_blocked flag. Free-exhausted users now use the same fair_use_dg_budget_exhausted mechanism with FAIR_USE_FREE_DAILY_DG_MS (30 min/day default), consistent with restrict-stage daily budget approach. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…ted users Instead of skipping all Deepgram processing when credits exhausted, check daily DG budget (FAIR_USE_FREE_DAILY_DG_MS). Allows up to 30 min/day of cloud STT for free users, consistent with live transcription throttle. Records DG usage after segment processing for budget tracking. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Replace TestSyncCreditsExhaustedEarlyReturn with TestSyncDgBudgetGate. Add TestDgBudgetFreeTier (limit_ms parameter, budget exhaustion checks). Add TestRecordDgUsageMsFreeTier (recording with only free budget configured). 23 tests total. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…exhausted_restrict (#6083) Free-exhausted users now go directly to restrict stage (skipping graduated escalation and LLM classifier). Auto-clears when credits return. Removes FAIR_USE_FREE_DAILY_DG_MS — reuses FAIR_USE_RESTRICT_DAILY_DG_MS. Simplifies is_dg_budget_exhausted (no limit_ms parameter). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Replace dual-path DG budget logic with ensure_free_exhausted_restrict(). Remove all FAIR_USE_FREE_DAILY_DG_MS and is_free_credits_exhausted refs. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Use ensure_free_exhausted_restrict() instead of dual-path DG budget logic. Remove is_free_credits_exhausted and FAIR_USE_FREE_DAILY_DG_MS imports. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Tests cover: ensure_free_exhausted_restrict (set/auto-clear/noop), classifier skip for free-exhausted, escalation score gate always required, simplified DG budget (no limit_ms), record_dg_usage_ms guards. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

200 could confuse the mobile app into thinking the sync succeeded. 429 (Too Many Requests) correctly signals a rate/budget limit hit, and the app handles non-200/207 gracefully by stopping the batch. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…usted (#6083) - ensure_free_exhausted_restrict: respect FAIR_USE_KILL_SWITCH and FAIR_USE_EXEMPT_UIDS - is_hard_restricted: skip restrict_reason='free_exhausted' (DG budget only, not hard block) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…6083) - Only record DG usage when user is in restrict stage (not unrestricted users) - Derive credits_exhausted from real subscription state (should_lock) not hardcoded True Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…6083) Tests: kill-switch returns none, exempt UID returns none, free-exhausted restrict not hard-blocked, abuse-restrict still hard-blocked. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…#6083) Add fair_use_track_dg_usage flag set only when user is in restrict stage. Prevents unrestricted users from pre-consuming the daily DG budget before becoming restricted later the same UTC day. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Pre-charge the request's speech duration to the DG budget before processing segments. If the pre-charge busts the budget, return 429 immediately. Prevents arbitrarily long uploads from overshooting the daily cap. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…eeded (#6083) Check lock first (cheap Redis op), then do Firestore read for free-exhausted. Prevents expensive Firestore/subscription reads when lock rejects the run. Release lock after early return for free-exhausted path. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…6083) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…cycle (#6083) Save previous_stage when setting restrict for free-exhausted. Restore to previous_stage (not 'none') when credits return, so abuse escalation (warning/throttle) is not wiped by credit exhaustion cycle. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

) Verify: warning→restrict(free_exhausted)→warning on credits restore. Verify: previous_stage saved on set, restored on clear. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…6083) Fix inlined import rule violation: import subscription helpers at module top level instead of lazy getter pattern. Keep classify_user_purpose and send_notification lazy due to import-time side effects. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Patch subscription helpers on fair_use_mod directly instead of _subscription_mod, matching the new top-level import binding. Remove stale _has_transcription_credits/_is_paid_plan resets. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…ct (#6083) Remove ensure_free_exhausted_restrict() and the separate free-exhausted path. Free users who hit monthly limits now get a synthetic classifier score of 1.0 in trigger_classifier_if_needed, following the same graduated escalation (none → warning → throttle → restrict) as abuse detection. Remove restrict_reason/previous_stage fields and is_hard_restricted free_exhausted bypass. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Replace ensure_free_exhausted_restrict with get_enforcement_stage since free-exhausted users now go through normal graduated escalation. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Replace ensure_free_exhausted_restrict tests with tests verifying synthetic score 1.0 flows through normal escalation pipeline. Test full graduation: none→warning, warning→throttle, throttle→restrict. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…#6083) Eliminate all in-function imports per CLAUDE.md rule. Tests stub these modules via sys.modules.setdefault before importing fair_use. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Move record_dg_usage_ms after successful processing to prevent charging budget on rejected/retried requests. Check exhaustion first, process, then record usage. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

When classifier fires (soft cap triggered), immediately enable DG tracking so usage during the escalation window is captured. Fixes budget-accounting gap where mid-session restrict escalation wouldn't start tracking until next 5-min poll. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Add stubs for utils.llm.fair_use_classifier and utils.notifications. Patch classify_user_purpose and send_notification directly on fair_use module instead of through getter functions. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…tern (#6083) fair_use_classifier.py constructs ChatOpenAI at import time, which raises openai.OpenAIError without OPENAI_API_KEY. Restore deferred getters (_get_classify_user_purpose, _get_send_notification) so the import only happens when the function is actually called in async runtime paths. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Patch _get_classify_user_purpose and _get_send_notification instead of the direct function names, matching the deferred import pattern. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Patch _get_classify_user_purpose instead of setting classify_user_purpose directly, matching the deferred import pattern in fair_use.py. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

- Test is_free_credits_exhausted fail-open on DB exception - Test synthetic score payload stored in fair-use event - Reset create_fair_use_event mock in setup_method for test isolation Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

beastoin · 2026-03-28T04:23:15Z

CP9 Changed-Path Coverage Checklist

Path ID	Sequence ID(s)	Changed path	Happy-path test	Non-happy-path test	L1 result + evidence	If untested: justification
P1	N/A	`fair_use.py:is_free_credits_exhausted` — free user + exhausted credits	Free user with no credits → True	Paid user → False; exception → False (fail-open)	PASS — L1 script: P1a/P1b/P1c all pass
P2	N/A	`fair_use.py:trigger_classifier_if_needed` — synthetic score branch	Free-exhausted → synthetic score 1.0, skip LLM, escalate none→warning	Non-free user → LLM classifier called	PASS — L1 script: P2a/P2b/P2c all pass, metadata verified
P3	N/A	`fair_use.py:_get_classify_user_purpose` / `_get_send_notification` — lazy import	Module imports without OPENAI_API_KEY	N/A (import-time only)	PASS — classifier module not in sys.modules at import time
P4	N/A	`sync.py:sync_local_files` — DG budget gate (check→record, 429)	Budget under limit → process normally	Budget exhausted → 429 response	PASS — P4a-P4e pass; source ordering verified (check before record)	Router can't boot standalone (Typesense/VAD deps)
P5	N/A	`transcribe.py:_stream_handler` — proactive DG tracking on cap trigger	Soft caps triggered → fair_use_track_dg_usage = True	N/A (flag set, no error path)	PASS — source verified: flag set when triggered_caps fires	Router can't boot standalone
P6	N/A	`transcribe.py:_stream_handler` — scoped DG tracking (restrict stage only)	Restrict stage → accumulate DG usage	Non-restrict → skip accumulation	PASS — source verified: conditional on fair_use_track_dg_usage flag	Router can't boot standalone

L1 Synthesis

Changed paths P1-P3 proven via direct function execution with mocked dependencies (import + call + assert). P4-P6 proven via source structure verification and unit test coverage (147 tests across 7 files). Non-happy-path behavior proven for P1 (exception fail-open), P2 (LLM not called), P4 (budget exhaustion). P4-P6 cannot be integration-tested at L1 because routers/sync.py and routers/transcribe.py require full infrastructure (Typesense, VAD model, etc.) to import — this is a pre-existing environment limitation.

by AI for @beastoin

beastoin · 2026-03-28T04:24:13Z

CP9B — L2 Integration Test Evidence

Real Redis integration tests (24 + 46 tests)

test_fair_use_live.py:     24 passed (real Redis — speech tracking, enforcement stages, DG budget)
test_fair_use_api.py:      46 passed (API endpoints, case ref format, user status)

Updated checklist — L2 results

Path ID	L2 result + evidence
P1	PASS — `is_free_credits_exhausted` tested via integration with mocked subscription DB
P2	PASS — `trigger_classifier_if_needed` synthetic score path runs through real `escalate_enforcement` with real Redis state
P3	PASS — lazy import pattern verified: module boots without OPENAI_API_KEY, classifier loaded only on call
P4	UNTESTED at full integration — sync.py requires Typesense/VAD/Stripe to import; DG budget functions (is_dg_budget_exhausted, record_dg_usage_ms) proven with real Redis in test_fair_use_live.py
P5	UNTESTED at full integration — transcribe.py WebSocket handler requires full infrastructure; flag logic verified via unit tests (147 tests)
P6	UNTESTED at full integration — same as P5; scoped tracking logic verified via source analysis and unit tests

L2 Synthesis

P1-P3 proven end-to-end with real Redis (70 integration tests pass). P4-P6 cannot be integration-tested at L2 because routers/sync.py and routers/transcribe.py require full infrastructure (Typesense, VAD model, Deepgram, Stripe) to even import — this is a pre-existing VPS environment limitation, not related to our changes. The DG budget functions themselves are proven with real Redis. L3 (dev GKE) would be needed for full end-to-end router-level testing; however, CP3 classified level3_required=false since these changes don't touch cluster config or Helm charts.

by AI for @beastoin

beastoin · 2026-03-28T04:29:31Z

lgtm

beastoin and others added 5 commits March 27, 2026 14:40

chore: add test_fair_use_free_tier.py to test.sh (#6083)

ee87ea9

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

greptile-apps Bot reviewed Mar 27, 2026

View reviewed changes

beastoin and others added 3 commits March 27, 2026 14:49

beastoin and others added 13 commits March 27, 2026 23:42

Update transcribe.py for direct-restrict architecture (#6083)

a970f0f

Replace dual-path DG budget logic with ensure_free_exhausted_restrict(). Remove all FAIR_USE_FREE_DAILY_DG_MS and is_free_credits_exhausted refs. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Update sync.py for direct-restrict architecture (#6083)

d714771

Use ensure_free_exhausted_restrict() instead of dual-path DG budget logic. Remove is_free_credits_exhausted and FAIR_USE_FREE_DAILY_DG_MS imports. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Update test to expect 429 for DG budget exhausted sync response (#6083)

725f135

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Add tests for kill-switch, exempt UIDs, and hard-block interaction (#…

73f2882

…6083) Tests: kill-switch returns none, exempt UID returns none, free-exhausted restrict not hard-blocked, abuse-restrict still hard-blocked. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

beastoin and others added 20 commits March 28, 2026 01:54

Update test: free-exhausted now acquires lock before Firestore check (#…

0604431

…6083) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Add test for abuse stage preservation across free-exhausted cycle (#6083

ff0330a

) Verify: warning→restrict(free_exhausted)→warning on credits restore. Verify: previous_stage saved on set, restored on clear. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Update sync DG budget gate to use get_enforcement_stage (#6083)

a1046d5

Replace ensure_free_exhausted_restrict with get_enforcement_stage since free-exhausted users now go through normal graduated escalation. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Update transcribe DG budget gate to use get_enforcement_stage (#6083)

0eba700

Replace ensure_free_exhausted_restrict with get_enforcement_stage since free-exhausted users now go through normal graduated escalation. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Move classify_user_purpose and send_notification to top-level imports (…

cb97bb1

…#6083) Eliminate all in-function imports per CLAUDE.md rule. Tests stub these modules via sys.modules.setdefault before importing fair_use. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Fix sync DG budget: check before processing, record after (#6083)

43d71bd

Move record_dg_usage_ms after successful processing to prevent charging budget on rejected/retried requests. Check exhaustion first, process, then record usage. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Update test_fair_use_async mocks for lazy import getters (#6083)

42c0c7b

Patch _get_classify_user_purpose and _get_send_notification instead of the direct function names, matching the deferred import pattern. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Update test_fair_use_free_tier mocks for lazy import getters (#6083)

f4e9304

Patch _get_classify_user_purpose instead of setting classify_user_purpose directly, matching the deferred import pattern in fair_use.py. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

beastoin merged commit 85c1aa7 into main Mar 28, 2026
2 checks passed

beastoin deleted the fix/fair-use-free-tier-6083 branch March 28, 2026 04:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix fair-use enforcement for free-tier users with exhausted credits#6084

Fix fair-use enforcement for free-tier users with exhausted credits#6084
beastoin merged 44 commits intomainfrom
fix/fair-use-free-tier-6083

beastoin commented Mar 27, 2026 •

edited

Loading

Uh oh!

greptile-apps Bot commented Mar 27, 2026 •

edited

Loading

Comments Outside Diff (2)

Uh oh!

greptile-apps Bot Mar 27, 2026

Uh oh!

beastoin commented Mar 27, 2026

Uh oh!

beastoin commented Mar 27, 2026

Uh oh!

beastoin commented Mar 27, 2026

Uh oh!

beastoin commented Mar 27, 2026

Uh oh!

beastoin commented Mar 28, 2026

Uh oh!

beastoin commented Mar 28, 2026

Uh oh!

Uh oh!

beastoin commented Mar 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		@@ -513,7 +564,17 @@ async def trigger_classifier_if_needed(uid: str, triggered_caps: list, session_i
		return

Conversation

beastoin commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Deployment Steps

Test plan

Uh oh!

greptile-apps Bot commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 4/5

Important Files Changed

Flowchart

Comments Outside Diff (2)

Uh oh!

greptile-apps Bot Mar 27, 2026

Choose a reason for hiding this comment

Uh oh!

beastoin commented Mar 27, 2026

Review fixes applied

Uh oh!

beastoin commented Mar 27, 2026

Review iteration 2 fix

Uh oh!

beastoin commented Mar 27, 2026

Review iteration 3 fix

Uh oh!

beastoin commented Mar 27, 2026

PR ready for merge

Uh oh!

beastoin commented Mar 28, 2026

CP9 Changed-Path Coverage Checklist

L1 Synthesis

Uh oh!

beastoin commented Mar 28, 2026

CP9B — L2 Integration Test Evidence

Real Redis integration tests (24 + 46 tests)

Updated checklist — L2 results

L2 Synthesis

Uh oh!

Uh oh!

beastoin commented Mar 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

beastoin commented Mar 27, 2026 •

edited

Loading

greptile-apps Bot commented Mar 27, 2026 •

edited

Loading