feat: relayauth core platform — Domains 1-7 (Foundation → SDK) by khaliqgant · Pull Request #2 · AgentWorkforce/relayauth

khaliqgant · 2026-03-24T21:01:59Z

Relayauth Core Platform

Complete implementation of the relayauth identity and authorization plane — from project scaffold through SDK clients.

Stats

132 TypeScript files, ~31,800 lines
39 test files with unit + integration + E2E coverage
9 API route modules (Hono on Cloudflare Workers)
61 workflow commits (sequential, automated)

Domains Completed

✅ Domain 1: Foundation (WF007-010)

Error catalog, test helpers, dev environment (wrangler dev), contract tests

✅ Domain 2: Token System (WF011-020)

JWT signing (Ed25519), JWKS endpoint, token verification, issuance API, refresh, revocation (KV-backed), introspection, key rotation, E2E tests

✅ Domain 3: Identity Lifecycle (WF021-030)

Identity Durable Object, full CRUD (create/get/list/update), suspend, reactivate, retire, delete, lifecycle E2E tests

✅ Domain 4: Scopes & RBAC (WF031-040)

Scope parser + matcher, scope checker SDK, scope middleware, role CRUD, role assignment, policy CRUD, policy evaluation, scope inheritance, RBAC E2E tests

✅ Domain 5: API Routes (WF041-050)

Auth middleware, org CRUD, workspace CRUD, workspace membership, API key management, admin routes, rate limiting, error handling, CORS/headers, routes E2E tests

✅ Domain 6: Audit & Observability (WF051-058)

Audit logger, audit query API, audit export, retention policies, webhooks, identity activity API, dashboard stats API, audit E2E tests

✅ Domain 7: SDK & Verification (WF059-068)

SDK client (identities, tokens, roles, audit), complete verification module, Hono middleware, Express middleware, Go middleware, Python SDK, SDK E2E tests

Architecture

Runtime: Cloudflare Workers + Durable Objects + D1 + KV
Framework: Hono
Auth model: JWT (Ed25519) with scope-based authorization
Scope format: {plane}:{resource}:{action}:{path?}
Key feature: Path-scoped file access control (relayfile:fs:write:/src/*)

Try It Locally

cd ~/Projects/AgentWorkforce/relayauth
npm install
wrangler dev
# → localhost:8787

curl localhost:8787/health
curl -X POST localhost:8787/v1/identities -H 'Content-Type: application/json' -d '{"name":"test-agent","type":"agent"}'
curl localhost:8787/v1/roles
curl localhost:8787/v1/audit

What's Next (PR #3: Domains 8-12)

Domain 8: CLI (relay auth init, relay wrap, shell hook)
Domain 9: Integration (relayfile, relaycast, cloud)
Domain 10: Hosted server (wrangler deploy, staging/prod)
Domain 11: Testing & CI pipelines
Domain 12: Docs & landing page

…gnore - worker.ts: Add global CORS and request-ID middleware, auth placeholder - verify.test.ts: Expand constructor tests, add TODO roadmap for verification - .gitignore: Add Python build artifact patterns Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Extracted from autofix swarm workflow results. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- Replace broken TokenVerifier in scope middleware with inline HS256 verification - Add safeRegexTest to prevent ReDoS in policy evaluation engine - Add org isolation checks to all identity endpoints (GET/PATCH/DELETE/suspend/retire/reactivate) - Fix fail-open auth bypass in audit-webhooks (flip to fail-closed with 403) - Re-enable PyJWT built-in claim verification with leeway in Python SDK - Extract shared auth module to lib/auth.ts, deduplicate across routes Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Critical: fail-closed org boundary checks (audit-webhooks), cross-org IDOR guards (identities), ReDoS mitigation with pattern/input limits (policy-eval), scope inheritance cross-tenant fix, mass assignment lockdown, Python JWT leeway, scope info leakage removal. High: deduplicated scope matching into shared export, SQL-level pagination, scope-based authorization on identity routes. Medium: CSV injection prevention, SSRF hardening (IPv4-mapped IPv6, octal/ decimal IPs, cloud metadata), audit failure counters with sensitive-op enforcement, Go EdDSA support, SDK verify test suite (24 tests), Python scopes aligned to return False. Low: webhook schema cleanup, no-op hydrateIdentity removed, error message sanitization. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

devin-ai-integration

Devin Review found 4 new potential issues.

View 16 additional findings in Devin Review.

packages/server/src/engine/policy-evaluation.ts

devin-ai-integration · 2026-03-25T14:49:12Z

packages/sdk/src/verify.ts

+    if (claims.nbf !== undefined && claims.nbf > now) {
+      throw invalidTokenError();
+    }
+
+    if (claims.exp <= now) {
+      throw new TokenExpiredError();


🔴 Go middleware lacks clock skew leeway, causing cross-SDK token rejection

The Go middleware in relayauth.go validates nbf and exp claims with zero clock skew tolerance (lines 249-253), while both the TypeScript SDK (packages/sdk/src/verify.ts:205-209) and the Python SDK (packages/python-sdk/relayauth/verifier.py:240-243) use a 30-second leeway. This means a token that is valid according to the TypeScript/Python SDKs can be rejected by the Go middleware when server clocks differ by even 1 second. For nbf, Go uses *claims.Nbf > now (exact) vs TypeScript's claims.nbf > now + 30. For exp, Go uses claims.Exp <= now (exact) vs TypeScript's claims.exp <= now - 30. In a distributed system where relayauth tokens are verified by different services using different SDKs, this inconsistency will cause intermittent 401 errors.

Was this helpful? React with 👍 or 👎 to provide feedback.

packages/server/src/engine/audit-webhook-dispatcher.ts

packages/server/src/routes/audit-webhooks.ts

1. scope-inheritance: fix function signature mismatch (3-param vs 2-param) - resolveInheritedScopes and getInheritanceChain now accept (db, identityId) - SQL query updated to look up identity by ID alone - Fixes runtime crash (identityId.trim() on undefined) 2. audit-webhook-dispatcher: reduce retry delays for CF Workers - Changed from [10s, 60s, 300s] to [500ms, 1s, 2s] - Previous delays would exceed Worker CPU time budget 3. verify.ts: add 30s leeway to match Python SDK - nbf and exp checks now use 30s clock-skew tolerance - Consistent behavior across TypeScript and Python SDKs 4. audit-webhooks: replace global mutable tableInitialized with WeakSet - Per-db instance tracking via WeakSet<D1Database> - Handles isolate recycling correctly in CF Workers Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

devin-ai-integration

Devin Review found 1 new potential issue.

View 23 additional findings in Devin Review.

devin-ai-integration · 2026-03-25T17:10:22Z

packages/server/src/routes/audit-query.ts

+  "token.issued",
+  "token.refreshed",
+  "token.revoked",
+  "token.validated",
+  "identity.created",
+  "identity.updated",
+  "identity.suspended",
+  "identity.retired",
+  "scope.checked",
+  "scope.denied",
+  "role.assigned",
+  "role.removed",
+  "policy.created",
+  "policy.updated",
+  "policy.deleted",
+  "key.rotated",
+]);
+


🔴 Audit query API rejects extended audit actions that are actively written to the database

The AUDIT_ACTIONS set in audit-query.ts only includes the base AuditAction type values and is missing "budget.exceeded", "budget.alert", and "scope.escalation_denied". However, audit-logger.ts:46-64 and policy-evaluation.ts actively write entries with these extended actions to the audit_logs table. When a user tries to filter audit queries by these actions (e.g., GET /v1/audit?action=budget.exceeded), the parseAuditQuery function at line 124 returns a 400 error "invalid action". This also affects the audit export endpoint (audit-export.ts) which reuses the same parseAuditQuery function. As a result, budget breach events and scope escalation denial events are logged but cannot be queried or exported by action filter.

Suggested change

"token.issued",

"token.refreshed",

"token.revoked",

"token.validated",

"identity.created",

"identity.updated",

"identity.suspended",

"identity.retired",

"scope.checked",

"scope.denied",

"role.assigned",

"role.removed",

"policy.created",

"policy.updated",

"policy.deleted",

"key.rotated",

]);

export const AUDIT_ACTIONS = new Set<AuditAction | 'budget.exceeded' | 'budget.alert' | 'scope.escalation_denied'>([

"token.issued",

"token.refreshed",

"token.revoked",

"token.validated",

"identity.created",

"identity.updated",

"identity.suspended",

"identity.retired",

"scope.checked",

"scope.denied",

"role.assigned",

"role.removed",

"policy.created",

"policy.updated",

"policy.deleted",

"key.rotated",

"budget.exceeded",

"budget.alert",

"scope.escalation_denied",

]);

Was this helpful? React with 👍 or 👎 to provide feedback.

The Go middleware validated nbf and exp claims with zero tolerance, while TypeScript and Python SDKs use a 30-second leeway. This adds a matching clockSkewLeeway constant and applies it to both checks. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

budget.exceeded, budget.alert, and scope.escalation_denied are actively written by audit-logger.ts and policy-evaluation.ts but were missing from the query validation set, making them impossible to filter on. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Remove the initializedDbs WeakSet guard from ensureAuditWebhookTable. The DDL already uses CREATE TABLE/INDEX IF NOT EXISTS, making it idempotent and safe to run every time without a mutable flag, which is fragile in Cloudflare Workers environments. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

devin-ai-integration

Devin Review found 2 new potential issues.

View 27 additional findings in Devin Review.

devin-ai-integration · 2026-03-25T18:00:37Z

packages/sdk/src/verify.ts

+    if (
+      this.options?.audience &&
+      !this.options.audience.some((audience) => claims.aud.includes(audience))
+    ) {
+      throw invalidTokenError();


🔴 TokenVerifier rejects all tokens when audience is set to an empty array

In the TypeScript SDK TokenVerifier._validateClaims, the audience check at line 210 uses this.options?.audience && .... In JavaScript, an empty array [] is truthy, so new TokenVerifier({ audience: [] }) will always throw invalidTokenError() since [].some(...) returns false. This is a behavioral trap: consumers expect an empty audience to mean "skip audience validation" (which is what the Python SDK does, since [] is falsy in Python). One test (packages/sdk/src/__tests__/verify.test.ts:113) even validates audience: [] is accepted, but the verify path would reject every token.

Comparison with Python SDK

Python verifier at packages/python-sdk/relayauth/verifier.py:247:

if self.options.audience and not any(aud in claims.aud for aud in self.options.audience):

In Python, [] is falsy, so empty audience skips validation — the correct behavior.

Suggested change

if (

this.options?.audience &&

!this.options.audience.some((audience) => claims.aud.includes(audience))

) {

throw invalidTokenError();

if (

this.options?.audience &&

this.options.audience.length > 0 &&

!this.options.audience.some((audience) => claims.aud.includes(audience))

) {

throw invalidTokenError();

}

Was this helpful? React with 👍 or 👎 to provide feedback.

devin-ai-integration · 2026-03-25T18:00:38Z

packages/sdk/src/verify.ts

+    if (this.options?.maxAge !== undefined && claims.iat + this.options.maxAge < now) {
+      throw new TokenExpiredError();


🟡 maxAge validation inconsistently applies clock-skew leeway across SDKs

The TypeScript SDK _validateClaims at line 217 checks claims.iat + this.options.maxAge < now without applying the 30-second leeway used for exp and nbf checks. This means a token issued exactly maxAge seconds ago will be rejected due to clock skew, even though exp and nbf both tolerate 30 seconds of drift. The Python SDK at packages/python-sdk/relayauth/verifier.py:249 applies leeway (claims.iat + self.options.max_age < now - leeway), making the two SDKs behave differently for the same token under the same clock conditions. Since maxAge is an expiry-like check, it should be consistent with the exp leeway.

Suggested change

if (this.options?.maxAge !== undefined && claims.iat + this.options.maxAge < now) {

throw new TokenExpiredError();

if (this.options?.maxAge !== undefined && claims.iat + this.options.maxAge < now - leeway) {

throw new TokenExpiredError();

Was this helpful? React with 👍 or 👎 to provide feedback.

khaliqgant added 30 commits March 24, 2026 20:56

wf011: jwt-signing

76d0292

wf012: jwks-endpoint

3735f46

wf013: token-verification

02f79ef

wf014: token-issuance-api

e5e7f18

wf015: token-refresh-api

4b38c0c

wf016: token-revocation-api

5e88f59

wf017: revocation-kv

ba69969

wf018: token-introspect-api

ee7302e

wf019: key-rotation

f4d5a83

wf020: token-system-e2e

9292777

wf021: identity-do

ca908e9

wf022: create-identity-api

dbd43bc

wf023: get-identity-api

701ed1b

wf024: list-identities-api

4ba03b5

wf025: update-identity-api

23b4ffd

wf026: suspend-identity-api

7e6b209

wf027: reactivate-identity-api

edf6da3

wf028: retire-identity-api

c4c2e3e

wf029: delete-identity-api

d8558b0

fix: remove garbage text from WF030 workflow file

d1694ef

wf030: identity-lifecycle-e2e

8830292

wf031: scope-parser

1c9f4cf

wf032: scope-matcher

ce03753

wf033: scope-checker-sdk

9c9650d

wf034: scope-middleware

361761b

wf035: role-crud-api

867fdba

wf036: role-assignment-api

4c48bcb

wf037: policy-crud-api

8c9b204

wf038: policy-evaluation

13938eb

wf039: scope-inheritance

18cad88

khaliqgant added 11 commits March 25, 2026 09:21

wf039: scope-inheritance

4d984f8

wf060: sdk-client-tokens

beb7712

wf061: sdk-client-roles

fdb3af2

wf040: rbac-e2e

ee7fd8e

wf062: sdk-client-audit

489ed20

wf063: sdk-verify-complete

9c25d42

wf064: sdk-middleware-hono

6ff49d7

wf065: sdk-middleware-express

6cd5a21

wf066: go-middleware

0cc073c

wf067: python-sdk

91ab301

wf068: sdk-e2e

1cd9e50

khaliqgant marked this pull request as ready for review March 25, 2026 10:14

khaliqgant changed the title ~~feat: relayauth implementation (Domains 1-12, WF007-100)~~ feat: relayauth core platform — Domains 1-7 (Foundation → SDK) Mar 25, 2026

This comment was marked as resolved.

Sign in to view

khaliqgant and others added 3 commits March 25, 2026 14:32

fix: add auth middleware helper from autofix review

a2bf5bf

Extracted from autofix swarm workflow results. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

This comment was marked as resolved.

Sign in to view

devin-ai-integration bot reviewed Mar 25, 2026

View reviewed changes

devin feedback

f05ba6a

This comment was marked as resolved.

Sign in to view

devin-ai-integration bot reviewed Mar 25, 2026

View reviewed changes

khaliqgant and others added 3 commits March 25, 2026 18:55

khaliqgant merged commit 36e3053 into main Mar 25, 2026

devin-ai-integration bot reviewed Mar 25, 2026

View reviewed changes

khaliqgant mentioned this pull request Mar 25, 2026

feat: relayauth wave 3 — landing page + discovery ecosystem (WF100-110) #4

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: relayauth core platform — Domains 1-7 (Foundation → SDK)#2

feat: relayauth core platform — Domains 1-7 (Foundation → SDK)#2
khaliqgant merged 76 commits intomainfrom
domain/auto-workflows

khaliqgant commented Mar 24, 2026 •

edited

Loading

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

devin-ai-integration bot left a comment

Uh oh!

Uh oh!

devin-ai-integration bot Mar 25, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

This comment was marked as resolved.

Uh oh!

devin-ai-integration bot left a comment

Uh oh!

devin-ai-integration bot Mar 25, 2026

Uh oh!

devin-ai-integration bot left a comment

Uh oh!

devin-ai-integration bot Mar 25, 2026

Uh oh!

devin-ai-integration bot Mar 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		if (this.options?.maxAge !== undefined && claims.iat + this.options.maxAge < now) {
		throw new TokenExpiredError();

Conversation

khaliqgant commented Mar 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Relayauth Core Platform

Stats

Domains Completed

✅ Domain 1: Foundation (WF007-010)

✅ Domain 2: Token System (WF011-020)

✅ Domain 3: Identity Lifecycle (WF021-030)

✅ Domain 4: Scopes & RBAC (WF031-040)

✅ Domain 5: API Routes (WF041-050)

✅ Domain 6: Audit & Observability (WF051-058)

✅ Domain 7: SDK & Verification (WF059-068)

Architecture

Try It Locally

What's Next (PR #3: Domains 8-12)

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

devin-ai-integration bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

devin-ai-integration bot Mar 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

This comment was marked as resolved.

Uh oh!

devin-ai-integration bot left a comment

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration bot Mar 25, 2026

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration bot left a comment

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration bot Mar 25, 2026

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration bot Mar 25, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

khaliqgant commented Mar 24, 2026 •

edited

Loading

devin-ai-integration bot Mar 25, 2026 •

edited

Loading