docs: platform knowledge documentation + integration and e2e tests

jddunn · jddunn · commit 2f6cdcfb2118 · 2026-03-28T22:04:10.000-07:00
Add QUERY_ROUTER.md source doc with bundled platform knowledge section.
Add integration tests verifying the real platform-corpus.json (243 entries,
5 categories, keyword search). Add e2e tests exercising QueryRouter init,
classification, and retrieval with platform knowledge enabled/disabled.
diff --git a/docs/QUERY_ROUTER.md b/docs/QUERY_ROUTER.md
@@ -0,0 +1,173 @@
+AgentOS includes a `QueryRouter` that turns one user question into a three-stage pipeline:
+
+1. classify the query into tier `0` through `3`
+2. retrieve the right amount of context
+3. generate a grounded answer from that context
+
+## What Is Live Today
+
+- Tier classification uses an LLM prompt with corpus topics, recent conversation history, and optional tool names.
+- The router embeds local markdown docs into an in-memory vector store when an embedding provider is available.
+- If embeddings are unavailable or vector search fails, the router falls back to keyword search automatically.
+- Result metadata includes `tiersUsed` and `fallbacksUsed`.
+- Lifecycle events cover classification, retrieval, research, generation, and route completion.
+
+## Current Limitations
+
+The QueryRouter scaffold is ahead of the wired runtime in a few places:
+
+- `graphExpand()` is now a built-in corpus-neighborhood heuristic, not yet a true GraphRAG engine.
+- `rerank()` is now a built-in lexical heuristic reranker, not yet a cross-encoder service.
+- `deepResearch()` is now a built-in local-corpus heuristic synthesis pass, not yet a web-backed research runtime.
+- The router is useful today for query classification, vector retrieval, keyword fallback, heuristic graph expansion, heuristic reranking, heuristic local research synthesis, and grounded answer generation, but it is not yet a full GraphRAG or web-research runtime.
+
+## Host-Injected Runtime Hooks
+
+You can replace the built-in heuristic branches without forking `QueryRouter`
+by passing host-provided callbacks in the constructor:
+
+- `graphExpand(seedChunks)` for GraphRAG or relationship expansion
+- `rerank(query, chunks, topN)` for provider-backed reranking
+- `deepResearch(query, sources)` for real multi-source research
+
+When these hooks are supplied, `router.getCorpusStats()` will report the
+corresponding runtime mode as `active` instead of the built-in `heuristic`
+mode.
+
+## Example
+
+Runnable source: `packages/agentos/examples/query-router.mjs`
+
+```ts
+import { QueryRouter } from '@framers/agentos';
+
+const router = new QueryRouter({
+  knowledgeCorpus: ['./docs', './packages/agentos/docs'],
+  availableTools: ['web_search', 'deep_research'],
+});
+
+await router.init();
+
+console.log(router.getCorpusStats());
+
+const result = await router.route('How does memory retrieval work?');
+
+console.log(result.answer);
+console.log(result.classification.tier);
+console.log(result.tiersUsed);
+console.log(result.fallbacksUsed);
+console.log(result.sources);
+
+await router.close();
+```
+
+### Host-Injected Runtime Example
+
+Runnable source: `packages/agentos/examples/query-router-host-hooks.mjs`
+
+```ts
+const router = new QueryRouter({
+  knowledgeCorpus: ['./docs', './packages/agentos/docs'],
+  graphEnabled: true,
+  deepResearchEnabled: true,
+  graphExpand: async (seedChunks) => [...seedChunks, extraGraphChunk],
+  rerank: async (_query, chunks, topN) => chunks.slice(0, topN),
+  deepResearch: async (query, sources) => ({
+    synthesis: `Host-provided research for ${query}`,
+    sources: externalResearchChunks,
+  }),
+});
+
+await router.init();
+console.log(router.getCorpusStats()); // graph/deepResearch/rerank runtime modes become active
+```
+
+## Bundled Platform Knowledge
+
+The QueryRouter ships with **243 pre-built knowledge entries** that cover the entire AgentOS platform surface. These entries are auto-loaded at startup and merged into the corpus alongside your project docs — no configuration required.
+
+### What's Included
+
+| Category | Count | Examples |
+|----------|-------|---------|
+| **Tools** | 105 | All channel adapters, productivity tools, orchestration tools |
+| **Skills** | 79 | Every curated skill from the skills registry |
+| **FAQ** | 30 | "How do I add voice?", "What models are supported?", "Does AgentOS support streaming?" |
+| **API** | 14 | generateText(), streamText(), agent(), agency(), embedText(), generateImage() |
+| **Troubleshooting** | 15 | Missing API keys, model not found, embedding init failures |
+
+### How It Works
+
+Platform knowledge is loaded from `knowledge/platform-corpus.json` inside the `@framers/agentos` package. During `init()`, these entries are converted to `CorpusChunk` objects and appended to the user corpus. Both the vector index and the keyword fallback index cover platform entries, so they work regardless of whether an embedding API key is available.
+
+The platform knowledge layer sits beneath your project documentation:
+
+```
+User project docs     (your ./docs, ./guides, etc.)
+  + Platform knowledge  (243 entries — tools, skills, FAQ, API, troubleshooting)
+  + GitHub repos        (optional — indexed asynchronously after init)
+  = Complete corpus
+```
+
+This means an agent can answer questions like "What vector stores does AgentOS support?" or "How do I set up a Bluesky channel?" without any project-specific documentation — the answer comes from the bundled platform knowledge.
+
+### Configuration
+
+Platform knowledge is enabled by default. To disable it:
+
+```typescript
+const router = new QueryRouter({
+  knowledgeCorpus: ['./docs'],
+  includePlatformKnowledge: false,
+});
+```
+
+### Regenerating Platform Knowledge
+
+If you are contributing to AgentOS and need to update the bundled knowledge:
+
+```bash
+npm run build:knowledge
+```
+
+This regenerates `knowledge/platform-corpus.json` from the current tool manifests, skill registry, FAQ sources, and API documentation.
+
+## Config Notes
+
+- `knowledgeCorpus` is required.
+- `init()` throws if `knowledgeCorpus` resolves to zero readable `.md` / `.mdx` sections.
+- `availableTools` is optional and is only used to help the classifier reason about what the runtime can do.
+- `apiKey` / `baseUrl` configure classifier and generator LLM calls. When omitted, QueryRouter prefers `OPENAI_API_KEY` and falls back to `OPENROUTER_API_KEY` with the OpenRouter compatibility base URL.
+- `embeddingApiKey` / `embeddingBaseUrl` override only the embedding path when vector retrieval should use a different provider or credential. When omitted, embeddings fall back through `apiKey`, then `OPENAI_API_KEY`, then `OPENROUTER_API_KEY`.
+- `githubRepos` optionally enables non-blocking GitHub corpus indexing after `init()`. Newly indexed repo chunks are merged back into the live corpus, keyword fallback, classifier topics, and the vector index when embeddings are active.
+- `deepResearchEnabled` controls whether the tier-3 research branch is attempted; the default core implementation is a local-corpus heuristic, and hosts can still inject a real web-backed implementation.
+- `onClassification` and `onRetrieval` are hooks for consumers that want lightweight runtime integration without reading the full event stream.
+- `router.getCorpusStats()` returns a `QueryRouterCorpusStats` snapshot with configured path count, loaded chunk/topic/source counts, whether retrieval is running in `vector+keyword-fallback` or `keyword-only` mode, the embedding health field `embeddingStatus`, and the runtime-truth fields `graphRuntimeMode`, `rerankRuntimeMode`, and `deepResearchRuntimeMode`.
+- `embeddingStatus: 'active'` means the vector index initialized successfully, `'disabled-no-key'` means init stayed keyword-only because no embedding credential was available, and `'failed-init'` means embedding bootstrap was attempted but failed and the router fell back to keyword-only mode.
+- `graphRuntimeMode: 'heuristic'` means the built-in same-document / heading-overlap expansion is active; `'active'` is reserved for a future wired graph expansion service or a host-injected hook.
+- `rerankRuntimeMode: 'heuristic'` means the built-in lexical reranker is active; `'active'` is reserved for a future wired reranker service.
+- `deepResearchRuntimeMode: 'heuristic'` means the built-in local-corpus synthesis pass is active; `'active'` is reserved for a host-injected or future provider-backed research runtime.
+
+## Result Metadata
+
+`QueryResult` includes:
+
+- `classification`: the final classification result
+- `sources`: citations built from retrieved chunks
+- `tiersUsed`: the tiers actually exercised after fallbacks
+- `fallbacksUsed`: retrieval/classification fallback strategy names such as `keyword-fallback` or `research-skip`
+- `durationMs`: total end-to-end wall-clock time for classification, retrieval, and generation
+
+## Events
+
+The router records typed events for:
+
+- `classify:start`
+- `classify:complete`
+- `classify:error`
+- `retrieve:*`
+- `research:*`
+- `generate:*`
+- `route:complete`
+
+These events are intended for observability, audit trails, and future workbench/runtime inspection surfaces.
diff --git a/src/query-router/__tests__/platform-knowledge.integration.test.ts b/src/query-router/__tests__/platform-knowledge.integration.test.ts
@@ -0,0 +1,187 @@
+/**
+ * @fileoverview Integration tests for the bundled platform knowledge corpus.
+ *
+ * These tests exercise the REAL `knowledge/platform-corpus.json` file (not
+ * mocked) to verify structural integrity, category coverage, specific entry
+ * existence, and keyword-based searchability.
+ *
+ * No LLM calls or embedding APIs are needed — all assertions are against the
+ * static corpus file and the KeywordFallback engine.
+ *
+ * @module @framers/agentos/query-router/__tests__/platform-knowledge.integration
+ */
+
+import { existsSync, readFileSync } from 'node:fs';
+import { dirname, join, resolve } from 'node:path';
+import { fileURLToPath } from 'node:url';
+import { describe, expect, it, beforeAll } from 'vitest';
+
+import { KeywordFallback } from '../KeywordFallback.js';
+import type { CorpusChunk } from '../types.js';
+
+// ---------------------------------------------------------------------------
+// Locate the real platform corpus
+// ---------------------------------------------------------------------------
+
+const MODULE_DIR = dirname(fileURLToPath(import.meta.url));
+
+/** Candidate paths where the corpus file may live relative to this test file. */
+const CORPUS_CANDIDATES = [
+  // From src/query-router/__tests__/ -> knowledge/
+  resolve(MODULE_DIR, '../../../knowledge/platform-corpus.json'),
+  // From dist/query-router/__tests__/ -> knowledge/
+  resolve(MODULE_DIR, '../../../../knowledge/platform-corpus.json'),
+];
+
+/** Resolved path to the platform corpus, or null if not found. */
+const corpusPath = CORPUS_CANDIDATES.find((p) => existsSync(p)) ?? null;
+
+// ---------------------------------------------------------------------------
+// Types for raw corpus entries
+// ---------------------------------------------------------------------------
+
+interface PlatformCorpusEntry {
+  id: string;
+  heading: string;
+  content: string;
+  category: string;
+}
+
+// ---------------------------------------------------------------------------
+// Test suite
+// ---------------------------------------------------------------------------
+
+describe('Platform Knowledge Corpus — integration', () => {
+  let entries: PlatformCorpusEntry[];
+  let chunks: CorpusChunk[];
+  let fallback: KeywordFallback;
+
+  beforeAll(() => {
+    expect(corpusPath).not.toBeNull();
+    const raw = readFileSync(corpusPath!, 'utf-8');
+    entries = JSON.parse(raw) as PlatformCorpusEntry[];
+
+    // Convert to CorpusChunk format (same transform as QueryRouter.loadPlatformKnowledge)
+    chunks = entries.map((entry) => ({
+      id: entry.id,
+      heading: entry.heading,
+      content: entry.content,
+      sourcePath: `platform:${entry.category}/${entry.id}`,
+    }));
+
+    fallback = new KeywordFallback(chunks);
+  });
+
+  // =========================================================================
+  // Structural integrity
+  // =========================================================================
+
+  it('contains at least 200 entries', () => {
+    expect(entries.length).toBeGreaterThanOrEqual(200);
+  });
+
+  it('has all 5 expected categories', () => {
+    const categories = new Set(entries.map((e) => e.category));
+    expect(categories).toContain('tools');
+    expect(categories).toContain('skills');
+    expect(categories).toContain('faq');
+    expect(categories).toContain('api');
+    expect(categories).toContain('troubleshooting');
+  });
+
+  it('every entry has non-empty id, heading, content, and category', () => {
+    for (const entry of entries) {
+      expect(entry.id).toBeTruthy();
+      expect(entry.heading).toBeTruthy();
+      expect(entry.content).toBeTruthy();
+      expect(entry.category).toBeTruthy();
+    }
+  });
+
+  // =========================================================================
+  // Specific entry existence
+  // =========================================================================
+
+  it('contains the generateText() API entry', () => {
+    const match = entries.find((e) => e.id === 'api:generateText');
+    expect(match).toBeDefined();
+    expect(match!.heading).toContain('generateText');
+    expect(match!.category).toBe('api');
+  });
+
+  it('contains the "How do I add voice?" FAQ entry', () => {
+    const match = entries.find((e) => e.id === 'faq:add-voice');
+    expect(match).toBeDefined();
+    expect(match!.heading.toLowerCase()).toContain('voice');
+    expect(match!.category).toBe('faq');
+  });
+
+  it('contains the document-export tool reference', () => {
+    const match = entries.find((e) => e.id === 'tool-ref:com.framers.productivity.document-export');
+    expect(match).toBeDefined();
+    expect(match!.category).toBe('tools');
+  });
+
+  it('contains the streamText() API entry', () => {
+    const match = entries.find((e) => e.id === 'api:streamText');
+    expect(match).toBeDefined();
+    expect(match!.heading).toContain('streamText');
+    expect(match!.category).toBe('api');
+  });
+
+  it('contains the "What models are supported?" FAQ entry', () => {
+    const match = entries.find((e) => e.id === 'faq:supported-models');
+    expect(match).toBeDefined();
+    expect(match!.category).toBe('faq');
+  });
+
+  // =========================================================================
+  // Keyword fallback search
+  // =========================================================================
+
+  it('finds document-export when searching "PDF generation"', () => {
+    const results = fallback.search('PDF generation document export', 10);
+    expect(results.length).toBeGreaterThan(0);
+    const ids = results.map((r) => r.id);
+    const hasDocExport = ids.some(
+      (id) => id.includes('document-export') || id.includes('pdf')
+    );
+    expect(hasDocExport).toBe(true);
+  });
+
+  it('finds FAQ entry when searching "what models are supported"', () => {
+    const results = fallback.search('what models are supported', 10);
+    expect(results.length).toBeGreaterThan(0);
+    const ids = results.map((r) => r.id);
+    const hasFaq = ids.some((id) => id.includes('faq:'));
+    expect(hasFaq).toBe(true);
+  });
+
+  it('finds streamText API entry when searching "streaming"', () => {
+    const results = fallback.search('streaming text generation', 10);
+    expect(results.length).toBeGreaterThan(0);
+    const ids = results.map((r) => r.id);
+    const hasStream = ids.some(
+      (id) => id.includes('streamText') || id.includes('streaming')
+    );
+    expect(hasStream).toBe(true);
+  });
+
+  it('finds voice-related entries when searching "voice pipeline"', () => {
+    const results = fallback.search('voice pipeline speech recognition', 10);
+    expect(results.length).toBeGreaterThan(0);
+    const ids = results.map((r) => r.id);
+    const hasVoice = ids.some(
+      (id) => id.includes('voice') || id.includes('stt') || id.includes('tts')
+    );
+    expect(hasVoice).toBe(true);
+  });
+
+  it('returns results with valid relevance scores', () => {
+    const results = fallback.search('authentication tokens', 5);
+    for (const result of results) {
+      expect(result.relevanceScore).toBeGreaterThanOrEqual(0);
+      expect(result.relevanceScore).toBeLessThanOrEqual(1);
+    }
+  });
+});
diff --git a/tests/e2e/platform-knowledge.e2e.spec.ts b/tests/e2e/platform-knowledge.e2e.spec.ts