feat: add tool search with semantic, local, and auto search modes by shashi-stackone · Pull Request #322 · StackOneHQ/stackone-ai-node

shashi-stackone · 2026-03-03T23:10:44Z

Summary

Add searchTools(), searchActionNames(), and getSearchTool() methods to
StackOneToolSet
Implement semantic search via StackOne's search API with per-connector parallel search
Implement local hybrid search (BM25 via Orama + TF-IDF) with 0.2 * BM25 + 0.8 * TF-IDF
scoring
Add auto mode (default): tries semantic first, falls back to local on failure
Add URL deduplication in MCP client to prevent duplicate tool definitions
Update README and all examples to use new search API
Remove old utility-tools example

Test plan

searchTools() returns ranked tools for natural language queries
searchActionNames() returns action names and similarity scores
getSearchTool() returns reusable search tool for agent loops
Auto mode falls back to local when semantic API is unavailable
Semantic mode throws SemanticSearchError on failure
Local mode runs entirely in-process with no network calls
URL deduplication prevents duplicate tools from MCP

Summary by cubic

Adds semantic tool search with an auto fallback to local BM25+TF‑IDF and a reusable SearchTool for agent loops. Adds constructor-level search config and a new semantic search example; examples now use env-based setup with a topK demo.

New Features
- Added StackOneToolSet.searchTools(), searchActionNames(), and getSearchTool(); modes: semantic, local, and auto (default).
- Constructor SearchConfig (method, topK, minSimilarity) with per-call overrides; set search: null to disable.
- Per-connector parallel semantic queries via /actions/search; local search runs offline.
- Normalizes versioned action names to MCP tool names.
- Exported SearchTool, SemanticSearchClient, SemanticSearchError, and related types.
- New example: examples/search-tools.ts with semantic, agent-loop, action-name-only, local-only, and topK demos.
Migration
- Replaced the utility-tools example with search-tools.
- README/examples now read STACKONE_API_KEY and STACKONE_ACCOUNT_ID; baseUrl no longer hardcoded.
- Tests/mocks standardized on a TEST_BASE_URL; minor example command/title fixes.

^{Written for commit 8145e88. Summary will update on new commits.}

pkg-pr-new · 2026-03-03T23:11:32Z

Open in StackBlitz

npm i https://pkg.pr.new/@stackone/ai@322

commit: cdd0eaf

Copilot

Pull request overview

This PR introduces a new tool discovery/search API on StackOneToolSet, supporting semantic (cloud) search, local BM25+TF‑IDF fallback, and an auto mode that attempts semantic first. It also refactors the previous “utility tools” approach into dedicated search APIs and updates docs/examples and test mocks accordingly.

Changes:

Added searchTools(), searchActionNames(), and getSearchTool() (+ exported SearchTool and semantic search client/types).
Implemented semantic search client for /actions/search and a local hybrid search index (ToolIndex) as fallback / local-only mode.
Updated tests, mocks, README, and examples; removed the old utility-tools example.

Reviewed changes

Copilot reviewed 37 out of 37 changed files in this pull request and generated 9 comments.

Show a summary per file

File	Description
src/toolsets.ts	Adds search modes, SearchTool wrapper, semantic+local search implementations, and provider filtering via connector.
src/semantic-search.ts	Introduces `SemanticSearchClient` + `SemanticSearchError` and response/result mapping.
src/local-search.ts	Adds local hybrid BM25+TF‑IDF search index (`ToolIndex`).
src/utils/normalize.ts	Adds action name normalization bridging semantic API names to MCP tool names.
src/tool.ts	Adds `BaseTool.connector` and replaces `utilityTools()` with `Tools.getConnectors()`.
src/index.ts	Exports new search APIs and semantic search client/types.
src/toolsets.test.ts, src/semantic-search.test.ts, src/local-search.test.ts	Adds/updates tests for semantic/local/auto search and normalization.
mocks/* + mocks/constants.ts	Centralizes test base URL and updates MSW handlers to use it.
examples/*, README.md, examples/README.md	Updates documentation and examples to use the new search APIs; removes utility-tools example.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-03T23:16:46Z

src/toolsets.ts

+				console.warn(
+					`Semantic search failed (${error.message}), falling back to local BM25+TF-IDF search`,
+				);
+				return this.localSearch(query, allTools, options);


console.warn usage here will violate the repo’s no-console lint rule for non-test code (see .oxlintrc.jsonc). Please remove the console call (silent fallback) or route this through an internal logger/diagnostics hook that doesn’t rely on console.*.

Copilot · 2026-03-03T23:16:46Z

src/toolsets.ts

+			if (error instanceof SemanticSearchError) {
+				console.warn(`Semantic search failed: ${error.message}`);
+				return [];
+			}


console.warn usage here will violate the repo’s no-console lint rule for non-test code (see .oxlintrc.jsonc). Consider returning [] without logging, or expose an optional callback/event emitter for diagnostics instead of writing to stdout.

Copilot · 2026-03-03T23:16:47Z

src/toolsets.ts

+		const index = new ToolIndex(allTools.toArray());
+		const results = await index.search(
+			query,
+			options?.topK ?? 5,
+			options?.minSimilarity ?? 0.0,
+		);
+
+		const matchedNames = results.map((r) => r.name);
+		const toolMap = new Map(allTools.toArray().map((t) => [t.name, t]));
+		const filterConnectors = options?.connector
+			? new Set([options.connector.toLowerCase()])
+			: availableConnectors;
+
+		const matchedTools = matchedNames
+			.filter((name) => toolMap.has(name))
+			.map((name) => toolMap.get(name)!)
+			.filter((tool) => tool.connector && filterConnectors.has(tool.connector));
+


localSearch applies the connector filter after taking the top-K results from the full index. This can return an empty/too-small result set even when matches exist for the requested connector (because higher-scoring tools from other connectors get filtered out). Filter the tools by connector before building/searching the index, or over-fetch results and keep collecting until you have topK matches for the connector.

Copilot · 2026-03-03T23:16:47Z

src/toolsets.ts

+			const client = this.getSemanticClient();
+			const allResults: SemanticSearchResult[] = [];


In auto mode, a missing API key will currently throw ToolSetConfigError from getSemanticClient() (not a SemanticSearchError), so searchTools() won’t fall back to local search. If search: 'auto' is intended to be resilient, treat “no API key” as a semantic-unavailable condition and fall back to localSearch, reserving hard failures for search: 'semantic'.

Copilot · 2026-03-03T23:16:47Z

src/local-search.ts

+import * as orama from '@orama/orama';
+import { DEFAULT_HYBRID_ALPHA } from './consts';
+import type { BaseTool } from './tool';
+import type { ToolParameters } from './types';


Unused type import ToolParameters triggers the repo’s typescript/no-unused-vars rule. Remove the import (it’s not referenced in this module).

Suggested change

import type { ToolParameters } from './types';

Copilot · 2026-03-03T23:16:47Z

src/toolsets.ts

+		try {
+			const client = this.getSemanticClient();
+			let allResults: SemanticSearchResult[] = [];


searchActionNames() calls getSemanticClient(), which can throw ToolSetConfigError when no API key is configured. That error isn’t handled here, so the method will throw instead of returning [] as the docs/tests imply for semantic-search failures. Consider handling missing API key the same way as other semantic failures (return [] or add an explicit option to control behavior).

Copilot · 2026-03-03T23:16:48Z

src/toolsets.ts

+		const index = new ToolIndex(allTools.toArray());
+		const results = await index.search(
+			query,


localSearch rebuilds a full BM25/TF-IDF index (new ToolIndex(allTools.toArray())) on every search call. For repeated searches this is unnecessarily expensive; consider caching the index (e.g., keyed by a tools snapshot/version) or constructing it lazily once per StackOneToolSet/Tools instance.

Copilot · 2026-03-03T23:16:48Z

src/local-search.test.ts

+import { ToolIndex, type ToolSearchResult } from './local-search';
+import { BaseTool } from './tool';
+import { ParameterLocation } from './types';


Unused imports (type ToolSearchResult and ParameterLocation) will fail the repo’s typescript/no-unused-vars lint rule. Please remove them or use them in the tests.

Suggested change

import { ToolIndex, type ToolSearchResult } from './local-search';

import { BaseTool } from './tool';

import { ParameterLocation } from './types';

import { ToolIndex } from './local-search';

import { BaseTool } from './tool';

Copilot · 2026-03-03T23:16:48Z

src/toolsets.ts

+	async searchTools(query: string, options?: SearchToolsOptions): Promise<Tools> {
+		const search = options?.search ?? 'auto';
+		const allTools = await this.fetchTools({ accountIds: options?.accountIds });
+		const availableConnectors = allTools.getConnectors();
+
+		if (availableConnectors.size === 0) {
+			return new Tools([]);
+		}
+
+		// Local-only search — skip semantic API entirely
+		if (search === 'local') {
+			return this.localSearch(query, allTools, options);
+		}


searchTools is likely to exceed the repo’s configured cyclomatic complexity limit (max 7 in .oxlintrc.jsonc). Consider extracting the semantic-search path and the connector-selection logic into smaller private helpers (e.g., semanticSearchTools() / getConnectorsToSearch() / matchSemanticResultsToTools()) to keep each function under the limit and easier to test.

cubic-dev-ai

5 issues found across 37 files

Prompt for AI agents (unresolved issues)


Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name="src/semantic-search.ts">

<violation number="1" location="src/semantic-search.ts:131">
P1: Custom agent: **Flag Security Vulnerabilities**

Enforce HTTPS for semantic search requests. The constructor accepts any baseUrl and uses it for fetch calls; this allows insecure `http://` endpoints and can leak API keys in transit. Validate the scheme and reject non-HTTPS base URLs per the security rule requiring TLS.</violation>

<violation number="2" location="src/semantic-search.ts:182">
P2: Clear the timeout in a finally block so it’s always cancelled even when fetch throws; otherwise the timer can leak and fire after the request has already failed.</violation>
</file>

<file name="examples/search-tools.ts">

<violation number="1" location="examples/search-tools.ts:105">
P2: Guard the fetchTools call so it only runs when topActions is non-empty; otherwise this example fetches the entire tool catalog whenever no action exceeds the 0.7 threshold.</violation>
</file>

<file name="src/toolsets.ts">

<violation number="1" location="src/toolsets.ts:650">
P2: Bug: `localSearch` applies `topK` limit to the index search *before* filtering by `connector`, which can return fewer results than requested. When a connector filter is active, the index should fetch more results (or all) so that post-filter slicing still yields up to `topK` tools.</violation>
</file>

<file name="src/mcp-client.test.ts">

<violation number="1" location="src/mcp-client.test.ts:50">
P2: Hardcoding `http://localhost/mcp` here makes the test depend on a running local MCP server. Since no server is started in this test and `createMCPClient` performs real HTTP calls, this will fail in CI or for contributors without a local server. Consider using MSW/mocked transport or a test server fixture instead of a hardcoded localhost endpoint.</violation>
</file>

_{Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.}

cubic-dev-ai · 2026-03-03T23:32:20Z

src/semantic-search.ts

+		timeout?: number;
+	}) {
+		this.apiKey = apiKey;
+		this.baseUrl = baseUrl.replace(/\/+$/, '');


P1: Custom agent: Flag Security Vulnerabilities

Enforce HTTPS for semantic search requests. The constructor accepts any baseUrl and uses it for fetch calls; this allows insecure http:// endpoints and can leak API keys in transit. Validate the scheme and reject non-HTTPS base URLs per the security rule requiring TLS.

Prompt for AI agents

Check if this issue is valid — if so, understand the root cause and fix it. At src/semantic-search.ts, line 131: <comment>Enforce HTTPS for semantic search requests. The constructor accepts any baseUrl and uses it for fetch calls; this allows insecure `http://` endpoints and can leak API keys in transit. Validate the scheme and reject non-HTTPS base URLs per the security rule requiring TLS.</comment> <file context> @@ -0,0 +1,260 @@ + timeout?: number; + }) { + this.apiKey = apiKey; + this.baseUrl = baseUrl.replace(/\/+$/, ''); + this.timeout = timeout; + } </file context>

cubic-dev-ai · 2026-03-03T23:32:20Z

examples/search-tools.ts

+		const topActions = results.filter((r) => r.similarityScore > 0.7).map((r) => r.actionName);
+		console.log(`\nFetching tools for top actions: ${topActions.join(', ')}`);
+
+		const tools = await toolset.fetchTools({ actions: topActions });


P2: Guard the fetchTools call so it only runs when topActions is non-empty; otherwise this example fetches the entire tool catalog whenever no action exceeds the 0.7 threshold.

Prompt for AI agents

Check if this issue is valid — if so, understand the root cause and fix it. At examples/search-tools.ts, line 105: <comment>Guard the fetchTools call so it only runs when topActions is non-empty; otherwise this example fetches the entire tool catalog whenever no action exceeds the 0.7 threshold.</comment> <file context> @@ -0,0 +1,147 @@ + const topActions = results.filter((r) => r.similarityScore > 0.7).map((r) => r.actionName); + console.log(`\nFetching tools for top actions: ${topActions.join(', ')}`); + + const tools = await toolset.fetchTools({ actions: topActions }); + console.log(`Fetched ${tools.length} tools`); + } </file context>

src/semantic-search.ts

cubic-dev-ai · 2026-03-03T23:32:20Z

src/toolsets.ts

+		}
+
+		const index = new ToolIndex(allTools.toArray());
+		const results = await index.search(


P2: Bug: localSearch applies topK limit to the index search before filtering by connector, which can return fewer results than requested. When a connector filter is active, the index should fetch more results (or all) so that post-filter slicing still yields up to topK tools.

Prompt for AI agents

Check if this issue is valid — if so, understand the root cause and fix it. At src/toolsets.ts, line 650: <comment>Bug: `localSearch` applies `topK` limit to the index search *before* filtering by `connector`, which can return fewer results than requested. When a connector filter is active, the index should fetch more results (or all) so that post-filter slicing still yields up to `topK` tools.</comment> <file context> @@ -265,6 +349,326 @@ export class StackOneToolSet { + } + + const index = new ToolIndex(allTools.toArray()); + const results = await index.search( + query, + options?.topK ?? 5, </file context>

cubic-dev-ai · 2026-03-03T23:32:20Z

src/mcp-client.test.ts

 test('createMCPClient can connect and list tools from MCP server', async () => {
 	await using mcpClient = await createMCPClient({
-		baseUrl: 'https://api.stackone-dev.com/mcp',
+		baseUrl: 'http://localhost/mcp',


P2: Hardcoding http://localhost/mcp here makes the test depend on a running local MCP server. Since no server is started in this test and createMCPClient performs real HTTP calls, this will fail in CI or for contributors without a local server. Consider using MSW/mocked transport or a test server fixture instead of a hardcoded localhost endpoint.

Prompt for AI agents

Check if this issue is valid — if so, understand the root cause and fix it. At src/mcp-client.test.ts, line 50: <comment>Hardcoding `http://localhost/mcp` here makes the test depend on a running local MCP server. Since no server is started in this test and `createMCPClient` performs real HTTP calls, this will fail in CI or for contributors without a local server. Consider using MSW/mocked transport or a test server fixture instead of a hardcoded localhost endpoint.</comment> <file context> @@ -47,7 +47,7 @@ test('createMCPClient provides asyncDispose for cleanup', async () => { test('createMCPClient can connect and list tools from MCP server', async () => { await using mcpClient = await createMCPClient({ - baseUrl: 'https://api.stackone-dev.com/mcp', + baseUrl: 'http://localhost/mcp', headers: { Authorization: `Basic ${Buffer.from('test-key:').toString('base64')}`, </file context>

willleeney

lgtm

Tool search with semantic and auto mode

d982a63

Copilot AI review requested due to automatic review settings March 3, 2026 23:10

shashi-stackone requested a review from a team as a code owner March 3, 2026 23:10

Copilot started reviewing on behalf of shashi-stackone March 3, 2026 23:11 View session

Copilot AI reviewed Mar 3, 2026

View reviewed changes

Fix CI Lint issues

e320318

cubic-dev-ai bot reviewed Mar 3, 2026

View reviewed changes

shashi-stackone added 4 commits March 5, 2026 10:14

Cubic fixes

8d0e6ff

refactor earch to make aligned with defender

c30c24e

Fix CI Lint issues

13b8c8d

Add topk example

8145e88

willleeney approved these changes Mar 6, 2026

View reviewed changes

shashi-stackone merged commit feefdc5 into main Mar 6, 2026
18 checks passed

shashi-stackone deleted the semantic_search_12112 branch March 6, 2026 09:28

github-actions bot mentioned this pull request Mar 6, 2026

chore(main): release 2.4.0 #323

Merged

		const client = this.getSemanticClient();
		const allResults: SemanticSearchResult[] = [];

Conversation

shashi-stackone commented Mar 3, 2026 • edited by cubic-dev-ai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Summary by cubic

Uh oh!

pkg-pr-new bot commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai bot Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai bot Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

cubic-dev-ai bot Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai bot Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

willleeney left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

shashi-stackone commented Mar 3, 2026 •

edited by cubic-dev-ai bot

Loading

pkg-pr-new bot commented Mar 3, 2026 •

edited

Loading

cubic-dev-ai bot Mar 3, 2026 •

edited

Loading

cubic-dev-ai bot Mar 3, 2026 •

edited

Loading

cubic-dev-ai bot Mar 3, 2026 •

edited

Loading

cubic-dev-ai bot Mar 3, 2026 •

edited

Loading