refactor(telemetry)!: relocate module, add MCP snapshots, HITL handling, and TUI tracking by lavaman131 · Pull Request #197 · flora131/atomic

lavaman131 · 2026-02-14T23:02:40Z

Summary

Major refactoring of the telemetry system with new capabilities for TUI session tracking, MCP runtime introspection, and HITL response handling. Includes breaking changes to telemetry module location and init command behavior.

Key Changes

Telemetry System Restructuring

Module relocation: Moved telemetry from src/utils/telemetry/ to src/telemetry/ with updated import paths across the codebase
New TUI session tracking (telemetry-tui.ts): Tracks chat UI lifecycle including session start/end, message submissions, tool lifecycle, slash command execution, and user interrupts
Workflow integration (graph-integration.ts): Added fail-safe callback-based workflow telemetry tracker for graph executor with sampling support

MCP Runtime Introspection

Session API extension: Added getMcpSnapshot() method to Session interface for runtime server introspection
Client implementations:
- Claude client: Uses query.mcpServerStatus() to fetch auth status and available tools
- OpenCode client: Full implementation with tools, resources, resource templates, and auth status detection
UI integration: New /mcp command with Codex-style rendering and enhanced MCP server list component showing transport details, auth status, tools, and resources
Deepwiki restriction: Updated .mcp.json to whitelist only ask_question tool for deepwiki server

HITL Response Handling

New utilities (hitl-response.ts): Structured handling for AskUserQuestion tool responses with normalization and display text formatting
Response modes: Support for option, custom_input, chat_about_this, and declined modes
Transcript integration: Added HITL response display in transcript formatter with comprehensive test coverage

Init Command Refactoring (Breaking Change)

Variant reconciliation: Replaced SCM template copying with variant reconciliation approach
Removed --scm flag behavior: No longer copies templates; instead removes unselected SCM variants (gh-* or sl-*) from target directories
Preserved user content: User-defined or unmanaged files remain untouched during reconciliation
Windows compatibility: Eliminated need for separate sapling-phabricator-windows template variant

Agent Configuration Updates

Recency-based sorting: All research/analysis agents now sort candidate files by date prefix (newest first) before reading
Enhanced coverage: Added specs/ directory to research locator search patterns
Prioritization: Newest documents in research/docs/, research/tickets/, and specs/ are read first

Breaking Changes

Telemetry import paths: All imports of telemetry modules must update from src/utils/telemetry/ to src/telemetry/
Init command behavior: The init command no longer copies SCM template files; it reconciles variants by removing unselected ones
Session interface: Added required getMcpSnapshot() method to Session interface (returns null for unsupported clients)

Test Plan

… deepwiki tools Add recency-based file sorting strategy to analyzer and research agents so newest documents are prioritized when gathering context. Update worker agents with improved task focus guidelines. Restrict deepwiki MCP server to ask_question tool only. Add research doc on MCP tool discovery bugs. Assistant-model: Claude Code

…hots, HITL handling, and TUI tracking Relocate telemetry from src/utils/telemetry/ to src/telemetry/ and add new TUI session tracking (telemetry-tui.ts) and graph integration (graph-integration.ts) modules. Update all import paths across commands, SDK clients, and UI layers. Add MCP runtime snapshot API with getMcpSnapshot() on Session interface, implemented in both Claude and OpenCode clients, enabling server introspection for tools, resources, and auth status. Introduce mcp-output utility for Codex-style /mcp rendering and enhance mcp-server-list component with snapshot views. Add HITL response handling utilities for structured tool response records with normalization. Refactor init command to use variant reconciliation instead of SCM template copying, removing the --scm CLI flag. Integrate TUI telemetry callbacks for tool start/complete, interrupts, command execution, and message submit events. Includes tests for init reconciliation, OpenCode MCP snapshots, HITL responses, MCP output formatting, and transcript formatter HITL support. Assistant-model: Claude Code

claude · 2026-02-14T23:04:10Z

PR Review: refactor(telemetry): relocate module, add MCP snapshots, HITL handling, and TUI tracking

This is a substantial PR with multiple interconnected changes. Overall, the code is well-structured and follows good practices. Here's my detailed review:

✅ Strengths

Good test coverage - Unit tests added for init reconciliation, OpenCode MCP snapshots, HITL responses, MCP output formatting, and transcript formatter HITL support.
Clean module reorganization - Moving telemetry from src/utils/telemetry/ to src/telemetry/ improves discoverability and clarifies that telemetry is a core module.
Well-designed MCP snapshot architecture - The McpRuntimeSnapshot type and buildMcpSnapshotView() function provide a clean abstraction for MCP server introspection.
Good separation of concerns - HITL response handling is properly extracted into hitl-response.ts with clear interfaces.
Comprehensive telemetry types - The new TUI telemetry events (tui_session_start, tui_session_end, etc.) provide granular tracking without being intrusive.

🔍 Code Quality Issues

1. Inconsistent indentation in `.mcp.json` (lines 348-362 in diff)

The JSON formatting changed from 2-space to 4-space indentation. This should be consistent with the project's style.

2. Potential duplicate tool normalization (src/ui/utils/mcp-output.ts:72-75)

function normalizeToolNames(serverName: string, toolNames: string[] | undefined): string[] {
  if (!toolNames || toolNames.length === 0) return [];
  return [...new Set(toolNames.map((name) => normalizeToolName(serverName, name)).filter((name) => name.length > 0))]
    .sort((a, b) => a.localeCompare(b));
}

This is good defensive code, but the long line could be split for readability.

3. Magic string duplication (multiple files)

The HITL tool name check appears in multiple places:

const isHitlTool = tc.toolName === "AskUserQuestion"
  || tc.toolName === "question"
  || tc.toolName === "ask_user";

Consider extracting this to a utility function like isHitlToolName(name: string): boolean.

🐛 Potential Bugs

1. Possible race condition in MCP snapshot fetch (src/ui/commands/builtin-commands.ts:436-443)

let runtimeSnapshot = null;
if (context.session?.getMcpSnapshot) {
  try {
    runtimeSnapshot = await context.session.getMcpSnapshot();
  } catch {
    runtimeSnapshot = null;
  }
}

Silent catch is fine here, but logging the error in debug mode would help troubleshooting.

2. MCP server toggle state not persisted (src/ui/chat.tsx:1790)

const [mcpServerToggles, setMcpServerToggles] = useState<McpServerToggleMap>({});

Toggle state is lost on TUI restart. The message says "Changes apply to the next session" but this only means within the current process. Consider clarifying in the user-facing message or documenting this limitation.

🔒 Security Considerations

Good: Header masking - maskPairValues() properly masks sensitive header values in MCP output.
Good: Environment variable masking - Env vars are masked as API_TOKEN=***** format.
Note: Ensure telemetry events don't accidentally include sensitive data from tool inputs/outputs.

⚡ Performance Considerations

MCP snapshot fetching - getMcpSnapshot() creates a temporary query with maxTurns: 0. This is reasonable but adds API overhead for /mcp command. The caching/memoization approach could be considered for frequent calls.
Telemetry events - Events are written synchronously via appendFileSync. This is fine for low-frequency events but could impact performance if called in hot paths.

📝 Suggestions

Consider adding disabledReason to config files - Currently only set programmatically. Allowing users to document why a server is disabled could be useful.
The research doc is excellent - The research/docs/2026-02-14-mcp-tool-discovery-startup-bugs.md provides thorough documentation of the MCP tool discovery flow and identified bugs. This is valuable for future maintenance.
Init command refactor - The change from copyScmCommands() to reconcileScmVariants() is cleaner. Removing the --scm CLI flag simplifies the API.

📋 Test Plan Verification

Per the PR description:

Unit tests for init reconciliation
Unit tests for OpenCode MCP snapshots
Unit tests for HITL responses
Unit tests for MCP output formatting
Unit tests for transcript formatter HITL support
pnpm typecheck && pnpm lint && pnpm test - Should be verified in CI
Manual verification of /mcp command rendering
Manual verification of telemetry events

Summary

This is a well-executed refactoring PR that:

Improves code organization (telemetry module location)
Adds valuable MCP introspection capabilities
Improves HITL response handling with proper normalization
Adds comprehensive TUI telemetry without being invasive

The code quality is high with good test coverage. The minor issues noted above are suggestions for polish rather than blockers.

Recommendation: ✅ Approve with minor suggestions

Developer added 2 commits February 14, 2026 23:01

lavaman131 merged commit 66590e1 into main Feb 14, 2026
3 checks passed

lavaman131 deleted the lavaman131/hotfix/telemetry branch February 14, 2026 23:03

claude bot changed the title ~~refactor(telemetry): relocate module, add MCP snapshots, HITL handling, and TUI tracking~~ refactor(telemetry)!: relocate module, add MCP snapshots, HITL handling, and TUI tracking Feb 14, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor(telemetry)!: relocate module, add MCP snapshots, HITL handling, and TUI tracking#197

refactor(telemetry)!: relocate module, add MCP snapshots, HITL handling, and TUI tracking#197
lavaman131 merged 2 commits intomainfrom
lavaman131/hotfix/telemetry

lavaman131 commented Feb 14, 2026 •

edited by claude bot

Loading

Uh oh!

Uh oh!

claude bot commented Feb 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

lavaman131 commented Feb 14, 2026 • edited by claude bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Key Changes

Telemetry System Restructuring

MCP Runtime Introspection

HITL Response Handling

Init Command Refactoring (Breaking Change)

Agent Configuration Updates

Breaking Changes

Test Plan

Uh oh!

Uh oh!

claude bot commented Feb 14, 2026

PR Review: refactor(telemetry): relocate module, add MCP snapshots, HITL handling, and TUI tracking

✅ Strengths

🔍 Code Quality Issues

1. Inconsistent indentation in .mcp.json (lines 348-362 in diff)

2. Potential duplicate tool normalization (src/ui/utils/mcp-output.ts:72-75)

3. Magic string duplication (multiple files)

🐛 Potential Bugs

1. Possible race condition in MCP snapshot fetch (src/ui/commands/builtin-commands.ts:436-443)

2. MCP server toggle state not persisted (src/ui/chat.tsx:1790)

🔒 Security Considerations

⚡ Performance Considerations

📝 Suggestions

📋 Test Plan Verification

Summary

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

lavaman131 commented Feb 14, 2026 •

edited by claude bot

Loading

1. Inconsistent indentation in `.mcp.json` (lines 348-362 in diff)