feat(sdk): default merge events in SDK by namrataghadi-galileo · Pull Request #155 · agentcontrol/agent-control

namrataghadi-galileo · 2026-03-31T19:29:13Z

Summary

This PR makes the SDK the single owner of ControlExecutionEvent creation and emission, moving all observability event reconstruction from the server to the SDK. The /api/v1/evaluation endpoint now returns only evaluation semantics, keeping evaluation and observability concerns cleanly separated.

Key changes:

SDK reconstructs both local and server control execution events after receiving the lightweight EvaluationResponse
Server /evaluation endpoint is now evaluation-only — no longer builds or ingests observability events
Events are emitted exclusively through the SDK's existing observability pipeline (batcher → /api/v1/observability/events → OSS storage)
Event identity consistency preserved using ControlDefinition.observability_identity() for composite conditions

Behavior

When SDK observability is enabled

check_evaluation_with_local() reconstructs local + server events in the SDK and enqueues one combined batch
check_evaluation() reconstructs server events in the SDK and enqueues through the built-in pipeline
Trace/span correlation preserved through the tracing resolver and reconstructed control metadata

When SDK observability is disabled

Evaluation still works normally
No control-execution events are created or emitted

Error handling

If local controls succeed but the server request/parsing fails, local events are still enqueued before the error is re-raised
Safe failure handling ensures observability data isn't lost

What changed

SDK (sdks/python/src/agent_control/):

Added shared event reconstruction helpers in evaluation_events.py
Updated evaluation.py to support:
- SDK reconstruction of local events
- SDK reconstruction of server events from lightweight EvaluationResponse + cached control definitions
- Combined enqueue of local + server events through existing SDK batcher
- Standalone check_evaluation() reconstruction when observability enabled
Removed old session-scoped merge_events mode from Python SDK surface

Server (server/src/agent_control_server/):

Updated endpoints/evaluation.py to be evaluation-only
Returns sanitized EvaluationResponse without observability payloads
Removed observability event emission and ingestion from /evaluation endpoint

TypeScript client:

Cleaned up generated client drift to match current pure-evaluation contract

Testing

Tests updated to cover:

✅ Default local event enqueue behavior
✅ SDK reconstruction of server events
✅ Local + server combined enqueue behavior
✅ Provider-backed trace/span propagation
✅ Local-event preservation when server call/parsing fails
✅ Pure /evaluation behavior with no server-side observability emission
✅ Response sanitization and helper behavior on server side

Migration notes

This is now the default observability behavior (not an optional mode)
Direct callers to /api/v1/evaluation no longer get observability automatically unless they also emit events through /api/v1/observability/events

…re/59787-merge-events

codecov · 2026-03-31T19:32:19Z

Codecov Report

❌ Patch coverage is 91.26984% with 11 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
sdks/python/src/agent_control/evaluation.py	91.54%	6 Missing ⚠️
sdks/python/src/agent_control/evaluation_events.py	89.79%	5 Missing ⚠️

📢 Thoughts on this report? Let us know!

lan17

I think there are still two blockers before we merge this. I left inline notes on the partial sink integration and the unconditional reconstruction work on the evaluation hot path.

sdks/python/src/agent_control/evaluation.py

lan17

Blocking issue:

server/src/agent_control_server/endpoints/evaluation.py:242-263 now trusts X-Agent-Control-Merge-Events from the caller and skips server-side observability ingestion solely on that header. A direct API caller can set the header and suppress control-execution events without reconstructing/re-emitting them anywhere. The server needs to treat merged mode as a trusted SDK flow, or otherwise keep the default ingestion path for untrusted callers.

sdks/python/src/agent_control/evaluation.py

sdks/python/src/agent_control/evaluation_events.py

sdks/python/src/agent_control/telemetry/trace_context.py

namrataghadi-galileo · 2026-04-02T17:43:12Z

@lan17 Fixed. The server no longer trusts X-Agent-Control-Merge-Events by itself. Merged mode is now established during init(..., merge_events=True) / initAgent, and the server records that enablement for the initialized (authenticated client, agent) session. On /evaluation, the server only skips observability ingestion if that same client previously initialized that same agent with merge-events enabled; otherwise it keeps the default ingestion path. Since merge_events defaults to False, existing behavior remains unchanged unless merged mode is explicitly turned on.

lan17

Re-reviewed current head 9af1755.

The original header-trust issue looks fixed, but there is still one blocker:

server/src/agent_control_server/merge_event_sessions.py:28-40 scopes merge-enable state by (client.api_key, agent_name). For session-cookie auth, _authenticate_via_cookie() returns AuthenticatedClient(api_key="", ...) in server/src/agent_control_server/auth.py:118-121, so every cookie-authenticated browser session collapses to the same key ("", agent_name). If one logged-in UI session initializes an agent with merge events enabled, any other logged-in UI session can send X-Agent-Control-Merge-Events: true and suppress server-side ingestion for that agent. The trusted-session key needs to use a per-session or otherwise user-specific identifier, not the empty api_key sentinel.

lan17 · 2026-04-02T20:11:14Z

Stepping back from the line-level issues, I think the direction here is reasonable, but the architecture is taking on too much implicit session and state coupling too quickly. Separating evaluation semantics from observability event delivery is the right idea, and extracting shared event reconstruction logic is a real improvement over duplicating emission behavior in multiple places. Where this starts to feel brittle is that the behavior of a single evaluation request now depends on prior init(...)/initAgent side effects, cached control definitions in the SDK, special headers, and server-side trust state. That is a lot of hidden coordination for a latency-sensitive path, and it makes correctness and debuggability harder than it should be.

For a v1, I would be comfortable with a narrower version of this. I would keep merged event creation scoped only to the local-first SDK path, keep check_evaluation() on the plain server-owned flow, and keep the built-in queue as the only delivery mechanism for now. In other words, make one explicit opt-in contract that says the SDK owns creation of the merged batch for this initialized session, and avoid broadening the public surface area until that protocol is solid. That still gets the main benefit of this work without committing us to a more general session-plus-delivery framework yet.

What I do not think we should do yet is spread this across multiple entry points, multiple trust mechanisms, and alternate delivery sinks at the same time. Once merge mode works in a way that is explicit, robust across auth modes and deployment topologies, and easy to reason about end to end, then adding a configurable sink is a straightforward follow-on. Right now #160 feels like it is layering a flexible delivery abstraction on top of a control plane that is still settling.

So my recommendation would be: land a smaller, clearer version of merged creation first, prove that the ownership model is correct, and only then generalize it. I think the core idea is good. I just do not think we should lock in this much flexibility before the underlying contract is simpler and more stable.

namrataghadi-galileo added 9 commits March 23, 2026 13:01

Add provider agnostic traceing

58eb627

fix linting

3d39706

add test

c3241c1

Merge branch 'feature/59789-add-provider-agnostic-tracing' into featu…

fc27836

…re/59787-merge-events

draft

55e57b5

address comments

61cd788

separate evaluation and emission

f0391db

resolve conflicst

d532300

update docstring

ff1e344

namrataghadi-galileo requested review from abhinav-galileo and lan17 March 31, 2026 19:33

TS sdk fix

a8c40c5

lan17 requested changes Apr 1, 2026

View reviewed changes

sdks/python/src/agent_control/evaluation.py Show resolved Hide resolved

sdks/python/src/agent_control/evaluation.py Outdated Show resolved Hide resolved

namrataghadi-galileo added 3 commits March 31, 2026 19:19

address comments

f58778e

ensure control sink exists for merged mode

c0d9a57

refactor this PR to only have merge mode

33fcc18

namrataghadi-galileo requested a review from lan17 April 1, 2026 19:58

Merge branch 'main' into feature/59787-merge-events

b785836

lan17 requested changes Apr 2, 2026

View reviewed changes

abhinav-galileo reviewed Apr 2, 2026

View reviewed changes

sdks/python/src/agent_control/evaluation.py Outdated Show resolved Hide resolved

abhinav-galileo reviewed Apr 2, 2026

View reviewed changes

sdks/python/src/agent_control/evaluation_events.py Outdated Show resolved Hide resolved

abhinav-galileo reviewed Apr 2, 2026

View reviewed changes

sdks/python/src/agent_control/telemetry/trace_context.py Show resolved Hide resolved

mergeing from main

e70832a

address comments

ca5bf6f

namrataghadi-galileo requested review from abhinav-galileo and lan17 April 2, 2026 17:57

namrataghadi-galileo added 2 commits April 2, 2026 11:00

fix typescript

001228a

fix TS

f24e07e

namrataghadi-galileo added 4 commits April 2, 2026 11:08

fix ts

a714e27

add more TS fix

9baf9d1

code coverage

8731f7e

add more tests for code cov

9af1755

lan17 requested changes Apr 2, 2026

View reviewed changes

namrataghadi-galileo added 5 commits April 2, 2026 17:20

address comments

45203f0

TS

c84c040

fix TS

ef59e71

fix TS

1a8a246

fix TS

b598903

namrataghadi-galileo changed the title ~~feat(sdk): 59787 merge events~~ feat(sdk): default merge events in SDK Apr 3, 2026

namrataghadi-galileo requested a review from lan17 April 3, 2026 22:04

lan17 approved these changes Apr 3, 2026

View reviewed changes

namrataghadi-galileo merged commit 5984a60 into main Apr 4, 2026
8 checks passed

namrataghadi-galileo deleted the feature/59787-merge-events branch April 4, 2026 00:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(sdk): default merge events in SDK#155

feat(sdk): default merge events in SDK#155
namrataghadi-galileo merged 27 commits intomainfrom
feature/59787-merge-events

namrataghadi-galileo commented Mar 31, 2026 •

edited

Loading

Uh oh!

codecov bot commented Mar 31, 2026 •

edited

Loading

Uh oh!

lan17 left a comment

Uh oh!

Uh oh!

Uh oh!

lan17 left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

namrataghadi-galileo commented Apr 2, 2026

Uh oh!

lan17 left a comment

Uh oh!

lan17 commented Apr 2, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

namrataghadi-galileo commented Mar 31, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Behavior

When SDK observability is enabled

When SDK observability is disabled

Error handling

What changed

Testing

Migration notes

Uh oh!

codecov bot commented Mar 31, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

lan17 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

lan17 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

namrataghadi-galileo commented Apr 2, 2026

Uh oh!

lan17 left a comment

Choose a reason for hiding this comment

Uh oh!

lan17 commented Apr 2, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

namrataghadi-galileo commented Mar 31, 2026 •

edited

Loading

codecov bot commented Mar 31, 2026 •

edited

Loading