[BREAKING] Python: Enable instrumentation by default by TaoChenOSU · Pull Request #5865 · microsoft/agent-framework

TaoChenOSU · 2026-05-14T20:44:06Z

Motivation and Context

Closes #5749

Description

Changes required to make the code base consistent throughout.

Contribution Checklist

The code builds clean without any errors or warnings
The PR follows the Contribution Guidelines
All unit tests pass, and I have added new tests where possible
Is this a breaking change? If yes, add "[BREAKING]" prefix to the title of the PR.

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Updates the Python Agent Framework and samples to reflect that instrumentation is now enabled by default, and shifts programmatic opt-in to a new enable_sensitive_telemetry() API for sensitive payload capture.

Changes:

Make ENABLE_INSTRUMENTATION default to enabled (opt-out via false) and update docs/samples accordingly.
Replace most uses of enable_instrumentation() with enable_sensitive_telemetry() for sensitive-data opt-in.
Update DevUI and Foundry integration points to reflect the new defaults/entry points.

Reviewed changes

Copilot reviewed 23 out of 23 changed files in this pull request and generated 7 comments.

Show a summary per file

File	Description
python/samples/README.md	Updates env var table to reference `observability` settings instead of `enable_instrumentation()`
python/samples/04-hosting/foundry-hosted-agents/responses/07_observability/agent.yaml	Removes `ENABLE_INSTRUMENTATION` from sample agent config; keeps sensitive-data opt-in
python/samples/04-hosting/foundry-hosted-agents/responses/07_observability/agent.manifest.yaml	Removes `ENABLE_INSTRUMENTATION` from manifest sample
python/samples/04-hosting/foundry-hosted-agents/responses/07_observability/README.md	Documents instrumentation default-on and opt-out
python/samples/04-hosting/foundry-hosted-agents/responses/07_observability/.env.example	Removes `ENABLE_INSTRUMENTATION` example line
python/samples/03-workflows/observability/executor_io_observation.py	Wording update to reflect default instrumentation
python/samples/02-agents/observability/advanced_manual_setup_console_output.py	Switches to `enable_sensitive_telemetry()` for sensitive data capture
python/samples/02-agents/observability/README.md	Updates guidance/code samples to default-on instrumentation + sensitive opt-in
python/samples/02-agents/observability/.env.example	Documents opt-out via `ENABLE_INSTRUMENTATION=false`
python/packages/lab/lightning/agent_framework_lab_lightning/init.py	Removes explicit `enable_instrumentation()` call from tracer init
python/packages/foundry/tests/foundry/test_foundry_chat_client.py	Updates mocks to `enable_sensitive_telemetry()`
python/packages/foundry/tests/foundry/test_foundry_agent.py	Updates mocks to `enable_sensitive_telemetry()`
python/packages/foundry/agent_framework_foundry/_chat_client.py	Calls `enable_sensitive_telemetry()` only when sensitive data is requested
python/packages/foundry/agent_framework_foundry/_agent.py	Calls `enable_sensitive_telemetry()` only when sensitive data is requested
python/packages/devui/frontend/src/components/layout/deployment-modal.tsx	Updates docker-compose snippet to reflect default-on instrumentation
python/packages/devui/agent_framework_devui/_server.py	Treats instrumentation as enabled unless explicitly set to `false`
python/packages/devui/agent_framework_devui/_executor.py	Updates comment to reflect new defaults/entry point
python/packages/devui/agent_framework_devui/init.py	Removes `instrumentation_enabled` parameter and related logic
python/packages/core/tests/core/test_observability.py	Updates tests to cover the new `enable_sensitive_telemetry()` behavior
python/packages/core/agent_framework/observability.py	Defaults instrumentation to enabled; replaces `enable_instrumentation()` with `enable_sensitive_telemetry()`
python/CODING_STANDARD.md	Updates import example to `enable_sensitive_telemetry`
python/.github/skills/python-development/SKILL.md	Updates skills doc sample import to `enable_sensitive_telemetry`
python/.env.example	Updates env example to reflect default-on instrumentation

moonbox3 · 2026-05-14T21:02:39Z

Python Test Coverage Report •

File	Stmts	Miss	Cover	Missing
packages/a2a/agent_framework_a2a
_agent.py	277	18	93%	162, 349, 354, 356, 400–401, 554, 570, 578, 598, 619, 647, 663, 673, 684, 691–692, 733
packages/chatkit/agent_framework_chatkit
_converter.py	135	49	63%	112, 117, 165, 167, 335, 388, 390, 409–411, 413, 431, 433, 435, 438, 450, 460, 478, 498–522, 524–525, 527–528, 531–532
packages/core/agent_framework
_skills.py	874	26	97%	289, 531, 541, 977, 992, 994–995, 1549, 1578, 2544–2545, 2680, 2685, 2688, 2693, 2720, 2725, 2779, 2788, 2793, 2796, 2801, 2825, 2830, 3072–3073
observability.py	817	61	92%	382, 384–385, 388, 391, 394–395, 400–401, 407–408, 414–415, 422, 424–426, 429–431, 436–437, 443–444, 450–451, 458, 615–616, 815, 819–821, 823, 827–828, 832, 870, 872, 883–885, 887–889, 893, 901, 1025–1026, 1261, 1504–1505, 1608, 1732, 1773–1774, 1917, 2053, 2250, 2468, 2470
packages/foundry/agent_framework_foundry
_agent.py	239	56	76%	121, 124, 243–244, 248–250, 255–258, 348, 421–422, 434–435, 447–449, 451–452, 454–460, 462–463, 465, 467, 473–475, 478–487, 491–492, 679–680, 683, 709, 719, 735, 805, 810, 814
_chat_client.py	161	21	86%	85, 87–89, 93–94, 98, 192, 227, 283, 288, 339, 400, 402, 500, 504–505, 507–510
packages/openai/agent_framework_openai
_chat_client.py	1080	149	86%	276, 289, 631–635, 639–642, 648–652, 701–711, 718–720, 749, 765, 773, 796, 914, 1013, 1072, 1074, 1076, 1078, 1144, 1158, 1238, 1248, 1253, 1296, 1407–1408, 1423, 1632, 1637, 1641–1643, 1647–1648, 1731, 1741, 1768, 1774, 1784, 1790, 1795, 1801, 1806–1807, 1826, 1829–1832, 1846, 1848, 1856–1857, 1869, 1911, 2001, 2023–2024, 2039–2040, 2058–2059, 2102, 2268, 2306–2307, 2325, 2405–2413, 2443, 2553, 2588, 2603, 2623–2633, 2646, 2657–2661, 2675, 2689–2700, 2709, 2741–2744, 2754–2755, 2766–2768, 2782–2784, 2794–2795, 2801, 2816
TOTAL	34498	3930	88%

Python Unit Test Overview

Tests	Skipped	Failures	Errors	Time
6936	30 💤	0 ❌	0 🔥	1m 50s ⏱️

github-actions

Automated Code Review

Reviewers: 4 | Confidence: 90%

✗ Correctness

The PR correctly implements the 'enable instrumentation by default' change in ObservabilitySettings and renames enable_instrumentation to enable_sensitive_telemetry. However, beyond the already-flaged issues in the review thread, the configure_azure_monitor() methods in both _agent.py and _chat_client.py have a security-relevant behavioral regression: the old code always actively set enable_sensitive_data to False by default (overriding the env var), whereas the new code is a no-op when enable_sensitive_data=False, meaning ENABLE_SENSITIVE_DATA=true in the environment will no longer be overidden, potentially leaking sensitive telemetry data in production.

✓ Security Reliability

The PR correctly changes the default for enable_instrumentation from False to True, with sound env-var parsing logic. Sensitive data capture remains opt-in. All existing references to enable_instrumentation outside the diff are either pytest fixtures or attribute-level monkeypatches on OBSERVABILITY_SETTINGS (not calls to the removed function), so they won't break. The six existing unresolved review comments already cover the significant concerns: the breaking removal of enable_instrumentation() from the public API, the configure_azure_monitor regression when enable_sensitive_data=False, the DevUI boolean parsing inconsistency, the deployment modal comment mismatch, and the DevUI serve() parameter removal. No additional security or reliability defects were found.

✓ Test Coverage

The PR renames enable_instrumentation() to enable_sensitive_telemetry() and changes the default for enable_instrumentation from False to True. Test renames and adaptations are correct, but there are two notable test coverage gaps: (1) no test verifies the new default enable_instrumentation=True when the ENABLE_INSTRUMENTATION env var is unset — this is the core behavioral change of the PR, and (2) neither foundry test file covers the configure_azure_monitor(enable_sensitive_data=False) path, leaving the new if enable_sensitive_data: guard untested for the false branch.

✗ Design Approach

The main design issue is in the Agent Lightning integration: this PR removes the explicit instrumentation enablement from AgentFrameworkTracer.init(), which means that a user who has ENABLE_INSTRUMENTATION=false will no longer get Agent Framework spans through the Lightning tracer even though that tracer is documented as the component that enables Agent Framework observability for Lightning.

Flagged Issues

configure_azure_monitor() no longer overrides ENABLE_SENSITIVE_DATA=true from the environment when called with the default enable_sensitive_data=False. The old code always called enable_instrumentation(enable_sensitive_data=False), which forcefully set OBSERVABILITY_SETTINGS.enable_sensitive_data=False. The new if enable_sensitive_data: guard skips the call entirely, leaving env-sourced enable_sensitive_data=True in effect. This is a security regression: sensitive data (prompts, tool arguments) may leak in production when users expect configure_azure_monitor() to suppress it.
AgentFrameworkTracer no longer guarantees Agent Framework telemetry is enabled for the Lightning integration. With ENABLE_INSTRUMENTATION=false, observability gates become no-ops, which contradicts the Lightning tracer's documented contract to enable OpenTelemetry observability for Agent-lightning and the README guidance that instantiating AgentFrameworkTracer() sends telemetry to Lightning.

Automated review by TaoChenOSU's agents

…default

github-actions

Automated Code Review

Reviewers: 4 | Confidence: 90%

✓ Correctness

The PR correctly changes the default for enable_instrumentation from False to True, adds a new enable_sensitive_telemetry() convenience function, gates expensive _capture_messages serialization on span.is_recording() as a performance optimization, and replaces assert_never in the converter with a graceful fallback. All code changes are well-tested and semantically correct. The _chat_client.py reformatting is semantically equivalent. No correctness bugs found.

✓ Security Reliability

This PR enables instrumentation by default, adds a new enable_sensitive_telemetry() convenience API, and gates expensive _capture_messages serialization on span.is_recording(). From a security and reliability perspective, the changes are sound: sensitive data capture (enable_sensitive_data) remains off by default and is gated behind SENSITIVE_DATA_ENABLED (which requires both enable_instrumentation AND enable_sensitive_data to be true). The span.is_recording() guards are a good reliability improvement that avoids unnecessary JSON serialization when no tracer provider is configured. The assert_never → logger.debug + skip change in _converter.py trades strict type exhaustiveness for forward compatibility, which is a reasonable tradeoff. No security or reliability issues found.

✓ Test Coverage

The PR's headline behavioral change — defaulting enable_instrumentation to True — lacks a direct unit test on ObservabilitySettings. The removed test_enable_instrumentation_reads_env_sensitive_data test leaves the env-var re-read fallback path of enable_instrumentation() without coverage. The new warning logic in __init__ is also untested. The span.is_recording() gating and the new enable_sensitive_telemetry() function are well-covered by the new tests.

✓ Design Approach

One design issue stands out. The ChatKit converter now silently drops any unsupported ThreadItem variant instead of surfacing that the converter no longer understands the thread payload. That conflicts with the converter’s own documented contract and can truncate conversation state without any caller-visible failure. I did not find another repo-backed design issue in the observability changes.

Automated review by TaoChenOSU's agents

Add a public disable_instrumentation() entry point so users can explicitly opt out of Agent Framework telemetry, with a sticky-disable flag that makes the user's intent "leading" — no framework code path (foundry's configure_azure_monitor, configure_otel_providers, enable_instrumentation, enable_sensitive_telemetry, or direct OBSERVABILITY_SETTINGS.enable_* writes) can re-enable instrumentation until the user explicitly clears the disable with enable_instrumentation(force=True) / enable_sensitive_telemetry(force=True). Also addresses the two remaining unresolved review threads on the PR: 1. test_observability_settings_defaults_instrumentation_true pins the new "ENABLE_INSTRUMENTATION defaults to True when env unset" behavior. 2. test_enable_instrumentation_reads_env_sensitive_data restores coverage for the post-import load_dotenv() fallback path. Implementation: - ObservabilitySettings.enable_instrumentation / enable_sensitive_data become properties backed by _enable_*. While _user_disabled is True, the getters return False and the setters drop True writes (defense in depth so third- party writes can't subvert the disable). - Public is_user_disabled read-only property lets integrations (e.g. foundry's configure_azure_monitor) cheaply check the disable state without poking at privates. - enable_instrumentation() and enable_sensitive_telemetry() short-circuit with an info log when disabled; gain a force=True kwarg that clears the disable. - configure_otel_providers() still creates providers / exporters / views so a later force-enable can use them, but logs an info message when called while disabled. - Foundry's FoundryChatClient.configure_azure_monitor and FoundryAgent.configure_azure_monitor early-return when the user has disabled, so Azure Monitor's global providers aren't installed unnecessarily. Tests: 11 new tests covering default-on, env re-read at call time, sticky behavior against each re-enable surface (enable_instrumentation, enable_sensitive_telemetry, configure_otel_providers, direct attribute writes), force=True override, re-arming the disable, and the __all__ export. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

eavanvalkenburg · 2026-05-19T07:31:18Z

Pushed 6a23dcd while Tao is OOF. Two things in this push:

1. New `disable_instrumentation()` with sticky user-intent (additional requirement)

Adds a public disable_instrumentation() entry point so users can explicitly opt out of Agent Framework telemetry. The disable is sticky: subsequent attempts by framework auto-setup paths, library integrations, enable_instrumentation(), enable_sensitive_telemetry(), configure_otel_providers(), or direct writes to OBSERVABILITY_SETTINGS.enable_instrumentation are silently dropped — so the user's intent is "leading" against any other process that might attempt to re-enable.

To clear the disable, the user calls enable_instrumentation(force=True) or enable_sensitive_telemetry(force=True). Without force=True, those calls are no-ops with an info log.

How it's enforced:

enable_instrumentation / enable_sensitive_data on ObservabilitySettings are now properties backed by _enable_* private attrs. While _user_disabled is True, the getters return False and the setters drop True writes (defense in depth so third-party code that does OBSERVABILITY_SETTINGS.enable_instrumentation = True directly is also intercepted).
A new public read-only is_user_disabled property lets integrations like foundry's configure_azure_monitor cheaply check the disable state without poking at privates.
configure_otel_providers() still configures providers / exporters / views (so a later force=True re-enable can use them) but logs an info message when called while disabled.
Foundry's FoundryChatClient.configure_azure_monitor and FoundryAgent.configure_azure_monitor early-return when the user has disabled, so Azure Monitor's global providers aren't installed unnecessarily.

2. Addresses the two remaining unresolved review threads

test_observability_settings_defaults_instrumentation_true pins the new "ENABLE_INSTRUMENTATION defaults to True when env unset" behavior (the PR's headline change).
test_enable_instrumentation_reads_env_sensitive_data restores coverage for the post-import load_dotenv() fallback path that the old test covered.

Tests

11 new tests covering: default-on; env re-read at call time; sticky behavior against each re-enable surface (enable_instrumentation(), enable_sensitive_telemetry(), configure_otel_providers(), direct attribute writes); force=True override on both functions; re-arming the disable after a force-enable; and the __all__ export.

All 2781 core + 283 foundry + 323 openai + 157 devui tests pass; pyright + mypy clean on all four packages.

Add a "Disabling instrumentation" section to the observability sample README that walks through: - The distinction between the ENABLE_INSTRUMENTATION env var (initial, non-sticky) and disable_instrumentation() (process-wide, sticky). - Why the sticky semantics matter: framework integrations like FoundryChatClient.configure_azure_monitor() can call enable_instrumentation() as part of their setup, and the user's opt-out needs to win. - All five surfaces guarded by the sticky disable (property reads, public enable functions, configure_otel_providers, direct attribute writes, is_user_disabled-aware integrations). - The force=True escape hatch on both enable_instrumentation() and enable_sensitive_telemetry(). - How third-party integrations should consult OBSERVABILITY_SETTINGS.is_user_disabled. - The limits of the disable (does not tear down existing providers / in-flight spans / third-party instrumentation, does not persist across processes). Cross-links the new section from the ENABLE_INSTRUMENTATION row in the env vars table. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

…antees Replace 'no telemetry will be emitted no matter what' (which is too strong, since callers can still pass force=True or mutate private attributes) with language framing the disable as a user-intent contract that library and framework code is expected to honor: the framework actively short-circuits the public enable paths, force=True and private-attribute writes are acknowledged as out-of-contract escape hatches that integrations should not use on the user's behalf. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- opentelemetry-sdk is no longer a hard dependency; it is lazily imported by create_resource(), create_metric_views(), and configure_otel_providers() with a clear ImportError when missing. Day-to-day instrumentation works with opentelemetry-api alone provided some other component configures the global OpenTelemetry providers (Azure Monitor, an APM agent, application bootstrap, etc.). - opentelemetry-semantic-conventions-ai is no longer used anywhere in the source; remove it from the listed dependencies. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

…nly relevant migration The old guide documented the move away from setup_observability(otlp_endpoint=...) which was an earlier-release API change unrelated to this PR and stale enough that it's more confusing than helpful at this point. Replace it with a short note on the single migration this PR introduces: callers of enable_instrumentation(enable_sensitive_data=True) should switch to enable_sensitive_telemetry(). Cross-link to the Disabling instrumentation section for the rare 'force on without enabling sensitive data' use case where enable_instrumentation() still applies. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Enable instrumentation by default

0219e17

TaoChenOSU self-assigned this May 14, 2026

Copilot AI review requested due to automatic review settings May 14, 2026 20:44

TaoChenOSU added python observability Issues related to observability or telemetry labels May 14, 2026

github-actions Bot changed the title ~~Enable instrumentation by default~~ Python: Enable instrumentation by default May 14, 2026

TaoChenOSU changed the title ~~Python: Enable instrumentation by default~~ [BREAKING] Python: Enable instrumentation by default May 14, 2026

Copilot AI reviewed May 14, 2026

View reviewed changes

moonbox3 added documentation Improvements or additions to documentation lab Agent Framework Lab labels May 14, 2026

Copilot started reviewing on behalf of TaoChenOSU May 14, 2026 20:54 View session

github-actions Bot reviewed May 14, 2026

View reviewed changes

TaoChenOSU added 8 commits May 14, 2026 15:02

Update samples

4a83d92

Optimization when span is not recording

07ea764

Address Copilot comments

13b8e68

Revert uv.lock

f5f0d82

Add warning

9007b32

Merge branch 'main' into local-branch-python-enable-observability-by-…

94ce49d

…default

Formatting

5e5c697

Fix mypy

1d1e58e

TaoChenOSU marked this pull request as ready for review May 15, 2026 18:01

github-actions Bot reviewed May 15, 2026

View reviewed changes

Comment thread python/packages/core/agent_framework/observability.py Outdated

Comment thread python/packages/core/tests/core/test_observability.py

eavanvalkenburg and others added 4 commits May 19, 2026 11:24

eavanvalkenburg approved these changes May 19, 2026

View reviewed changes

eavanvalkenburg enabled auto-merge May 19, 2026 09:45

moonbox3 approved these changes May 20, 2026

View reviewed changes

eavanvalkenburg added this pull request to the merge queue May 20, 2026

Merged via the queue into main with commit 72a6157 May 20, 2026
37 checks passed

giles17 mentioned this pull request May 21, 2026

Python: bump package versions for 1.6.0 release #6017

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BREAKING] Python: Enable instrumentation by default#5865

[BREAKING] Python: Enable instrumentation by default#5865
eavanvalkenburg merged 14 commits into
mainfrom
feature/python-enable-instrumentation-by-default

TaoChenOSU commented May 14, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

moonbox3 commented May 14, 2026 •

edited

Loading

Uh oh!

github-actions Bot left a comment

Uh oh!

github-actions Bot left a comment

Uh oh!

Uh oh!

Uh oh!

eavanvalkenburg commented May 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

TaoChenOSU commented May 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation and Context

Description

Contribution Checklist

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

moonbox3 commented May 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Python Unit Test Overview

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Automated Code Review

✗ Correctness

✓ Security Reliability

✓ Test Coverage

✗ Design Approach

Flagged Issues

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Automated Code Review

✓ Correctness

✓ Security Reliability

✓ Test Coverage

✓ Design Approach

Uh oh!

Uh oh!

Uh oh!

eavanvalkenburg commented May 19, 2026

1. New disable_instrumentation() with sticky user-intent (additional requirement)

2. Addresses the two remaining unresolved review threads

Tests

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

TaoChenOSU commented May 14, 2026 •

edited

Loading

moonbox3 commented May 14, 2026 •

edited

Loading

1. New `disable_instrumentation()` with sticky user-intent (additional requirement)