[Content Understanding] Update to_llm_input page markers and filter telemetry warnings#47326
Open
chienyuanchang wants to merge 3 commits into
Open
[Content Understanding] Update to_llm_input page markers and filter telemetry warnings#47326chienyuanchang wants to merge 3 commits into
chienyuanchang wants to merge 3 commits into
Conversation
Contributor
There was a problem hiding this comment.
Pull request overview
This PR updates the azure-ai-contentunderstanding to_llm_input() helper output format to align with an upcoming service page-marker convention and to suppress non-user-facing telemetry warnings from the rendered rai_warnings YAML front matter.
Changes:
- Switched SDK-injected page markers from
<!-- page N -->to<!-- InputPageNumber: N -->, and avoided injecting markers when the service markdown already includesInputPageNumbermarkers. - Filtered service warning messages that begin with
LLMStats:(after leading whitespace) from the renderedrai_warningsblock. - Updated unit tests and sample tests to validate the new marker format and warning filtering, and bumped package version/changelog.
Reviewed changes
Copilot reviewed 7 out of 7 changed files in this pull request and generated no comments.
Show a summary per file
| File | Description |
|---|---|
| sdk/contentunderstanding/azure-ai-contentunderstanding/tests/test_to_llm_input.py | Updates assertions for the new InputPageNumber marker format and adds coverage for LLMStats: warning filtering and duplicate-marker defense. |
| sdk/contentunderstanding/azure-ai-contentunderstanding/tests/samples/test_sample_to_llm_input.py | Updates sample test expectations to the new page marker format. |
| sdk/contentunderstanding/azure-ai-contentunderstanding/tests/samples/test_sample_to_llm_input_async.py | Updates async sample test expectations to the new page marker format. |
| sdk/contentunderstanding/azure-ai-contentunderstanding/README.md | Adds 1.2.0b2 to the SDK-to-service-version compatibility table. |
| sdk/contentunderstanding/azure-ai-contentunderstanding/CHANGELOG.md | Adds an unreleased 1.2.0b2 entry documenting the marker change and telemetry-warning filtering. |
| sdk/contentunderstanding/azure-ai-contentunderstanding/azure/ai/contentunderstanding/_version.py | Bumps the package version to 1.2.0b2. |
| sdk/contentunderstanding/azure-ai-contentunderstanding/azure/ai/contentunderstanding/_helpers.py | Implements InputPageNumber marker injection + duplicate-marker bypass, and filters LLMStats: entries from rendered RAI warnings. |
This was referenced Jun 5, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Description
Updates the
azure-ai-contentunderstandingto_llm_input()helper to align its rendered output with the upcoming service page-marker format and to remove non-user-facing telemetry from RAI warning output.Changes made:
<!-- page N -->to<!-- InputPageNumber: N -->.<!-- InputPageNumber:,to_llm_input()does not inject additional page markers.LLMStats:from the renderedrai_warningsfront matter.LLMStats:text when it appears in the document markdown body; only structured warnings are filtered.CHANGELOG.md.Relevant issues / context:
LLMStats:filtering: Python: Adopt azure-ai-contentunderstandingto_llm_inputin CU context provider microsoft/agent-framework#5796This PR is not based on regenerated SDK code from a new API spec.
All SDK Contribution checklist:
to_llm_input()helper.General Guidelines and Best Practices
Testing Guidelines
Testing performed: