Feat: Evaluation preview features - Batch evaluation and config bundles #446
Re-adding workflow that was lost in a force push to main. Original PR: aws/bedrock-agentcore-sdk-python-private#60 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* feat: add config bundle support to runtime. Introduces a new config_bundle module that lets agent handlers read configuration from BedrockAgentCore configuration bundles, delivered via W3C baggage headers on each invocation.
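To illustrate the delivery mechanism, here is a minimal sketch of reading one entry from a W3C `baggage` header using only the standard library. It is not the `config_bundle` module's code: the key name `example.config_bundle` and the helper names `parse_baggage` and `get_config_bundle_ref` are hypothetical, since the PR text does not show the real key or API.

```python
from typing import Dict, Optional
from urllib.parse import unquote

# Hypothetical baggage key; the key actually used by config_bundle is not shown in this PR.
CONFIG_BUNDLE_KEY = "example.config_bundle"


def parse_baggage(header: str) -> Dict[str, str]:
    """Parse a W3C baggage header into a key -> value dict, ignoring properties."""
    entries: Dict[str, str] = {}
    for member in header.split(","):
        # Properties follow the value after ';' and are dropped in this sketch.
        key_value = member.split(";", 1)[0].strip()
        if "=" not in key_value:
            continue  # skip empty or malformed list members
        key, _, value = key_value.partition("=")
        # Baggage values are percent-encoded on the wire.
        entries[key.strip()] = unquote(value.strip())
    return entries


def get_config_bundle_ref(headers: Dict[str, str]) -> Optional[str]:
    """Return the (hypothetical) config bundle entry from request headers, if present."""
    # HTTP header names are case-insensitive, so normalize before the lookup.
    lowered = {name.lower(): value for name, value in headers.items()}
    return parse_baggage(lowered.get("baggage", "")).get(CONFIG_BUNDLE_KEY)
```

A handler could call `get_config_bundle_ref(request_headers)` on each invocation and fall back to defaults when it returns `None`.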
* feat: add batch evaluation. Introduces BatchEvaluationRunner, which orchestrates end-to-end batch evaluation against the AgentCore Evaluation Service.
GITHUB_TOKEN lacks the workflows permission, so syncing .github/workflows/ from the public repo causes push failures. After merging, restore our workflow files from HEAD before committing in both the clean and conflict paths.
Bumps [cryptography](https://github.com/pyca/cryptography) from 46.0.5 to 46.0.7.
- [Changelog](https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst)
- [Commits](pyca/cryptography@46.0.5...46.0.7)
---
updated-dependencies:
- dependency-name: cryptography
  dependency-version: 46.0.7
  dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

Bumps [pillow](https://github.com/python-pillow/Pillow) from 12.1.1 to 12.2.0.
- [Release notes](https://github.com/python-pillow/Pillow/releases)
- [Changelog](https://github.com/python-pillow/Pillow/blob/main/CHANGES.rst)
- [Commits](python-pillow/Pillow@12.1.1...12.2.0)
---
updated-dependencies:
- dependency-name: pillow
  dependency-version: 12.2.0
  dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

Bumps [python-multipart](https://github.com/Kludex/python-multipart) from 0.0.22 to 0.0.26.
- [Release notes](https://github.com/Kludex/python-multipart/releases)
- [Changelog](https://github.com/Kludex/python-multipart/blob/master/CHANGELOG.md)
- [Commits](Kludex/python-multipart@0.0.22...0.0.26)
---
updated-dependencies:
- dependency-name: python-multipart
  dependency-version: 0.0.26
  dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* feat(runtime): stamp OTel spans with routing experiment baggage
Previously, FAILED, STOPPED, and DELETING raised RuntimeError, and COMPLETED_WITH_ERRORS fell through to the unknown-status RuntimeError. All terminal states now return the response, so callers can inspect result.status and result.error_details.
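The behavior described in this commit can be sketched as a polling loop that returns on any terminal status instead of raising. This is an illustration, not the SDK's implementation: `BatchResult`, `wait_for_terminal`, the `IN_PROGRESS` status, the inclusion of `COMPLETED` in the terminal set, and the list type of `error_details` are all assumptions made for the example.

```python
import time
from dataclasses import dataclass, field
from typing import Callable, List

# FAILED, STOPPED, DELETING, and COMPLETED_WITH_ERRORS come from the commit message;
# COMPLETED is assumed to be the success counterpart.
TERMINAL_STATUSES = {"COMPLETED", "COMPLETED_WITH_ERRORS", "FAILED", "STOPPED", "DELETING"}


@dataclass
class BatchResult:
    """Hypothetical stand-in for the service response."""
    status: str
    error_details: List[str] = field(default_factory=list)


def wait_for_terminal(
    get_status: Callable[[], BatchResult],
    poll_interval: float = 0.0,
    max_polls: int = 100,
) -> BatchResult:
    """Poll until a terminal status is seen and return the response as-is."""
    for _ in range(max_polls):
        result = get_status()
        if result.status in TERMINAL_STATUSES:
            # Return even on failure so the caller decides how to react.
            return result
        time.sleep(poll_interval)
    raise TimeoutError(f"no terminal status after {max_polls} polls")
```

With this shape, a caller branches on the returned value, for example `if result.status != "COMPLETED": log(result.error_details)`, rather than wrapping the call in `try/except RuntimeError`.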
docs: add preview warning docstrings to all new evo methods and classes
✅ No Breaking Changes Detected: no public API breaking changes found in this PR.
Couple things I noticed:
Description of changes:
Batch Evaluation
ConfigBundles
By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.