Pull Request Overview
This pull request implements a comprehensive learning evaluation infrastructure for the Atlas system, introducing playbook entries (formerly "policy nuggets"), usage tracking instrumentation, impact metrics, and prompt digest functionality to handle large metadata payloads. The changes enable systematic measurement of adaptive efficiency and cross-incident transfer for learning-based agent behavior.
Key Changes:
- Renamed "policy nuggets" to "playbook entries" with structured schema, rubric gates, and provenance tracking
- Added runtime usage instrumentation to track cue hits, action adoptions, and failure signals per playbook entry
- Implemented prompt digest system to trim large metadata blobs for providers with smaller context windows (e.g., Claude)
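The digest idea can be sketched as follows. This is a minimal illustration under assumed names (`build_digest`, `budget`, and the section format are hypothetical), not the PR's actual `prompt_digest.py` implementation:

```python
# Sketch: keep metadata sections in priority order until a provider-specific
# character budget is exhausted, truncating the first section that no longer fits.
# build_digest and its parameters are illustrative, not the PR's real API.

def build_digest(sections: list[tuple[str, str]], budget: int) -> str:
    """sections: (name, body) pairs ordered from highest to lowest signal."""
    kept: list[str] = []
    remaining = budget
    for name, body in sections:
        block = f"[{name}] {body}"
        if len(block) + 1 <= remaining:  # +1 for the joining newline
            kept.append(block)
            remaining -= len(block) + 1
        elif remaining > len(name) + 4:
            # Partially keep the first over-budget section, then stop.
            kept.append(block[: remaining - 1])
            break
        else:
            break
    return "\n".join(kept)

digest = build_digest(
    [("incident", "retry storm in us-east"), ("history", "x" * 500)],
    budget=60,
)
```

The priority ordering is what lets the digest "preserve high-signal content": low-priority sections are the ones sacrificed when a smaller context window (e.g. Claude's) forces trimming.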
Reviewed Changes
Copilot reviewed 24 out of 24 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| `atlas/learning/playbook_entries.py` | New module implementing playbook entry schema validation, rubric scoring (actionability, generality, hookability, concision), and gate enforcement |
| `atlas/learning/usage.py` | New runtime tracker capturing cue hits, action adoptions, and session outcomes (reward, tokens, incident IDs, retry counts) |
| `atlas/learning/synthesizer.py` | Extended to evaluate playbook entries against rubric gates, merge impact metrics into learning state, and maintain provenance metadata |
| `atlas/connectors/prompt_digest.py` | New digest builder that trims metadata sections to fit provider-specific character budgets while preserving high-signal content |
| `atlas/connectors/openai.py` | Integrated prompt digest into message building; metadata is now digested before serialization |
| `atlas/evaluation/learning_report.py` | Added playbook metrics, lifecycle summary, per-entry impact tracking, usage metrics, and efficiency snapshots to summary/markdown outputs |
| `atlas/core/__init__.py` | Wired the usage tracker into the session lifecycle; captures reward/token deltas and incident context, and merges impact rollups into learning state |
| `atlas/personas/student.py`, `atlas/personas/teacher.py` | Instrumented cue detection and action adoption recording at runtime |
| `atlas/config/models.py` | Added config models for digest, playbook schema, gates, rubric weights, and usage tracking |
| `scripts/eval_learning.py` | Extended CLI with prompt variant, synthesis model, pamphlet injection, and playbook entry label flags |
| `docs/learning_eval.md` | Documented prompt digest, playbook schema, impact metrics, experiment configs, and evaluation workflow |
| `configs/eval/*.yaml` | New evaluation configs for baseline, scope-shift, and Claude synthesis variants |
| `tests/unit/*.py` | Added test coverage for usage tracker, prompt digest, learning report impact sections, and synthesizer gate failures |
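A per-entry usage tracker of the kind described for `atlas/learning/usage.py` might look like the following sketch. The class and method names (`UsageTracker`, `record_cue_hit`, etc.) are assumptions for illustration, not the module's real API:

```python
from collections import defaultdict
from dataclasses import dataclass, field

# Illustrative sketch: count cue hits, action adoptions, and failure signals
# per playbook entry, and derive an adoption rate from them.

@dataclass
class UsageTracker:
    cue_hits: defaultdict = field(default_factory=lambda: defaultdict(int))
    adoptions: defaultdict = field(default_factory=lambda: defaultdict(int))
    failures: defaultdict = field(default_factory=lambda: defaultdict(int))

    def record_cue_hit(self, entry_id: str) -> None:
        self.cue_hits[entry_id] += 1

    def record_adoption(self, entry_id: str) -> None:
        self.adoptions[entry_id] += 1

    def record_failure(self, entry_id: str) -> None:
        self.failures[entry_id] += 1

    def adoption_rate(self, entry_id: str) -> float:
        hits = self.cue_hits[entry_id]
        return self.adoptions[entry_id] / hits if hits else 0.0

tracker = UsageTracker()
tracker.record_cue_hit("pb-001")
tracker.record_cue_hit("pb-001")
tracker.record_adoption("pb-001")
rate = tracker.adoption_rate("pb-001")  # 0.5
```

Separating cue hits from adoptions is what makes the adoption rate meaningful: an entry whose cues fire often but is rarely adopted is a candidate for rubric-gate review.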
In the diff context around `OpenAIAdapter`:

> `__all__ = ["OpenAIAdapter"]`
> `logger = logging.getLogger(__name__)`

Duplicate logger initialization. The logger is already initialized at line 27; remove this redundant declaration at line 205.

Suggested change: delete the duplicate `logger = logging.getLogger(__name__)` line.
> `allowed_runtime_handles: List[str] = Field(default_factory=list)`
> `runtime_handle_prefixes: List[str] = Field(default_factory=list)`
> `cue_types: List[str] = Field(default_factory=lambda: ["regex", "keyword", "predicate"])`
> `default_scope_category: str = "differentiation"`

[nitpick] The default scope category is set to "differentiation" in the schema config, but learning_overhaul_base.yaml and learning_overhaul_claude.yaml override it to "reinforcement" (lines 183 and 127 respectively). This inconsistency could lead to confusion. Consider documenting why the global default differs from the config-specific defaults, or aligning them for clarity.

Suggested change: replace `default_scope_category: str = "differentiation"` with `default_scope_category: str = "reinforcement"`.
> `try:`
> `    return float(total)`
> `except (TypeError, ValueError):`
> `    total = None`

The variable `total` is assigned here but never used afterwards.

Suggested change: replace `total = None` with `pass`.
|
Superseded by #94 |