docs(agent-workflow): deduplicate judge env var block (#130) by Dongbumlee · Pull Request #159 · Azure/agentops

Dongbumlee · 2026-05-14T18:28:06Z

Closes #130.

Summary

Doc-only validation pass for docs/tutorial-agent-workflow.md against current develop (291 lines). One drift fixed.

Drift fixed

Issue	Evidence
Section 3 'Initialize AgentOps' set both `AZURE_OPENAI_DEPLOYMENT` and `AZURE_AI_MODEL_DEPLOYMENT_NAME` to the same value.	`_model_config()` in `src/agentops/pipeline/runtime.py` reads them as fallbacks of each other (`os.getenv(AZURE_OPENAI_DEPLOYMENT) or os.getenv(AZURE_AI_MODEL_DEPLOYMENT_NAME)`); setting both is redundant. Same fix shipped in PR #158 for the Copilot-skills tutorial.

What I verified without re-deploying the Container App

Tutorial Section 1-2 builds + deploys a FastAPI tool-calling agent to ACA. That's heavy to re-spin every validation pass, so I validated the AgentOps-side claims directly against the code:

Claim	Verified via
The documented `agentops.yaml` shape parses cleanly	`AgentOpsConfig.model_validate(...)` succeeded with all six top-level fields (`request_field`, `response_field`, `tool_calls_field`, `thresholds`, etc.)
All six documented thresholds are real evaluators	`grep -n score_key src/agentops/core/evaluators.py`: `coherence`, `fluency`, `tool_call_accuracy`, `intent_resolution`, `task_adherence`, `avg_latency_seconds` all present with matching default thresholds
Dataset shape (`input` + `expected` + `tool_definitions` + `tool_calls`) triggers the agent-evaluator set	Module docstring in `src/agentops/core/evaluators.py`: 'If rows include `tool_calls` or `tool_definitions`: add agent evaluators (ToolCallAccuracy, IntentResolution, TaskAdherence).'
`agentops workflow generate --kinds pr --force` works	`--help` shows `--force` is still a valid `workflow generate` option (distinct from the deprecated `skills install --force`)
`agentops doctor --severity-fail critical` works	Confirmed in #133 / #156 validation

Tests

Full suite: 346 passed, 1 skipped (with the pre-existing test_cli_platform_invalid_value_fails deselected — Click 8.2 stderr issue on develop, unrelated).

Note for reviewers

Branched directly off current develop. No dependencies on other PRs.

Re-validated docs/tutorial-agent-workflow.md against current develop (291 lines). Single doc-only drift fixed: - Section 3 'Initialize AgentOps' set both AZURE_OPENAI_DEPLOYMENT and AZURE_AI_MODEL_DEPLOYMENT_NAME to the same value ('gpt-4o-mini'). _model_config() reads them as fallbacks of each other - setting both is redundant. Reduced to one, added a one-line note explaining the alias (same change as PR #158 for the Copilot-skills tutorial). Other claims verified against the code without re-deploying the container app: - The documented agentops.yaml shape (request_field, response_field, tool_calls_field at top level, plus thresholds for tool_call_accuracy / intent_resolution / task_adherence) parses cleanly via AgentOpsConfig.model_validate. - All six documented thresholds match real evaluators in src/agentops/core/evaluators.py (default thresholds at lines 82, 90, 181, 194, 212, 228). - The dataset shape (input + expected + tool_definitions + tool_calls) triggers the agent-evaluator set per the docstring in src/agentops/core/evaluators.py. - 'agentops workflow generate --kinds pr --force' still exists (--force here is generate's, not the deprecated skills install --force). - 'agentops doctor --severity-fail critical' is valid. Refs #130.

Dongbumlee closed this May 14, 2026

This was referenced May 14, 2026

docs(http-agent): deduplicate judge env var block (#129) #161

Merged

docs(end-to-end): correct init output tree and workflow file count (#135) #162

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs(agent-workflow): deduplicate judge env var block (#130)#159

docs(agent-workflow): deduplicate judge env var block (#130)#159
Dongbumlee wants to merge 1 commit into
developfrom
fix/issue-130-agent-workflow-rev2

Dongbumlee commented May 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Dongbumlee commented May 14, 2026

Summary

Drift fixed

What I verified without re-deploying the Container App

Tests

Note for reviewers

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant