docs: add MLOps-with-coding-agents how-to#246
Conversation
Follow-on to the on-prem coding agent guide. Covers four agent-driven MLOps workflows: managing InferenceService and LLMInferenceService resources, configuring authentication and rate limiting on Envoy AI Gateway, an iterative agent-driven performance tuning loop, and reusable templates for fine-tuning plans and post-run reports. Links to the existing fine-tuning paths (Workbench Notebook, Training Hub, Kubeflow Trainer v2, LLM Compressor) and to the Envoy AI Gateway install doc. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
|
Important Review skippedAuto reviews are disabled on base/target branches other than the default branch. 🗂️ Base branches to auto review (3)
Please check the settings in the CodeRabbit UI or the ⚙️ Run configurationConfiguration used: Organization UI Review profile: CHILL Plan: Pro Run ID: You can disable this status message by setting the Use the checkbox below for a quick retry:
✨ Finishing Touches🧪 Generate unit tests (beta)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
|
Superseded — both docs are now in #245. Closing. |
Summary
Follow-on to #245. Adds a second how-to: Run MLOps with Coding Agents and On-Premise LLMs under
model_inference/inference_service/how_to. Stacked on the PR1 branch — once #245 lands, GitHub will auto-retarget this PR tomaster.The doc focuses on four agent-driven MLOps workflows on Alauda AI:
InferenceService/LLMInferenceService— agent loop (draft → server-side dry-run → apply → poll → smoke-test), with concrete starter prompts.AIGatewayRoute,AIServiceBackend,BackendSecurityPolicy,SecurityPolicy, andBackendTrafficPolicy; links to the existing intro/install docs.Also includes a "daily MLOps loop" walk-through, guardrails (read-only first,
--dry-run=server, one change per iteration, never fabricate metrics, no hosted-provider fallback), and a References section.The amended PR1 commit adds a
\{#best-practices}anchor to the prior doc so this doc's deep links resolve underdoom lint.Test plan
doom lintpasses on the new file (0 errors, 0 warnings)yarn lintpasses./*.mdx,../../../envoy_ai_gateway/*,../../../kubeflow/how_to/*,../../../workbench/how_to/*,../../../llm-compressor/how_to/*,../../../infrastructure_management/hardware_profile/*) resolve to files that exist on the stacked base🤖 Generated with Claude Code