
Conversation

@dominicchapman (Member):

New "evaluation" content:

  • Updated workflow language from "Measure" to "Evaluate" to better reflect our approach
  • Reorganized evaluation content into a dedicated section with six focused pages (overview, setup, write evaluations, flags & experiments, run evaluations, analyze results); a minimal sketch of the write → run → analyze loop follows after the lists below

Other changes:

  • Concepts: Added definitions for flags and experiments; integrated AI capability architecture spectrum (single-turn → workflows → single-agent → multi-agent)
  • Create: De-emphasized experimental prompt management features while clarifying Axiom's current focus on evaluation and observability; added references to Vercel AI SDK examples and Mastra as framework alternatives
  • Iterate: Complete rewrite introducing the systematic improvement loop; added sections on user feedback capture and domain expert annotation workflows (marked as coming soon); reorganized failure categorization by severity for better prioritization
  • Quickstart: Updated to reference evaluation framework and CLI authentication; improved "What's next" guidance
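
To make the new workflow concrete for reviewers, here is a minimal, hypothetical sketch of the write → run → analyze loop the evaluation pages describe. It uses the Vercel AI SDK (one of the frameworks referenced in the Create page) with a made-up test set and a naive substring scorer; the case data, scorer, and reporting are assumptions for illustration, not Axiom's evaluation API.

```ts
import { generateText } from 'ai';
import { openai } from '@ai-sdk/openai';

// Hypothetical test cases for a single-turn capability; in the docs' terms,
// this corresponds to the "write evaluations" step.
const cases = [
  { input: 'Summarize: Axiom ingests and queries event data.', expected: 'event data' },
  { input: 'Summarize: Evals compare model output against expectations.', expected: 'compare' },
];

// A deliberately simple scorer: 1 if the expected substring appears, else 0.
// Real evaluations would use richer graders (exact match, rubric, LLM-as-judge).
function score(output: string, expected: string): number {
  return output.toLowerCase().includes(expected.toLowerCase()) ? 1 : 0;
}

async function runEvaluation() {
  const results = [];

  // "Run evaluations": execute each case against the model under test.
  for (const c of cases) {
    const { text } = await generateText({
      model: openai('gpt-4o-mini'),
      prompt: c.input,
    });
    results.push({ input: c.input, output: text, score: score(text, c.expected) });
  }

  // "Analyze results": report an aggregate score; a real setup would send
  // these results to an observability backend rather than the console.
  const mean = results.reduce((sum, r) => sum + r.score, 0) / results.length;
  console.log(JSON.stringify({ mean, results }, null, 2));
}

runEvaluation().catch(console.error);
```

The dedicated section in the docs breaks these three steps onto separate pages so each can grow independently (setup, graders, flags & experiments), but the loop itself stays this small.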

@dominicchapman changed the base branch from mano/evals to main on November 18, 2025 at 22:40
@dominicchapman (Member, Author):

Closing in favor of #473, which should generate a preview.
