Nous

Metacognitive evaluation module for the Life Agent OS -- the "Pepe Grillo" that judges agent behavior in real time, feeding quality signals back into homeostatic control loops.

Nous (Greek: mind, intellect) provides a hybrid evaluation architecture: fast inline heuristics (< 2ms) that run as Arcan middleware, plus optional async LLM-as-judge evaluators for deeper quality assessment.

Architecture

                    +------------------+
                    |   Arcan (Agent)  |
                    +--------+---------+
                             |
              +--------------+--------------+
              |                             |
     +--------v---------+         +--------v---------+
     | nous-middleware   |         |    nous-api      |
     | (Arcan Middleware)|         |  (HTTP daemon)   |
     +--------+---------+         +--------+---------+
              |                             |
    +---------+---------+         +---------+---------+
    |                   |         |                   |
+---v---+         +-----v---+  +-v--------+    +-----v----+
| nous- |         | nous-   |  | nous-    |    | nous-    |
| heur- |         | core    |  | judge    |    | lago     |
| istics|         | (types) |  | (LLM)   |    | (events) |
+-------+         +---------+  +----------+    +----------+
                                                     |
                                              +------v------+
                                              |    Lago     |
                                              +-------------+

Hybrid Deployment

Path	Latency	Where	What
Inline heuristics	< 2ms	Arcan middleware hooks	Token efficiency, budget adherence, tool correctness, safety compliance
Async LLM-as-judge	1-10s	`nousd` daemon	Plan quality, plan adherence, task completion, reasoning depth

5 Evaluator Layers

Safety -- Does the action violate policy constraints?
Correctness -- Are tool arguments valid? Are results well-formed?
Efficiency -- Token usage, step count, budget adherence
Quality -- Plan coherence, reasoning depth, task completion
Alignment -- Does behavior match the agent's soul and beliefs (Anima)?

Crates

Crate	Purpose
`nous-core`	Types, traits, errors (zero I/O). `NousEvaluator` trait, `EvalScore`, `EvalResult`, `EvalLayer` taxonomy
`nous-heuristics`	Inline evaluators: token efficiency, budget adherence, tool correctness, argument validity, safety compliance, step efficiency
`nous-middleware`	`NousMiddleware: impl arcan_core::Middleware` -- wires evaluators into the Arcan agent loop
`nous-judge`	Async LLM-as-judge: plan quality, plan adherence, task completion
`nous-lago`	Lago persistence bridge for eval events (`eval.*` namespace)
`nous-api`	HTTP API (axum): `/eval/{session}`, `/eval/run`
`nousd`	Daemon binary

Quick Start

# Run the evaluation daemon
cargo run -p nousd

# With Lago persistence
cargo run -p nousd -- --lago-data-dir /path/to/data

Integration with Autonomic

Nous feeds evaluation scores into the Autonomic homeostasis controller:

Inline heuristics --> direct fold into Autonomic state (synchronous)
Async LLM judge  --> Lago events --> Autonomic subscription (asynchronous)

When quality scores drop below thresholds, Autonomic can trigger recovery actions: switching operating modes, adjusting budgets, or requesting human intervention.

Event Namespace

All events use EventKind::Custom with prefix "eval.":

eval.heuristic_result -- inline evaluator output
eval.judge_result -- LLM-as-judge output
eval.score_aggregated -- combined score across layers

Scores are OTel-aligned: emitting gen_ai.evaluation.result span events for observability via Vigil.

Build and Test

# Full verification
cargo fmt && cargo clippy --workspace -- -D warnings && cargo test --workspace

# Run tests only
cargo test --workspace

# Build release
cargo build --workspace --release

Dependency Order

aios-protocol (canonical contract)
    |
nous-core (types + traits, zero I/O)
    |           \              \
nous-heuristics  nous-judge    nous-lago (+ lago-core, lago-journal)
    |           /
nous-middleware (+ arcan-core)
    |
nous-api (axum)
    |
nousd (binary)

Documentation

Full documentation: docs.broomva.tech/docs/life/nous

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.github/workflows		.github/workflows
crates		crates
fixtures/golden		fixtures/golden
CLAUDE.md		CLAUDE.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md
deny.toml		deny.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Nous

Architecture

Hybrid Deployment

5 Evaluator Layers

Crates

Quick Start

Integration with Autonomic

Event Namespace

Build and Test

Dependency Order

Documentation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Nous

Architecture

Hybrid Deployment

5 Evaluator Layers

Crates

Quick Start

Integration with Autonomic

Event Namespace

Build and Test

Dependency Order

Documentation

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages