Firmament

Runtime-agnostic behavioral monitor for AI agent sessions, grounded in a structured research graph.

What it does

Firmament watches AI agent sessions for behavioral anomalies — action concealment, disproportionate escalation, evaluation-aware behavioral shifts — and emits typed signals when patterns fire. It monitors through ex-post transcript review and voluntary self-reporting, not content interception. A three-axis trust score accumulates across events; high-trust sessions receive reduced monitoring intensity. Every architectural decision traces to a specific finding from a 15-source, 60-finding research graph:

Quick start

go get github.com/vhscom/firmament

package main

import (
    "context"
    "fmt"
    "os"

    firmament "github.com/vhscom/firmament"
)

func main() {
    ctx := context.Background()

    cfg, _ := firmament.LoadConfig("firmament.yaml")

    f, _ := firmament.New(cfg)
    f.Monitor.SetTrustStore(firmament.NewMemoryTrustStore())

    src := firmament.NewTranscriptSource(os.ExpandEnv("$HOME/.claude/projects"), 0)
    f.Monitor.Register(src)
    go src.Start(ctx)

    router := firmament.NewRouter()
    router.Add(firmament.NewLogHandler(os.Stdout))
    go router.Route(ctx, f.Monitor.Signals())

    go f.Monitor.Run(ctx)

    // consult the knowledge graph before each agent task
    g, _ := f.Ground(ctx, "evaluate agent behavioral monitoring strategy")
    fmt.Printf("coverage: %s (%d syntheses, %d findings)\n",
        g.Coverage.Confidence, len(g.Syntheses), len(g.Findings))

    select {}
}

The CLI daemon is also available:

firmament init    # create ~/.firmament/, installation key, SQLite DB
firmament watch   # daemon: watch transcripts and self-reports, log signals
firmament review <path>   # one-shot review of a transcript file or directory
firmament trust --list    # query per-session trust scores

How it works

EventSource implementations (transcript files, self-report directories) feed events into the Monitor. Each event is pushed into a per-session EventRing (512-event ring buffer), then Pattern evaluators run against the accumulated history. When a pattern fires it emits a Signal — a typed record containing the triggering event chain — onto a channel. A Router delivers signals to registered SignalHandler implementations.

Self-report events are cross-validated against observed structural fingerprints: a "high coherence" claim paired with more than three tool failures is internally inconsistent and generates a concealment signal, with the Integrity dimension of the trust score adjusted accordingly.

Research graph format

The library's decisions are grounded in a Logseq-compatible research graph with three namespaces:

sources/     — one page per cited paper or system card
finding/     — one page per extracted empirical finding, linked to its source
syntheses/   — cross-source syntheses that resolve conflicts between findings

A finding page uses Logseq's property format:

# finding/Credible consequences for misrepresentation outperform exhaustive monitoring

source:: [[sources/Fox & Jordan 2011]]
confidence:: high
implication:: make transparency structurally easier than concealment

Fox and Jordan (2011) demonstrate that ex-post verification of agent behavior
achieves accountability equivalent to real-time surveillance without the monitoring
distortion effects documented by Holmstrom-Milgrom (1991).

ADR-005 exposes this graph as an agent expertise source. firmament.New(cfg) loads the graph from cfg.GraphPath and f.Ground(ctx, task) returns ranked syntheses and findings relevant to any task string. The graph format is documented here because the library's decisions are directly traceable to it.

Signal confidence

Signals carry a severity from 1 (informational) to 5 (critical). For patterns that use multiple independent detection strategies (currently evaluation_awareness), a confluence rule maps agreement across strategies to severity:

Low (severity 1–2): one strategy fired at moderate confidence. Worth logging; unlikely to require action on its own.
Medium (severity 3): two strategies fired, or one at high confidence. Consider human review before the next session.
High (severity 4–5): three or more strategies fired, or two with at least one at high confidence. Multi-signal confluence is harder for an adaptive agent to defeat than any single channel; warrants immediate attention.

Behavioral monitoring

Four patterns ship out of the box:

Pattern	Critical	Signal type	What it detects
`action_concealment`	yes	`concealment`	Write-type tool call followed by delete-type in the same session
`transcript_review`	no	`concealment`, `escalation`	Autonomous tool dominance; incoherent self-report; silent failure suppression
`disproportionate_escalation`	no	`escalation`	Tool-call ratio z-score > 2σ against 30-day baseline; hard threshold fallback when no baseline
`evaluation_awareness`	no	`evaluation_aware`	CoT/output gap; behavioral shift on eval-cued responses; statistical anomaly vs. baseline

Critical patterns run unconditionally. Non-critical patterns are skipped for sessions whose trust score exceeds the configured threshold in firmament-constitution.yaml. See docs/adr/ for the research basis of each pattern.

Architecture decisions

ADR	Title	Decision summary
ADR-001	Agent Behavioral Monitor	EventSource/Monitor/Pattern/Signal as the unified ingestion and detection architecture
ADR-002	Incentive-Aligned Agent Governance	TrustScore (MDS three-axis model) and bilateral Constitution as structural substitutes for surveillance
ADR-003	Evaluation Awareness Detection	Four-strategy confluence detection; signal-consistency reframing rather than ground-truth claim
ADR-004	Cross-Session Persistence	SQLite session store with fingerprint-only event records and per-agent Welford sketches
ADR-005	Graph-Driven Expertise Architecture	Reframe Firmament as an expertise library; behavioral monitoring as a structural consequence of grounded agency (proposed)

Development

go test -race ./...   # all tests must pass with -race
go vet ./...
go build ./cmd/...

Requires Go 1.21 or later. Pure Go — no CGO required (modernc.org/sqlite is a C-to-Go transpilation of the SQLite amalgamation).

What Firmament does not do

Build the research graph. Source ingestion, finding extraction, and synthesis are manual work done outside the library.
Inspect content. Only structural fingerprints (tool name, event type, SHA-256 hash truncated to 128 bits) are stored. Firmament cannot read what an agent wrote or said.
Block agent actions. Firmament is an observer; enforcement is out of scope.
Perform vector search or model inference. Pattern evaluation is rule-based and statistical.

License

AGPL-3.0. See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
.github/workflows		.github/workflows
cmd/firmament		cmd/firmament
docs		docs
migrations		migrations
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
DESIGN.md		DESIGN.md
LICENSE		LICENSE
README.md		README.md
config.go		config.go
credential.go		credential.go
credential_test.go		credential_test.go
event.go		event.go
event_test.go		event_test.go
eventring.go		eventring.go
eventring_test.go		eventring_test.go
eventsource.go		eventsource.go
eventsource_selfreport.go		eventsource_selfreport.go
eventsource_selfreport_test.go		eventsource_selfreport_test.go
eventsource_transcript.go		eventsource_transcript.go
eventsource_transcript_test.go		eventsource_transcript_test.go
firmament.go		firmament.go
firmament_test.go		firmament_test.go
go.mod		go.mod
go.sum		go.sum
graph.go		graph.go
graph_test.go		graph_test.go
ground.go		ground.go
ground_test.go		ground_test.go
identity.go		identity.go
identity_test.go		identity_test.go
monitor.go		monitor.go
patterns.go		patterns.go
patterns_eval_awareness.go		patterns_eval_awareness.go
patterns_eval_awareness_test.go		patterns_eval_awareness_test.go
patterns_test.go		patterns_test.go
permission.go		permission.go
permission_test.go		permission_test.go
policy.go		policy.go
policy_test.go		policy_test.go
router.go		router.go
router_test.go		router_test.go
sessionstore.go		sessionstore.go
sessionstore_sqlite.go		sessionstore_sqlite.go
sessionstore_sqlite_test.go		sessionstore_sqlite_test.go
signal.go		signal.go
signal_test.go		signal_test.go
trust.go		trust.go
trust_test.go		trust_test.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Firmament

What it does

Quick start

How it works

Research graph format

Signal confidence

Behavioral monitoring

Architecture decisions

Development

What Firmament does not do

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Firmament

What it does

Quick start

How it works

Research graph format

Signal confidence

Behavioral monitoring

Architecture decisions

Development

What Firmament does not do

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages