feat: add kosli evaluate input subcommand by tooky · Pull Request #743 · kosli-dev/cli

tooky · 2026-03-28T14:53:37Z

Summary

Adds kosli evaluate input — evaluate a local JSON file (or stdin) against a Rego policy, with no API dependency
Enables local policy development and testing without needing a running Kosli server
Input shape is opaque to the command — the policy defines what it expects

Prompted by this discussion where we realised kosli evaluate trail always hits the API, so there's no way to iterate on policies locally. Dan raised conftest as the alternative, but our tooling should support this natively — Rego is our language choice, and local testing should be turnkey.

In response to Alex's comparison of Rego vs pipeline controls — as more controls move into kosli evaluate, a fast local feedback loop becomes essential.

Usage

# evaluate a local JSON file against a policy:
kosli evaluate input \
    --input-file trail-data.json \
    --policy policy.rego

# read from stdin:
cat trail-data.json | kosli evaluate input --policy policy.rego

# inspect what the policy sees:
kosli evaluate input \
    --input-file trail-data.json \
    --policy policy.rego \
    --show-input --output json

Test plan

All new tests pass (go test -v -run TestEvaluateInputCommandTestSuite ./cmd/kosli/)
loadInput(io.Reader) unit tests pass
make lint clean
Manual: kosli evaluate input --help shows expected output
Manual: evaluate a saved --show-input JSON against a policy

🤖 Generated with Claude Code

claude · 2026-03-28T14:54:06Z

Claude finished @tooky's task in 4m 27s —— View job

PR Review: `kosli evaluate input` subcommand

Run static analysis (build, vet, mod tidy)
Read changed files and review code
Post review feedback

Static Analysis

All three checks pass cleanly: go build ./..., go vet ./..., and go mod tidy with no diff. No issues here.

Code Review

Overall: This is a well-structured PR. The command follows established codebase patterns, the TDD slice approach is clearly documented in TODO.md, and the review feedback from earlier rounds has been addressed thoughtfully. A few observations:

What's good

Clean reuse of evaluateAndPrintResult() — no duplication of the evaluation/output pipeline.
Embedding commonEvaluateOptions with hidden flow/attestations flags is a pragmatic solution to share flag registration without exposing API-specific flags.
cmd.InOrStdin() makes stdin testable via cmd.SetIn(), and the test harness already sets root.SetIn(new(bytes.Buffer)) (testHelpers.go:66), so the empty-stdin test reliably hits loadInput rather than term.IsTerminal.
Good test coverage — 11 command-level test cases covering allow, deny, missing flags, invalid input, policy validation errors, JSON output, and show-input. Plus standalone loadInput unit tests.
Help text examples include the fixture-capture workflow (jq '.input'), directly addressing Alex's feedback about documenting how to produce the input file.

Issues to address

invalid.rego test case missing — There are test fixtures for no-package-policy.rego, no-allow-rule.rego, and deny-no-violations.rego, but the existing invalid.rego (syntax error fixture) isn't exercised through evaluate input. A test case proving that a malformed Rego file produces a clear error would round out the policy validation coverage.
evaluateTrail.go / evaluateTrails.go changes not shown in diff — The changed files list mentions +6/-1 in each, but the TODO mentions "Remove 'using OPA' from all evaluate command long descriptions". Worth confirming the OPA references were cleaned up consistently — I see evaluate.go is clean, but couldn't verify the trail commands' long descriptions changed correctly since the diff wasn't available against origin/main.

Minor / non-blocking

loadInputFromFile close error handling (evaluateInput.go:105-108) — The named return with deferred close-error capture is correct but slightly unusual for this codebase. It works fine; just noting it's a more careful pattern than most of the surrounding code uses.
Test name accuracy (evaluateInput_test.go:48) — "missing --input-file reads from stdin (empty stdin fails)" — this test actually exercises the loadInput error path (since bytes.Buffer isn't *os.File, the terminal check is skipped). The name could be slightly more precise, e.g., "missing --input-file with empty stdin fails to parse". Very minor.
Alex's --data suggestion — Tracked externally per his comment, not a blocker for this PR. Worth a TODO.md entry if the team wants to pursue it.

Summary

Clean, well-tested addition that fills a real gap in the local policy development workflow. The code follows existing patterns, static analysis is clean, and the iterative review process addressed all prior feedback. The one actionable item is adding a test case for invalid.rego syntax errors through the evaluate input command path.

cmd/kosli/evaluateInput.go

tooky · 2026-03-28T15:01:28Z

Addressed the review feedback in 841836c:

1. TODO.md duplicate — Removed the unchecked duplicate Slice 3 line.

2. Parent evaluate command description — Broadened the Short from "Evaluate Kosli trail data" to "Evaluate data" and rewrote the Long to mention both API-backed trail evaluation and local input evaluation. kosli evaluate --help now reads:

Use evaluate trail or evaluate trails to fetch data from Kosli and evaluate it.
Use evaluate input to evaluate a local JSON file or stdin without any API calls.

3. Flag registration duplication — Left as-is per the review (non-blocking). Three flags registered by hand is simpler than embedding commonEvaluateOptions and hiding flow/attestations.

4. --input-file in long desc — Left as-is. The flag is already documented in the flag help text and the "When --input-file is omitted" paragraph.

cmd/kosli/evaluateInput.go

AlexKantor87 · 2026-03-30T18:25:39Z

From Alex — feedback on kosli evaluate input

This is a strong addition that directly addresses a gap we've been working around in the agentic SDLC demo. We run 10 control gates in CI, each calling kosli evaluate trails against Rego policies — and every policy iteration currently requires a live Kosli org with real trail data. Our kosli-evaluate skill already documents a workaround using raw opa eval, but having this native in the CLI is much better.

How we'd use this immediately:

Local policy development — capture a fixture with evaluate trail --show-input, then iterate with evaluate input instead of needing API access and an OPA install
CI optimization — fetch trail data once and evaluate all 10 control policies against the same JSON instead of 10 independent API round-trips
Pipeline debugging — test policy fixes against captured data before pushing and re-running the full pipeline

A few suggestions:

Add a fixture capture example to the help text. The examples show --input-file trail-data.json but don't show how to produce that file. Something like: kosli evaluate trail TRAIL --policy allow-all.rego --flow FLOW --show-input --output json > trail-data.json would make the workflow self-documenting.
Clarify the --show-input wrapper. When you capture with --show-input --output json, the result has an outer structure with allow, violations, and input keys. Should evaluate input receive the full output, or just the .input portion? The examples would benefit from being explicit about this.
Consider --data as a follow-up slice. Since this is the offline path, it's a natural place to add external config support (budget thresholds, expected counts, etc.). We currently hardcode these values in Rego because kosli evaluate has no --data flag. Not a blocker for this PR, but worth tracking.

Implementation looks clean — good reuse of evaluateAndPrintResult(), solid test coverage, and the stdin piping support is a nice touch.

cmd/kosli/evaluateInput.go

OPA is an implementation detail. The other commands say 'Rego policy' consistently — align all three evaluate subcommands to match. Addresses Dan's review comment on PR #743. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Tests for missing `package policy`, missing `allow` rule, and `allow = false` without a `violations` rule. Documents the expected error messages and behaviour through `evaluate input`. Addresses Dan's review comments on PR #743. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Show how to capture trail data with --show-input and extract .input with jq for local policy iteration. Clarify that the input file should contain the raw JSON object, not the --show-input wrapper. Addresses Alex's review suggestions on PR #743. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

cmd/kosli/evaluate.go

cmd/kosli/evaluateInput.go

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Avoid restating the short description; the second sentence now adds new information (file or stdin) rather than repeating. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

loadInputFromFile now delegates to loadInput after opening the file, eliminating the duplicated JSON unmarshal logic. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- Remove duplicate Slice 3 entry in TODO.md - Broaden parent evaluate command description to cover both API-backed trail evaluation and local input evaluation Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

OPA is an implementation detail. The other commands say 'Rego policy' consistently — align all three evaluate subcommands to match. Addresses Dan's review comment on PR #743. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Tests for missing `package policy`, missing `allow` rule, and `allow = false` without a `violations` rule. Documents the expected error messages and behaviour through `evaluate input`. Addresses Dan's review comments on PR #743. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Show how to capture trail data with --show-input and extract .input with jq for local policy iteration. Clarify that the input file should contain the raw JSON object, not the --show-input wrapper. Addresses Alex's review suggestions on PR #743. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Replace the os.Stdin global with an injected io.Reader parameter, passed via Cobra's InOrStdin(). This makes the stdin path testable with controlled input via cmd.SetIn(). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Remove flag registration duplication by embedding the shared struct. Hide flow and attestations flags that don't apply to evaluate input. Move RequireFlags out of addFlags so each command specifies its own required flags independently. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Co-authored-by: claude[bot] <209825114+claude[bot]@users.noreply.github.com>

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Avoids the double-allocation from io.ReadAll + json.Unmarshal. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

go.mod

cmd/kosli/evaluateInput.go

claude · 2026-04-01T16:32:54Z

cmd/kosli/evaluateInput_test.go

+			name:        "missing --input-file reads from stdin (empty stdin fails)",
+			cmd:         "evaluate input --policy testdata/policies/allow-all.rego",
+			goldenRegex: `failed to parse input:`,
+		},


The test at line 48–51 ("missing --input-file reads from stdin (empty stdin fails)") — since the terminal detection was added in Slice 5, does this test still exercise the loadInput error path, or does it now hit the term.IsTerminal check first? In CI the stdin is likely not a terminal so it should still reach loadInput, but the test name and golden regex (failed to parse input) might not match the actual error in all environments.

Consider adding a comment noting this assumption, or explicitly using cmd.SetIn() with an empty bytes.Buffer to guarantee the loadInput path is tested regardless of CI environment.

When --input-file is omitted and stdin is a terminal, return a clear error instead of silently blocking. Uses golang.org/x/term to detect the terminal. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

go.mod

claude · 2026-04-01T16:36:14Z

cmd/kosli/evaluateInput_test.go

+			name:        "missing --input-file reads from stdin (empty stdin fails)",
+			cmd:         "evaluate input --policy testdata/policies/allow-all.rego",
+			goldenRegex: `failed to parse input:`,
+		},


Minor: This test implicitly relies on CI stdin not being a terminal. If CI ever allocates a PTY, it would hit the term.IsTerminal check instead of the loadInput error path. Consider explicitly providing an empty reader via a helper that calls cmd.SetIn(&bytes.Buffer{}) to guarantee this tests the loadInput path regardless of environment.

go mod tidy resolved conflict markers left from rebase and updated golang.org/x/term to v0.41.0. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Set root.SetIn(new(bytes.Buffer)) in executeCommandC so tests get an empty buffer as stdin rather than inheriting the process stdin. This ensures the empty-stdin test exercises loadInput reliably regardless of whether CI stdin is a terminal. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

claude · 2026-04-01T16:45:06Z

cmd/kosli/evaluateInput_test.go

+			name:        "deny without violations rule returns DENIED with no violation messages",
+			cmd:         "evaluate input --input-file testdata/evaluate/trail-input.json --policy testdata/policies/deny-no-violations.rego",
+			goldenRegex: `RESULT:\s+DENIED`,
+		},


Consider adding a test case for invalid.rego (the syntax-error fixture that already exists in testdata/policies/). This would round out the policy validation coverage alongside the no-package-policy, no-allow-rule, and deny-no-violations cases:

{ wantError: true, name: "syntactically invalid policy returns error", cmd: "evaluate input --input-file testdata/evaluate/trail-input.json --policy testdata/policies/invalid.rego", goldenRegex: `failed to evaluate policy`, },

(Adjust the expected regex to match the actual error from evaluate.Evaluate for malformed Rego.)

claude bot reviewed Mar 28, 2026

View reviewed changes

cmd/kosli/evaluateInput.go Show resolved Hide resolved

claude bot reviewed Mar 28, 2026

View reviewed changes

cmd/kosli/evaluateInput.go Show resolved Hide resolved

claude bot reviewed Mar 28, 2026

View reviewed changes

cmd/kosli/evaluateInput.go Outdated Show resolved Hide resolved

dangrondahl reviewed Apr 1, 2026

View reviewed changes

cmd/kosli/evaluateInput.go Outdated Show resolved Hide resolved

cmd/kosli/evaluateInput.go Show resolved Hide resolved

cmd/kosli/evaluateInput.go Show resolved Hide resolved

tooky force-pushed the evaluate-input branch from 841836c to b0e5435 Compare April 1, 2026 12:56

claude bot reviewed Apr 1, 2026

View reviewed changes

cmd/kosli/evaluateInput.go Outdated Show resolved Hide resolved

claude bot reviewed Apr 1, 2026

View reviewed changes

cmd/kosli/evaluateInput.go Show resolved Hide resolved

claude bot reviewed Apr 1, 2026

View reviewed changes

cmd/kosli/evaluateInput.go Outdated Show resolved Hide resolved

claude bot reviewed Apr 1, 2026

View reviewed changes

cmd/kosli/evaluate.go Outdated Show resolved Hide resolved

claude bot reviewed Apr 1, 2026

View reviewed changes

cmd/kosli/evaluate.go Outdated Show resolved Hide resolved

claude bot reviewed Apr 1, 2026

View reviewed changes

cmd/kosli/evaluate.go Outdated Show resolved Hide resolved

claude bot reviewed Apr 1, 2026

View reviewed changes

cmd/kosli/evaluate.go Show resolved Hide resolved

claude bot reviewed Apr 1, 2026

View reviewed changes

cmd/kosli/evaluateInput.go Show resolved Hide resolved

claude bot reviewed Apr 1, 2026

View reviewed changes

cmd/kosli/evaluateInput.go Outdated Show resolved Hide resolved

tooky enabled auto-merge (squash) April 1, 2026 16:21

claude bot reviewed Apr 1, 2026

View reviewed changes

cmd/kosli/evaluateInput.go Show resolved Hide resolved

claude bot reviewed Apr 1, 2026

View reviewed changes

cmd/kosli/evaluateInput.go Show resolved Hide resolved

tooky and others added 7 commits April 1, 2026 17:29

chore: add evaluate input test list to TODO.md

2f9777e

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

green: evaluate input exists as a subcommand with required flags

f3910a2

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

green: evaluate input loads file and evaluates against allow-all policy

327aa7c

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

green: evaluate input handles deny, errors, JSON output, and show-input

d34c534

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

chore: mark evaluate input slice 1 complete, start slice 2

c60384f

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

green: extract loadInput(io.Reader) for stdin support

71befc7

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

green: evaluate input reads from stdin when --input-file not provided

3072fe0

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

tooky and others added 13 commits April 1, 2026 17:29

refactor: tighten evaluate input long description opening

e331f83

Avoid restating the short description; the second sentence now adds new information (file or stdin) rather than repeating. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

refactor: remove duplication between loadInputFromFile and loadInput

7964433

loadInputFromFile now delegates to loadInput after opening the file, eliminating the duplicated JSON unmarshal logic. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

fix: address PR review feedback

a46f555

- Remove duplicate Slice 3 entry in TODO.md - Broaden parent evaluate command description to cover both API-backed trail evaluation and local input evaluation Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

chore: mark PR review feedback slice complete

d414cc0

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Update cmd/kosli/evaluate.go

67191b1

Co-authored-by: claude[bot] <209825114+claude[bot]@users.noreply.github.com>

Update cmd/kosli/evaluate.go

8e293f4

Co-authored-by: claude[bot] <209825114+claude[bot]@users.noreply.github.com>

fix: remove duplicate line in evaluate long description

477eb04

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

refactor: use json.NewDecoder in loadInput

d59732c

Avoids the double-allocation from io.ReadAll + json.Unmarshal. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

tooky force-pushed the evaluate-input branch from d006bfd to f93a228 Compare April 1, 2026 16:30

claude bot reviewed Apr 1, 2026

View reviewed changes

go.mod Outdated Show resolved Hide resolved

claude bot reviewed Apr 1, 2026

View reviewed changes

cmd/kosli/evaluateInput.go Show resolved Hide resolved

claude bot reviewed Apr 1, 2026

View reviewed changes

cmd/kosli/evaluateInput.go Show resolved Hide resolved

claude bot reviewed Apr 1, 2026

View reviewed changes

cmd/kosli/evaluateInput.go Show resolved Hide resolved

claude bot reviewed Apr 1, 2026

View reviewed changes

cmd/kosli/evaluateInput.go Show resolved Hide resolved

claude bot reviewed Apr 1, 2026

View reviewed changes

green: error when stdin is a terminal with no input piped

001dbdd

When --input-file is omitted and stdin is a terminal, return a clear error instead of silently blocking. Uses golang.org/x/term to detect the terminal. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

tooky force-pushed the evaluate-input branch from f93a228 to 001dbdd Compare April 1, 2026 16:33

claude bot reviewed Apr 1, 2026

View reviewed changes

go.mod Outdated Show resolved Hide resolved

claude bot reviewed Apr 1, 2026

View reviewed changes

tooky and others added 2 commits April 1, 2026 17:36

fix: resolve go.mod merge conflict and update go.sum

48688e3

go mod tidy resolved conflict markers left from rebase and updated golang.org/x/term to v0.41.0. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

claude bot reviewed Apr 1, 2026

View reviewed changes

AlexKantor87 approved these changes Apr 2, 2026

View reviewed changes

tooky merged commit 8687eaa into main Apr 2, 2026
17 checks passed

tooky deleted the evaluate-input branch April 2, 2026 04:58

Conversation

tooky commented Mar 28, 2026

Summary

Usage

Test plan

Uh oh!

claude bot commented Mar 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Review: kosli evaluate input subcommand

Static Analysis

Code Review

What's good

Issues to address

Minor / non-blocking

Summary

Uh oh!

Uh oh!

tooky commented Mar 28, 2026

Uh oh!

Uh oh!

Uh oh!

AlexKantor87 commented Mar 30, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

claude bot Apr 1, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

claude bot Apr 1, 2026

Choose a reason for hiding this comment

Uh oh!

claude bot Apr 1, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

claude bot commented Mar 28, 2026 •

edited

Loading

PR Review: `kosli evaluate input` subcommand