feat: infer scenarios and probe sandbox status (#13)
📝 Walkthrough

This PR implements automatic scenario and sandbox backend inference for the …

Changes: Auto Scenario/Target Inference & Sandbox Probing
Sequence Diagram(s)

```mermaid
sequenceDiagram
    participant User
    participant CLI
    participant ResolveScenario as _resolve_scenario
    participant InferScenario as infer_scenario
    participant GetScenario as get_scenario
    participant ResolveBackend as _resolve_backend
    participant GetBackend as get_backend
    participant RunLogic as run_logic
    User->>CLI: nullstate run /path/to/iac --offline
    CLI->>ResolveScenario: _resolve_scenario(iac_dir, "auto")
    ResolveScenario->>InferScenario: infer_scenario(iac_dir)
    InferScenario->>InferScenario: read & pattern-match IaC files
    InferScenario-->>ResolveScenario: Scenario object or None
    ResolveScenario->>GetScenario: get_scenario(matched_name)
    GetScenario-->>ResolveScenario: Scenario
    ResolveScenario-->>CLI: resolved_scenario
    CLI->>ResolveBackend: _resolve_backend("auto", scenario.backend)
    ResolveBackend->>GetBackend: get_backend(scenario.backend)
    GetBackend-->>ResolveBackend: SandboxBackend
    ResolveBackend-->>CLI: resolved_backend
    CLI->>RunLogic: execute red/blue agents with inferred config
    RunLogic-->>User: findings.json with inferred scenario/target
```
Estimated code review effort: 🎯 3 (Moderate) | ⏱️ ~25 minutes

Possibly related PRs
🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)
✅ Passed checks (4 passed)
Actionable comments posted: 6
🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.
Inline comments:
In `@src/nullstate/cli.py`:
- Around line 341-362: The two helpers _resolve_scenario and _resolve_backend
lack return type annotations, preventing mypy from validating callers; add
explicit return types matching what get_scenario and get_backend return by
importing and using Scenario from .scenarios for _resolve_scenario and Backend
from .sandbox for _resolve_backend, then annotate the functions as def
_resolve_scenario(terraform_dir: Path, scenario: str) -> Scenario: and def
_resolve_backend(target: str, scenario_backend: str) -> Backend: while leaving
existing logic and error handling unchanged.
In `@src/nullstate/scenario_detection.py`:
- Around line 46-47: The function _looks_like_k8s_privileged_pod currently
treats any file containing "hostPath:" as a privileged-k8s signal; change the
logic so hostPath is only considered when Kubernetes markers are present:
require both "apiVersion:" and "kind:" to be in text, and then check for either
"privileged: true" or "hostPath:" (i.e., gate the hostPath check behind the
apiVersion/kind check) so non-Kubernetes files with hostPath do not trigger the
detector.
- Around line 50-54: The helper _looks_like_compose_exposed_admin currently
returns true for any compose file because it only checks filename and
"services:"; change it to detect admin-specific signals by parsing the compose
content (or using regex) to look for admin service names (e.g., "adminer",
"phpmyadmin", "portainer", "grafana", "kibana") and for explicit port/expose
mappings that bind typical admin ports (e.g., 80, 443, 8080, 3000, 9000, 5601)
under a service's "ports:" or "expose:" entries; only return True when at least
one admin service name or an exposed/admin port mapping is present, otherwise
return False. Ensure you update _looks_like_compose_exposed_admin to inspect
each service block in text_by_name values and match on service name keys and
port lines rather than just "services:".
- Around line 42-43: The helper _looks_like_aws_public_s3 is too permissive;
replace the broad substring check for "aws_s3_bucket" with a tighter heuristic:
return true if text contains "aws_s3_bucket_public_access_block" OR contains an
aws_s3_bucket resource declaration plus explicit public indicators (e.g.,
'resource "aws_s3_bucket"' together with 'acl = "public-read"' or 'acl =
"public-read-write"', a grant/acl that mentions "AllUsers" or
"AuthenticatedUsers", or an explicit public_access_block configuration set to
false). Update the function _looks_like_aws_public_s3 to scan for those combined
patterns instead of plain "aws_s3_bucket" so only buckets that are likely
publicly accessible trigger the aws-public-s3 scenario.
In `@tests/test_offline_scenario_runs.py`:
- Around line 118-120: The test uses next(runs_dir.iterdir()) and directly
indexes findings[0], which can raise StopIteration/FileNotFoundError or
IndexError instead of an assertion failure; change to collect the runs into a
list (e.g., runs = list(runs_dir.iterdir())) and assert its length equals the
expected count before selecting run_dir (e.g., run_dir = runs[0]), and add an
assertion that findings is non-empty (e.g., self.assertTrue(findings) or
self.assertGreater(len(findings), 0)) before accessing findings[0] so failures
produce clear assertion messages; update references to runs_dir and findings
accordingly.
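The suggested pattern could look like this self-contained sketch (the run-directory layout and `findings.json` name mirror the test under review, but the fixture setup here is hypothetical):

```python
import json
import tempfile
import unittest
from pathlib import Path


class ExampleRunTest(unittest.TestCase):
    def test_single_run_has_findings(self):
        # Hypothetical fixture standing in for an actual nullstate run.
        runs_dir = Path(tempfile.mkdtemp())
        run_dir = runs_dir / "run-1"
        run_dir.mkdir()
        (run_dir / "findings.json").write_text(json.dumps([{"id": "f1"}]))

        # Materialize the iterator and assert the count first, so a
        # missing run fails with an assertion, not StopIteration.
        runs = list(runs_dir.iterdir())
        self.assertEqual(len(runs), 1)
        findings = json.loads((runs[0] / "findings.json").read_text())
        # Guard against IndexError with a clear message before indexing.
        self.assertTrue(findings, "expected at least one finding")
        self.assertEqual(findings[0]["id"], "f1")
```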
In `@tests/test_scenario_detection.py`:
- Around line 26-28: The test currently accesses inferred.name without checking
for None; update the test around infer_scenario(demo_dir) to first assert that
inferred is not None (e.g., using self.assertIsNotNone(inferred,
f"infer_scenario returned None for {demo_dir} expected {expected}")), then
perform the existing self.assertEqual(inferred.name, expected); reference the
infer_scenario call and the inferred variable so the assertion failure will
clearly indicate which demo_dir/expected pair failed.
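The guarded assertion could be sketched as below; `infer_scenario` is a hypothetical stub here, standing in for the real function in `src/nullstate/scenario_detection.py`.

```python
import unittest


def infer_scenario(demo_dir):
    # Stub for illustration only; the real implementation pattern-matches
    # IaC files under demo_dir and may return None on no match.
    class Scenario:
        name = "aws-public-s3"
    return Scenario() if "aws" in str(demo_dir) else None


class ScenarioDetectionTest(unittest.TestCase):
    def test_inference(self):
        cases = {"examples/aws-public-s3": "aws-public-s3"}
        for demo_dir, expected in cases.items():
            inferred = infer_scenario(demo_dir)
            # Assert non-None first so a miss reports the failing pair
            # instead of raising AttributeError on None.
            self.assertIsNotNone(
                inferred,
                f"infer_scenario returned None for {demo_dir}, expected {expected}",
            )
            self.assertEqual(inferred.name, expected)
```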
🪄 Autofix (Beta)
Fix all unresolved CodeRabbit comments on this PR:
- Push a commit to this branch (recommended)
- Create a new PR with the fixes
ℹ️ Review info
⚙️ Run configuration
Configuration used: Organization UI
Review profile: ASSERTIVE
Plan: Pro
Run ID: 9a199d43-e2a6-4434-b6fd-38a0250f90de
📒 Files selected for processing (10)
CHANGELOG.md, README.md, docs/architecture.md, docs/runbook.md, src/nullstate/cli.py, src/nullstate/sandbox.py, src/nullstate/scenario_detection.py, tests/test_offline_scenario_runs.py, tests/test_sandbox.py, tests/test_scenario_detection.py
Actionable comments posted: 1
🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.
Inline comments:
In `@tests/test_cli_model_endpoint.py`:
- Line 82: The test uses a relative scenario path string
"examples/azure-public-blob" which can break when tests run from a different
CWD; resolve that path to an absolute path before passing it to the CLI. Update
the test in tests/test_cli_model_endpoint.py to compute the scenario path from
the test file location (e.g. using __file__ or Path(__file__).resolve().parent
and joining the repo/testroot up to the examples directory) and pass the
resulting absolute path to the CLI invocation instead of the raw relative
string.
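The path resolution described above might be done as follows; the exact repo layout (tests/ and examples/ as siblings under the repo root) is an assumption.

```python
from pathlib import Path

# Resolve the scenario path relative to this test file rather than the
# process CWD, so the test works regardless of where it is launched from.
# Layout assumed: <repo>/tests/test_cli_model_endpoint.py and <repo>/examples/.
REPO_ROOT = Path(__file__).resolve().parent.parent
scenario_path = REPO_ROOT / "examples" / "azure-public-blob"
# Pass str(scenario_path) to the CLI instead of the raw relative string.
```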
ℹ️ Review info
⚙️ Run configuration
Configuration used: Organization UI
Review profile: ASSERTIVE
Plan: Pro
Run ID: b1abba09-85b6-4a5a-bda7-b5525cde9679
📒 Files selected for processing (5)
CHANGELOG.md, README.md, docs/runbook.md, src/nullstate/cli.py, tests/test_cli_model_endpoint.py
all checks pass
Summary
- Defaults `nullstate run` to `--scenario auto` and `--target auto`
- Adds `nullstate sandbox status`, including LocalStack Docker and HTTP health checks

Evidence to capture before merge
- `python -m nullstate run examples/aws-public-s3 --offline` showing inferred scenario/target
- `python -m nullstate sandbox status localstack-azure` showing Docker running and HTTP 200, with license/session IDs redacted from any separate Docker logs

Verification
- `python -m unittest discover -s tests -v`
- `python -m ruff check src tests`
- `python -m mypy src`
- `python -m pip_audit . --skip-editable`
- `python -m nullstate run examples/aws-public-s3 --offline --runs-dir $env:TEMP\nullstate-auto-aws-run`
- `python -m nullstate sandbox status localstack-azure`
- `python -m nullstate run examples/azure-public-blob --offline --target localstack-azure --runs-dir $env:TEMP\nullstate-auto-azure-run`

Closes #12
Summary by CodeRabbit
New Features
- Automatic scenario and target inference in `nullstate run`, removing the need for explicit `--scenario`/`--target` in most cases.

Documentation
- Documented `--scenario auto`/`--target auto`, clarified `--offline` vs model usage, and documented `--mock-agents` deterministic fallback.

Tests