Reuse staged Linux snapshots by content hash #571
Conversation
📝 Walkthrough

The changes implement content-addressed snapshot and runner flake reuse by computing SHA-256 content hashes, storing them in JSON metadata files, and modifying executor and run logic to match local and remote hashes for selective reuse, failing on mismatches.
Sequence Diagram

```mermaid
sequenceDiagram
    participant Local as Local Build
    participant Executor as Remote Executor
    participant Remote as Remote Host
    participant Metadata as Remote Metadata File
    Local->>Local: Compute snapshot content_hash
    Local->>Executor: Initiate remote execution
    Executor->>Remote: Check if remote snapshot dir exists
    Remote-->>Executor: Path exists/not exists
    alt Snapshot exists
        Executor->>Metadata: Read remote pikaci-snapshot.json
        Metadata-->>Executor: Return remote content_hash
        Executor->>Executor: Compare hashes
        alt Hashes match
            Executor-->>Local: Reuse existing remote snapshot
        else Hashes mismatch
            Executor-->>Local: Fail with metadata mismatch error
        end
    else Snapshot missing
        Local->>Remote: Sync snapshot to remote
        Local->>Metadata: Write content_hash to remote metadata
        Executor-->>Local: Snapshot synced and ready
    end
```
Estimated Code Review Effort: 🎯 4 (Complex) | ⏱️ ~50 minutes
🚥 Pre-merge checks: ✅ 2 passed | ❌ 1 failed (1 warning)
Actionable comments posted: 4
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (2)
crates/pikaci/src/executor.rs (2)
1319-1374: ⚠️ Potential issue | 🔴 Critical: Publish hash-addressed snapshots atomically.

Line 1321 treats `pikaci-snapshot.json` in the final `snapshots/<hash>/snapshot` directory as the reuse signal, but Lines 1367-1373 populate that same shared directory in place. Two matching runs can now race: one can observe the metadata before extraction finishes, or `rm -rf` the shared directory while another run is already using it. Please stage into a temp dir and atomically rename/create the ready marker only after the sync completes, or guard per-hash publication with a remote lock.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@crates/pikaci/src/executor.rs` around lines 1319 - 1374, The code is using the presence of ready_marker (pikaci-snapshot.json in remote_snapshot_dir) as the reuse signal while also populating that same remote_snapshot_dir in-place, causing race conditions; change the publish flow so sync_directory_to_remote writes the snapshot into a temp/staging directory on the remote (e.g. remote_snapshot_dir.tmp or use a unique temp path), complete all extraction/sync there, and only after successful sync atomically create/rename the ready marker or rename the staged directory into remote_snapshot_dir (or acquire a per-hash remote lock) so that load_remote_snapshot_metadata/read_snapshot_metadata see the ready marker only after the snapshot is fully published; update calls around ready_marker, read_snapshot_metadata, load_remote_snapshot_metadata, append_line and sync_directory_to_remote accordingly to use the staging path and atomic rename/marker creation.
1661-1727: ⚠️ Potential issue | 🔴 Critical: Serialize writes to `runner-flakes/<hash>`.

`remote_flake_root` is now shared across runs by content hash, but on a miss we still sync into the final directory with the same destructive `rm -rf` flow. Two runs building the same flake hash can delete the flake out from under each other between sync, build, and metadata write, causing nondeterministic remote builds. This cache needs the same temp-dir + atomic publish or per-hash locking strategy as the snapshot cache.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@crates/pikaci/src/executor.rs` around lines 1661 - 1727, The remote "runner-flakes/<hash>" path (remote_flake_root / flake_hash) can be clobbered by concurrent runs; change the publish flow in the remote flake path handling (symbols: remote_flake_root, remote_flake_dir, sync_directory_to_remote, remote_symlink, load_remote_runner_flake_metadata, remote_store_path) to serialize writes by using a per-hash atomic publish or lock: on miss, create a unique temp directory under remote_work_dir (e.g. runner-flakes/.tmp-<uuid>), sync into that temp with sync_directory_to_remote, perform any build/verify steps, write the metadata into the temp, then atomically rename/move the temp to the final directory (or acquire a per-hash lock on flake_hash before syncing and release after metadata write); also re-check load_remote_runner_flake_metadata after acquiring the lock/after rename to avoid racing with concurrent publishers.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@crates/pikaci/src/executor.rs`:
- Around line 1221-1229: remote_artifacts_dir is currently set to
shared_job_dir.join("artifacts") which places artifacts under
`${remote_work_dir}/jobs/${job.id}` and can collide between runs; change it to
be scoped to the run by using remote_job_dir.join("artifacts") (or include
run_id in the path) when constructing the RemoteMicrovmContext so artifacts live
under the same `${remote_work_dir}/runs/${run_id}/...` as the rest of the state;
update any related logic that assumes the old path (e.g.,
reset_remote_microvm_artifacts) to reference
RemoteMicrovmContext.remote_artifacts_dir accordingly.
In `@crates/pikaci/src/run.rs`:
- Around line 3104-3118: staged_linux_remote_snapshot_dir is being called with a
potentially nested installable path which causes it to read pikaci-snapshot.json
from the wrong location; instead, call staged_linux_remote_snapshot_dir with the
actual snapshot root (i.e., the non-nested "snapshot" root) to compute
remote_snapshot_root, then compute remote_snapshot_dir by appending the nested
suffix (using snapshot_relative.strip_prefix("snapshot") -> nested ->
remote_snapshot_root.join(nested)); also ensure the later sync that uses
remote_snapshot_root (lines ~3139-3144) still syncs the root so the copied tree
matches remote_installable.
In `@crates/pikaci/src/snapshot.rs`:
- Around line 222-251: The snapshot hash currently omits file permission bits
and does not length-prefix file contents; update hash_snapshot_tree so that for
files (metadata.is_file()) you include the file's mode/permission bits (from
metadata.permissions()/platform-specific mode) and the file length
(metadata.len()) into the hasher before streaming content, and for symlinks
include any relevant permission/mode info from the symlink's metadata as well;
keep using the existing hasher.update(...) calls and the existing relative/path
markers and separators so you only add the mode and length values (as
deterministic byte sequences) to prevent chmod-only or delimiter-collision
collisions.
In `@todos/pikaci-staged-ci-plan.md`:
- Line 910: The remote artifacts directory is currently job-scoped and reset
destructively by reset_remote_microvm_artifacts(), enabling concurrent runs to
clobber each other; change the artifacts root to be run-scoped by including the
run identifier (e.g., run.id or run.uuid) in the path used when
mounting/creating artifacts under
'/var/tmp/pikaci-prepared-output/jobs/<job.id>/artifacts' (or alternatively
remove the destructive reset in reset_remote_microvm_artifacts() and make that
function operate on a run-specific path), and ensure any code that references
the runner flake cache still uses its content-addressed path while writable
artifacts use the new per-run path.
---
Outside diff comments:
In `@crates/pikaci/src/executor.rs`:
- Around line 1319-1374: The code is using the presence of ready_marker
(pikaci-snapshot.json in remote_snapshot_dir) as the reuse signal while also
populating that same remote_snapshot_dir in-place, causing race conditions;
change the publish flow so sync_directory_to_remote writes the snapshot into a
temp/staging directory on the remote (e.g. remote_snapshot_dir.tmp or use a
unique temp path), complete all extraction/sync there, and only after successful
sync atomically create/rename the ready marker or rename the staged directory
into remote_snapshot_dir (or acquire a per-hash remote lock) so that
load_remote_snapshot_metadata/read_snapshot_metadata see the ready marker only
after the snapshot is fully published; update calls around ready_marker,
read_snapshot_metadata, load_remote_snapshot_metadata, append_line and
sync_directory_to_remote accordingly to use the staging path and atomic
rename/marker creation.
- Around line 1661-1727: The remote "runner-flakes/<hash>" path
(remote_flake_root / flake_hash) can be clobbered by concurrent runs; change the
publish flow in the remote flake path handling (symbols: remote_flake_root,
remote_flake_dir, sync_directory_to_remote, remote_symlink,
load_remote_runner_flake_metadata, remote_store_path) to serialize writes by
using a per-hash atomic publish or lock: on miss, create a unique temp directory
under remote_work_dir (e.g. runner-flakes/.tmp-<uuid>), sync into that temp with
sync_directory_to_remote, perform any build/verify steps, write the metadata
into the temp, then atomically rename/move the temp to the final directory (or
acquire a per-hash lock on flake_hash before syncing and release after metadata
write); also re-check load_remote_runner_flake_metadata after acquiring the
lock/after rename to avoid racing with concurrent publishers.
ℹ️ Review info
⚙️ Run configuration
Configuration used: defaults
Review profile: CHILL
Plan: Pro
Run ID: 2147cd98-4600-4936-8257-57837fe54c63
⛔ Files ignored due to path filters (1)
`Cargo.lock` is excluded by `!**/*.lock`
📒 Files selected for processing (5)
- crates/pikaci/Cargo.toml
- crates/pikaci/src/executor.rs
- crates/pikaci/src/run.rs
- crates/pikaci/src/snapshot.rs
- todos/pikaci-staged-ci-plan.md
```diff
 let remote_job_dir = remote_run_dir.join("jobs").join(job.id);
+let shared_job_dir = remote_work_dir.join("jobs").join(job.id);
 Ok(RemoteMicrovmContext {
     remote_host,
-    remote_snapshot_dir: remote_run_dir.join("snapshot"),
     remote_work_dir: remote_work_dir.clone(),
+    remote_snapshot_dir,
     remote_vm_dir: remote_job_dir.join("vm"),
-    remote_artifacts_dir: remote_job_dir.join("artifacts"),
+    remote_artifacts_dir: shared_job_dir.join("artifacts"),
     remote_cargo_home_dir: remote_work_dir.join("cache").join("cargo-home"),
```
Keep remote artifacts scoped to the run.
`remote_artifacts_dir` now lives under `${remote_work_dir}/jobs/${job.id}`, while the rest of the execution state stays under `${remote_work_dir}/runs/${run_id}/...`. Combined with `reset_remote_microvm_artifacts`, two runs of the same job id will delete or overwrite each other's `guest.log` and `result.json`. This needs to stay under `remote_job_dir` or include `run_id` in the shared path.
Suggested fix
```diff
-    remote_artifacts_dir: shared_job_dir.join("artifacts"),
+    remote_artifacts_dir: remote_job_dir.join("artifacts"),
```

📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
```rust
let remote_job_dir = remote_run_dir.join("jobs").join(job.id);
let shared_job_dir = remote_work_dir.join("jobs").join(job.id);
Ok(RemoteMicrovmContext {
    remote_host,
    remote_work_dir: remote_work_dir.clone(),
    remote_snapshot_dir,
    remote_vm_dir: remote_job_dir.join("vm"),
    remote_artifacts_dir: remote_job_dir.join("artifacts"),
    remote_cargo_home_dir: remote_work_dir.join("cache").join("cargo-home"),
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@crates/pikaci/src/executor.rs` around lines 1221 - 1229, remote_artifacts_dir
is currently set to shared_job_dir.join("artifacts") which places artifacts
under `${remote_work_dir}/jobs/${job.id}` and can collide between runs; change
it to be scoped to the run by using remote_job_dir.join("artifacts") (or include
run_id in the path) when constructing the RemoteMicrovmContext so artifacts live
under the same `${remote_work_dir}/runs/${run_id}/...` as the rest of the state;
update any related logic that assumes the old path (e.g.,
reset_remote_microvm_artifacts) to reference
RemoteMicrovmContext.remote_artifacts_dir accordingly.
```rust
let remote_snapshot_root =
    staged_linux_remote_snapshot_dir(&snapshot_dir, remote_work_dir, run_id)?;
let remote_snapshot_dir = if snapshot_relative == Path::new("snapshot") {
    remote_snapshot_root
} else {
    let nested = snapshot_relative
        .strip_prefix("snapshot")
        .with_context(|| {
            format!(
                "translate nested staged Linux Rust snapshot path {}",
                snapshot_relative.display()
            )
        })?;
    remote_snapshot_root.join(nested)
};
```
Keep the snapshot root separate from the translated installable path.
Line 3105 passes the possibly nested installable directory into `staged_linux_remote_snapshot_dir()`. That helper reads `pikaci-snapshot.json` from the exact path it is given (crates/pikaci/src/executor.rs, Lines 1240-1253), so `.../snapshot/<subdir>#...` will miss the stored `content_hash` and fall back to `/runs/<run_id>/snapshot`. Lines 3139-3144 still sync the snapshot root, which means the copied remote tree no longer matches `remote_installable` when the nested branch is exercised. Derive `remote_snapshot_root` from the actual snapshot root and append the nested suffix afterwards.
💡 Suggested fix

```diff
+let local_snapshot_root = run_dir.join("snapshot");
 let run_id = run_dir
     .file_name()
     .and_then(|value| value.to_str())
     .ok_or_else(|| anyhow!("derive run id from {}", run_dir.display()))?;
 let remote_snapshot_root =
-    staged_linux_remote_snapshot_dir(&snapshot_dir, remote_work_dir, run_id)?;
+    staged_linux_remote_snapshot_dir(&local_snapshot_root, remote_work_dir, run_id)?;
```

Also applies to: 3139-3144
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@crates/pikaci/src/run.rs` around lines 3104 - 3118,
staged_linux_remote_snapshot_dir is being called with a potentially nested
installable path which causes it to read pikaci-snapshot.json from the wrong
location; instead, call staged_linux_remote_snapshot_dir with the actual
snapshot root (i.e., the non-nested "snapshot" root) to compute
remote_snapshot_root, then compute remote_snapshot_dir by appending the nested
suffix (using snapshot_relative.strip_prefix("snapshot") -> nested ->
remote_snapshot_root.join(nested)); also ensure the later sync that uses
remote_snapshot_root (lines ~3139-3144) still syncs the root so the copied tree
matches remote_installable.
```rust
if metadata.file_type().is_symlink() {
    hasher.update(b"symlink\0");
    hasher.update(relative.as_os_str().as_encoded_bytes());
    hasher.update(b"\0");
    let target =
        fs::read_link(&path).with_context(|| format!("read symlink {}", path.display()))?;
    hasher.update(target.as_os_str().as_encoded_bytes());
    hasher.update(b"\0");
} else if metadata.is_dir() {
    hasher.update(b"dir\0");
    hasher.update(relative.as_os_str().as_encoded_bytes());
    hasher.update(b"\0");
    hash_snapshot_tree(root, &path, hasher)?;
} else if metadata.is_file() {
    hasher.update(b"file\0");
    hasher.update(relative.as_os_str().as_encoded_bytes());
    hasher.update(b"\0");
    let mut file =
        fs::File::open(&path).with_context(|| format!("open {}", path.display()))?;
    let mut buffer = [0u8; 8192];
    loop {
        let read = file
            .read(&mut buffer)
            .with_context(|| format!("read {}", path.display()))?;
        if read == 0 {
            break;
        }
        hasher.update(&buffer[..read]);
    }
    hasher.update(b"\0");
```
Hash file mode and content length, not just raw bytes.
Lines 236-251 ignore permission bits and do not length-prefix file contents. A chmod-only change therefore keeps the same snapshot hash, and a binary file containing delimiter bytes can serialize to the same digest stream as multiple later entries. Since this hash now controls content-addressed snapshot reuse, two distinct trees can alias the same remote snapshot.
💡 Suggested fix

```diff
-use std::os::unix::fs as unix_fs;
+use std::os::unix::fs::{self as unix_fs, PermissionsExt};
 ...
 } else if metadata.is_file() {
     hasher.update(b"file\0");
     hasher.update(relative.as_os_str().as_encoded_bytes());
     hasher.update(b"\0");
+    hasher.update(metadata.permissions().mode().to_be_bytes());
+    hasher.update(metadata.len().to_be_bytes());
     let mut file =
         fs::File::open(&path).with_context(|| format!("open {}", path.display()))?;
     let mut buffer = [0u8; 8192];
     loop {
         let read = file
             .read(&mut buffer)
             .with_context(|| format!("read {}", path.display()))?;
         if read == 0 {
             break;
         }
         hasher.update(&buffer[..read]);
     }
-    hasher.update(b"\0");
 }
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@crates/pikaci/src/snapshot.rs` around lines 222 - 251, The snapshot hash
currently omits file permission bits and does not length-prefix file contents;
update hash_snapshot_tree so that for files (metadata.is_file()) you include the
file's mode/permission bits (from metadata.permissions()/platform-specific mode)
and the file length (metadata.len()) into the hasher before streaming content,
and for symlinks include any relevant permission/mode info from the symlink's
metadata as well; keep using the existing hasher.update(...) calls and the
existing relative/path markers and separators so you only add the mode and
length values (as deterministic byte sequences) to prevent chmod-only or
delimiter-collision collisions.
- note that the next reuse slice moved that remaining per-run runner staging onto the same strict content-addressed footing:
  - each staged Linux remote microVM runner flake is now keyed by the SHA-256 content hash of its rendered `flake.nix`, under `/var/tmp/pikaci-prepared-output/runner-flakes/<hash>/flake` on `pika-build`,
  - `pikaci` records `pikaci-runner-flake.json` next to that remote flake payload with the exact `content_hash` and realized remote runner store path, then validates both on reuse by requiring an exact hash match and a still-existing remote store path; it now hard-fails on mismatch or stale metadata instead of silently re-rendering ambiguous state,
  - the remote artifacts mount for staged Linux remote microVM jobs now points at the shared per-job path under `/var/tmp/pikaci-prepared-output/jobs/<job>/artifacts`, and each run resets that directory before launch so the rendered runner flake stays content-stable across unchanged reruns,
🧩 Analysis chain

🏁 Script executed:

```bash
#!/bin/bash
set -euo pipefail
# Inspect how remote artifact directories are derived.
rg -n -C4 'prepared-output/jobs|jobs/.*/artifacts|artifacts_dir|remote.*artifacts' \
  crates/pikaci/src/executor.rs crates/pikaci/src/run.rs
# Inspect whether launch/reset code removes or recreates those directories.
rg -n -C4 'remove_dir_all|create_dir_all|reset.*artifacts|artifacts.*reset' \
  crates/pikaci/src/executor.rs crates/pikaci/src/run.rs
```

Repository: sledtools/pika
Length of output: 50370

🏁 Script executed:

```bash
rg -n 'shared_job_dir' crates/pikaci/src/executor.rs | head -20
```

Repository: sledtools/pika
Length of output: 310

🏁 Script executed:

```bash
rg -n 'remote_work_dir.*=' crates/pikaci/src/executor.rs | grep -v '//' | head -20
```

Repository: sledtools/pika
Length of output: 103

🏁 Script executed:

```bash
sed -n '1200,1240p' crates/pikaci/src/executor.rs
```

Repository: sledtools/pika
Length of output: 1888
Make the remote artifacts dir run-scoped, not just job-scoped.
The remote artifacts path is `/var/tmp/pikaci-prepared-output/jobs/<job.id>/artifacts` (keyed only by job ID), and `reset_remote_microvm_artifacts()` destructively removes and recreates it before every launch. This allows overlapping runs of the same lane to clobber each other's artifacts on `pika-build`. Keep the runner flake cache content-addressed, but isolate the writable artifacts root per run, either by including the run identifier in the path or by removing the destructive reset of the shared mutable location.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@todos/pikaci-staged-ci-plan.md` at line 910, The remote artifacts directory
is currently job-scoped and reset destructively by
reset_remote_microvm_artifacts(), enabling concurrent runs to clobber each
other; change the artifacts root to be run-scoped by including the run
identifier (e.g., run.id or run.uuid) in the path used when mounting/creating
artifacts under '/var/tmp/pikaci-prepared-output/jobs/<job.id>/artifacts' (or
alternatively remove the destructive reset in reset_remote_microvm_artifacts()
and make that function operate on a run-specific path), and ensure any code that
references the runner flake cache still uses its content-addressed path while
writable artifacts use the new per-run path.
* Reuse staged Linux snapshots by content hash
* Record pikaci snapshot hash dependencies
* Reuse staged Linux runner flakes remotely
Summary
Test plan
🤖 Generated with Claude Code