You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Native vs Original AgentFS (origin/main3a5ed2b, AgentFS 0.6.4)
vs Tier One AgentFS (phase4-north-star-implementation9be0da4,
the kernel-cache-by-default ship).
All runs on the same machine with no AGENTFS_FUSE_* env vars set, release builds.
Headline (ratio of agentfs / native; lower is better)
Workload
Original
Tier One
Δ
Read-heavy (full run incl. startup)
2.51x
3.03x
+21%
Read-heavy (steady-state only)
7.76x
3.79x
−51%
Copy-on-write edit (50 MiB file)
8.19x
5.42x
−34%
Mixed git workload
5.16x
3.21x
−38%
Plus: CoW delta DB growth (overlay copy-up footprint, lower is better):
Original 172.6 MiB → Tier One 50.4 MiB (−71%)
Tier One mixed-workload stdev: 0.85x Original mixed-workload stdev: 1.21x
Tier Two focus areas (from this comparison)
Clone phase still dominates the mixed wall (~2.2 s of the ~2.9 s
agentfs median). Native does the same clone in ~0.28 s. The clone phase
does many small writes through copy-on-write to the SQLite delta;
batched write path and/or parallel git-pack creation under copy-on-write
is the next big lever. (median ratio: orig 7.03x → tier1 7.65x; the
variance is within noise on a 0.28 s native baseline.)
Mount startup regressed by ~10-15 ms (read-heavy full-run ratio went
2.51x → 3.03x — going up) because Tier One mounts now negotiate
parallel workers + readdirplus + writeback + ABI 7.31 caps at FUSE init.
For short-lived sandboxes this dominates total wall; for sustained
workloads it is amortised, which is why steady-state read-storms dropped
from 7.76x → 3.79x in the same comparison. Tier Two should defer worker
pool warmup to first request to recover that startup cost.
Copy-on-write DB growth is now great (-71%) but the wall-time ratio
(5.42x) is still the worst-of-three. Chunked copy-up + smarter chunk
sizing is the obvious next win and would compound with Tier Two prep: fresh native vs origin/main vs Tier One benchmark comparison #1 since
git-clone bottlenecks on the same path.
Steady-state read storms are near best-case: stat_lstat_storm
(warm) is 0.97x — actually faster than native — because the kernel
attribute cache absorbs everything past the first lookup. Future
read-path tuning is diminishing returns; the Tier Two budget should go
to CoW writes and clone-phase batching.
Behaviour to verify before Tier Two ships: the per-iteration
variance on the mixed workload is high (Tier One stdev 0.85x, Original
1.21x). A longer-iter (e.g. 10 + 2 warmup) run on a quiet machine would
tighten the medians; current 3-iter medians are reliable directional
signal but not paper-grade absolutes.
Tier Two prep — fresh benchmark comparison
Native vs Original AgentFS (
origin/main3a5ed2b, AgentFS 0.6.4)vs Tier One AgentFS (
phase4-north-star-implementation9be0da4,the kernel-cache-by-default ship).
All runs on the same machine with no
AGENTFS_FUSE_*env vars set, release builds.Headline (ratio of agentfs / native; lower is better)
Plus: CoW delta DB growth (overlay copy-up footprint, lower is better):
Original 172.6 MiB → Tier One 50.4 MiB (−71%)
Read-heavy detail (read-path-benchmark.py, cold + warm modes)
8 files / 2 dirs / 64 KiB each; 8 iters each of stat-storm, readdir-storm,
open-read-close, repeated-open-read on a steady-state mount.
Copy-on-write detail (large-edit-benchmark.py)
50 MiB base file, single-byte edit at file midpoint, then re-read+compare for correctness.
Mixed git-workload detail (git-workload-benchmark-multi.py)
openai/codex (4 643 files, 690 dirs, 63 MiB) bare→working clone, status,
32-file ls-files scan w/ 4 KiB reads, 4 representative edits w/ fsync, diff.
3 measurement iterations + 1 warmup. Medians shown.
Per-iteration reproducibility — mixed workload
Tier One mixed-workload stdev: 0.85x
Original mixed-workload stdev: 1.21x
Tier Two focus areas (from this comparison)
Clone phase still dominates the mixed wall (~2.2 s of the ~2.9 s
agentfs median). Native does the same clone in ~0.28 s. The clone phase
does many small writes through copy-on-write to the SQLite delta;
batched write path and/or parallel git-pack creation under copy-on-write
is the next big lever. (median ratio: orig 7.03x → tier1 7.65x; the
variance is within noise on a 0.28 s native baseline.)
Mount startup regressed by ~10-15 ms (read-heavy full-run ratio went
2.51x → 3.03x — going up) because Tier One mounts now negotiate
parallel workers + readdirplus + writeback + ABI 7.31 caps at FUSE init.
For short-lived sandboxes this dominates total wall; for sustained
workloads it is amortised, which is why steady-state read-storms dropped
from 7.76x → 3.79x in the same comparison. Tier Two should defer worker
pool warmup to first request to recover that startup cost.
Copy-on-write DB growth is now great (-71%) but the wall-time ratio
(5.42x) is still the worst-of-three. Chunked copy-up + smarter chunk
sizing is the obvious next win and would compound with Tier Two prep: fresh native vs origin/main vs Tier One benchmark comparison #1 since
git-clone bottlenecks on the same path.
Steady-state read storms are near best-case:
stat_lstat_storm(warm) is 0.97x — actually faster than native — because the kernel
attribute cache absorbs everything past the first lookup. Future
read-path tuning is diminishing returns; the Tier Two budget should go
to CoW writes and clone-phase batching.
Behaviour to verify before Tier Two ships: the per-iteration
variance on the mixed workload is high (Tier One stdev 0.85x, Original
1.21x). A longer-iter (e.g. 10 + 2 warmup) run on a quiet machine would
tighten the medians; current 3-iter medians are reliable directional
signal but not paper-grade absolutes.