Handle extended ASCII whitespace in remote shell parser by oferchen · Pull Request #1297 · oferchen/rsync

oferchen · 2025-10-27T20:50:29Z

Summary

treat vertical tab and form feed as whitespace delimiters in the remote shell parser
add a regression test covering extended ASCII whitespace handling for --rsh

Testing

cargo test -p rsync-transport parser_treats_extended_ascii_whitespace_as_delimiters

https://chatgpt.com/codex/tasks/task_e_68ffd8be49a0832393e09ec429638762

Specifies the criterion harness that gates the sharded BufferPool layout from #1295: 1/4/16/64 thread acquire-release loops with mixed adaptive buffer sizes, pass/fail thresholds keyed to linear scaling through 16 threads, and explicit risk mitigations for false sharing and hot-shard imbalance.

Add criterion microbench under crates/transfer/benches profiling four result-collection strategies (shared Arc<Mutex<Vec>>, sharded Mutex<Vec> by rayon worker id, crossbeam SegQueue, crossbeam unbounded channel) over 100K items at 1/4/8/16 rayon worker counts. Throughput is reported in elements/sec so the bench output names the worker count at which the single shared mutex saturates. The shipping parallel-stat path in crates/transfer/src/parallel_io.rs does not use Arc<Mutex<Vec>>; it collects via into_par_iter().map(f).collect(), which delegates to rayon's lock-free reducer. Document this in docs/audits/parallel-stat-collection.md and keep the microbench as the baseline against which any future PR that proposes reintroducing a shared mutex on this path must be measured (tracked under #1192, #1271, #1297, #1370, #1682).

… (#4170) * chore(bench): add parallel-stat collector contention microbench Add criterion microbench under crates/transfer/benches profiling four result-collection strategies (shared Arc<Mutex<Vec>>, sharded Mutex<Vec> by rayon worker id, crossbeam SegQueue, crossbeam unbounded channel) over 100K items at 1/4/8/16 rayon worker counts. Throughput is reported in elements/sec so the bench output names the worker count at which the single shared mutex saturates. The shipping parallel-stat path in crates/transfer/src/parallel_io.rs does not use Arc<Mutex<Vec>>; it collects via into_par_iter().map(f).collect(), which delegates to rayon's lock-free reducer. Document this in docs/audits/parallel-stat-collection.md and keep the microbench as the baseline against which any future PR that proposes reintroducing a shared mutex on this path must be measured (tracked under #1192, #1271, #1297, #1370, #1682). * chore: sync Cargo.lock after dev-dep addition

The #1297 task assumed a Mutex<Vec<Vec<u8>>> baseline for a single-mutex vs sharded contention bench. The production BufferPool already uses a thread-local single-slot cache in front of a lock-free crossbeam_queue::ArrayQueue with a CAS-admission soft cap. This audit documents the current architecture, lists the remaining (non-storage) mutexes, explains why the originally-scoped comparison cannot be authored, and records what is already measured vs the gaps that the sharded-queue follow-up plan would address.

Specifies the criterion harness that gates the sharded BufferPool layout from #1295: 1/4/16/64 thread acquire-release loops with mixed adaptive buffer sizes, pass/fail thresholds keyed to linear scaling through 16 threads, and explicit risk mitigations for false sharing and hot-shard imbalance.

… (#4170) * chore(bench): add parallel-stat collector contention microbench Add criterion microbench under crates/transfer/benches profiling four result-collection strategies (shared Arc<Mutex<Vec>>, sharded Mutex<Vec> by rayon worker id, crossbeam SegQueue, crossbeam unbounded channel) over 100K items at 1/4/8/16 rayon worker counts. Throughput is reported in elements/sec so the bench output names the worker count at which the single shared mutex saturates. The shipping parallel-stat path in crates/transfer/src/parallel_io.rs does not use Arc<Mutex<Vec>>; it collects via into_par_iter().map(f).collect(), which delegates to rayon's lock-free reducer. Document this in docs/audits/parallel-stat-collection.md and keep the microbench as the baseline against which any future PR that proposes reintroducing a shared mutex on this path must be measured (tracked under #1192, #1271, #1297, #1370, #1682). * chore: sync Cargo.lock after dev-dep addition

The #1297 task assumed a Mutex<Vec<Vec<u8>>> baseline for a single-mutex vs sharded contention bench. The production BufferPool already uses a thread-local single-slot cache in front of a lock-free crossbeam_queue::ArrayQueue with a CAS-admission soft cap. This audit documents the current architecture, lists the remaining (non-storage) mutexes, explains why the originally-scoped comparison cannot be authored, and records what is already measured vs the gaps that the sharded-queue follow-up plan would address.

… (#4170) * chore(bench): add parallel-stat collector contention microbench Add criterion microbench under crates/transfer/benches profiling four result-collection strategies (shared Arc<Mutex<Vec>>, sharded Mutex<Vec> by rayon worker id, crossbeam SegQueue, crossbeam unbounded channel) over 100K items at 1/4/8/16 rayon worker counts. Throughput is reported in elements/sec so the bench output names the worker count at which the single shared mutex saturates. The shipping parallel-stat path in crates/transfer/src/parallel_io.rs does not use Arc<Mutex<Vec>>; it collects via into_par_iter().map(f).collect(), which delegates to rayon's lock-free reducer. Document this in docs/audits/parallel-stat-collection.md and keep the microbench as the baseline against which any future PR that proposes reintroducing a shared mutex on this path must be measured (tracked under #1192, #1271, #1297, #1370, #1682). * chore: sync Cargo.lock after dev-dep addition

The #1297 task assumed a Mutex<Vec<Vec<u8>>> baseline for a single-mutex vs sharded contention bench. The production BufferPool already uses a thread-local single-slot cache in front of a lock-free crossbeam_queue::ArrayQueue with a CAS-admission soft cap. This audit documents the current architecture, lists the remaining (non-storage) mutexes, explains why the originally-scoped comparison cannot be authored, and records what is already measured vs the gaps that the sharded-queue follow-up plan would address.

Handle additional whitespace in remote shell parser

57d6be8

oferchen added the codex label Oct 27, 2025 — with ChatGPT Codex Connector

oferchen merged commit fe9c963 into master Oct 27, 2025

oferchen deleted the integrate-rsync-3.4.1-protocol-in-rust branch October 27, 2025 20:50

This was referenced May 5, 2026

docs(design): sharded BufferPool layout for high thread counts #3645

Merged

docs(audits): BufferPool sharded benchmark plan #3820

Merged

oferchen mentioned this pull request May 16, 2026

bench: profile Arc<Mutex<Vec>> contention in parallel-stat path (#1192) #4170

Merged

3 tasks

oferchen mentioned this pull request May 16, 2026

docs(audits): BufferPool current state - sharding bench precondition (#1297) #4179

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Handle extended ASCII whitespace in remote shell parser#1297

Handle extended ASCII whitespace in remote shell parser#1297
oferchen merged 1 commit into
masterfrom
integrate-rsync-3.4.1-protocol-in-rust

oferchen commented Oct 27, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

oferchen commented Oct 27, 2025

Summary

Testing

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant