feat: periodic soak test mode for RAC fuzz framework by rophy · Pull Request #22 · rophy/OpenLogReplicator

rophy · 2026-04-29T20:53:52Z

Summary

- Evolve fuzz test into repeatable periodic soak mode: each cycle is a finite fuzz run with cursor-based resume
- Make workload idempotent across cycles (repopulate tracked IDs from existing rows, continue sequences from MAX)
- Move table cleanup into workload PL/SQL so DELETEs flow through CDC and get validated
- Replace host-side archive cleanup with containerized cron service
- Drop streaming soak design (Phase B), replaced by periodic fuzz model

Test plan

- 8-hour soak run (36 cycles, 10min each): ~1.1M events validated, 0 mismatches
- Back-to-back cycles with no teardown: cursor advances correctly each cycle
- Existing single-cycle fuzz test (run N + validate) still works

Summary by CodeRabbit

New Features
- Added an archive-cleanup service with scheduled hourly removal of old archive logs.
- Added a soak driver for continuous fuzz testing with cycle carryover and metrics aggregation.
- Added resumable validation via checkpoint cursors.
Improvements
- Better failure diagnostics, readiness detection, and DB connection robustness.
- Configurable TTL-based in-memory cleanup and periodic data purge.
- Workload now supports periodic cleanup to limit historical data growth.
Tests
- Workspace persistence for soak-state carryover between cycles.

Phase A - Storage cleanup (works with existing finite runs):
- Add created_at column to all FUZZ_* tables for TTL-based purge
- Add FUZZ_WKL.cleanup() procedure + DBMS_SCHEDULER job (every 30min)
- Add archive log cleanup loop in fuzz-test.sh up (hourly)
- Add seq dict pruning in kafka-consumer.py (every 10min, 24h TTL)
- Add SQLite event purge in validator.py after each validation cycle

Phase B - Continuous soak operation:
- Add FUZZ_WKL.run_forever() with rate limiting via DBMS_SESSION.SLEEP
- Add SOAK_MODE to validator.py: continuous validate-purge cycles
- Add stall detection (exit if no new events for 5min)
- Add fuzz-test.sh soak subcommand with health monitoring
- Add SQLite busy_timeout to handle concurrent writer contention

Tested: 2-min finite run (11,238 events, 0 mismatches) and 5-min soak run (2,407 events validated in cycle 1, 0 mismatches) on RAC VM.

Phase B (continuous soak) is being redesigned as periodic fuzz runs; remove the streaming implementation:
- Drop run_forever() in FUZZ_WKL package
- Drop SOAK_MODE path in validator.py (stall detection, continuous loop)
- Drop soak subcommand and action_soak from fuzz-test.sh
- Remove SOAK-TEST.md design doc

Replace host-side archive cleanup loop with a docker-compose service:
- New archive-cleanup/ directory (Dockerfile + crontab)
- Alpine + openssh-client + supercronic v0.2.44 (SHA1 pinned)
- Hourly find -mtime +1 -print -delete, deletions logged to stdout
- Lifecycle tied to docker-compose up/down, visible via docker logs

Phase A cleanup mechanisms retained:
- created_at columns + FUZZ_WKL.cleanup() + DBMS_SCHEDULER job
- Consumer seq dict pruning (10min interval, 24h TTL)
- validator.py purge_old_events() called after each one-shot cycle
- SQLite busy_timeout=30000 on both consumer and validator
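The retained in-memory and SQLite cleanup mechanisms could look roughly like the following. This is a minimal sketch, not the code in kafka-consumer.py or validator.py: `prune_seq_dict`, `open_events_db`, and the dict layout (key mapped to a last-seen timestamp) are assumptions made for illustration. Only the 24h TTL, the 10min interval, and `busy_timeout=30000` come from the commit message.

```python
import sqlite3
import time

SEQ_TTL_SECONDS = 24 * 60 * 60    # 24h TTL, from the commit message
PRUNE_INTERVAL_SECONDS = 10 * 60  # a caller would invoke prune_seq_dict this often


def prune_seq_dict(seq_seen, now, ttl=SEQ_TTL_SECONDS):
    """Drop entries last seen more than ttl seconds ago; return how many were removed."""
    # Collect stale keys first so the dict is not mutated while iterating.
    stale = [key for key, last_seen in seq_seen.items() if now - last_seen > ttl]
    for key in stale:
        del seq_seen[key]
    return len(stale)


def open_events_db(path):
    """Open SQLite with a 30s busy timeout so a locked database is retried, not failed at once."""
    conn = sqlite3.connect(path)
    conn.execute("PRAGMA busy_timeout = 30000")
    return conn


# Hypothetical data: one entry older than 24h, one recent.
now = time.time()
seq_seen = {"evt-old": now - 25 * 3600, "evt-new": now - 60}
removed = prune_seq_dict(seq_seen, now)

conn = open_events_db(":memory:")
busy_timeout_ms = conn.execute("PRAGMA busy_timeout").fetchone()[0]
```

Setting the timeout on both the consumer and the validator connection matters because either side can hold the write lock while the other tries to write.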

- Move FUZZ_* table cleanup into workload PL/SQL (every 5min) so DELETEs flow through CDC and get validated like normal DML.
- Remove DBMS_SCHEDULER FUZZ_CLEANUP job from up/down (replaced by in-workload cleanup).
- Fix grep -q + pipefail SIGPIPE false-failure in OLR/Debezium readiness waits.

…ator
- fuzz-workload.sql: make FUZZ_WKL.run() idempotent across cycles. Repopulate per-node tracked-ID arrays from existing rows (parity-filtered), continue g_next_id from MAX(id) preserving parity, and continue g_event_seq from MAX numeric tail of existing event_ids so cycle N+1 event_ids never collide with cycle N. Seed INSERTs only run on cold start (empty tables).
- validator.py: accept START_CURSOR env and emit '[validator] final_cursor=...' (safe frontier only). Lets a soak loop resume past already-validated events without re-scanning.
- fuzz-test.sh: forward START_CURSOR to the validator container; dump sqlplus output when FUZZ_DONE summary is missing (diagnostic).

Verified: 3 back-to-back cycles with no teardown, 32k events, 0 mismatches, cursor advances each cycle.
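The idempotency rules for the workload can be illustrated outside the database. The real logic lives in PL/SQL inside fuzz-workload.sql; the Python below is only a sketch of the arithmetic. The event_id format (`N1_000041`) and the mapping of node to ID parity are assumptions, since the commit message states only that the numeric tail and the parity are preserved.

```python
import re


def next_event_seq(existing_event_ids):
    """Return 1 + the largest trailing integer among the ids, or 1 on cold start."""
    tails = []
    for event_id in existing_event_ids:
        match = re.search(r"(\d+)$", event_id)  # numeric tail of the id
        if match:
            tails.append(int(match.group(1)))
    return max(tails, default=0) + 1


def next_id_preserving_parity(max_id, node_parity):
    """Return the smallest id above max_id with id % 2 == node_parity."""
    candidate = max_id + 1
    if candidate % 2 != node_parity:
        candidate += 1
    return candidate


# Hypothetical ids left behind by a previous cycle.
resume_seq = next_event_seq(["N1_000041", "N2_000107"])
cold_start_seq = next_event_seq([])
odd_node_next = next_id_preserving_parity(10, 1)
even_node_next = next_id_preserving_parity(10, 0)
```

Because each cycle starts strictly above the previous maximum, event_ids from cycle N+1 cannot collide with those from cycle N, and each node keeps writing only IDs of its own parity.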

coderabbitai · 2026-04-29T20:55:40Z

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: 6301765f-0983-46c5-a01b-c2740a72b277

📥 Commits

Reviewing files that changed from the base of the PR and between c95225c and d892bd3.

📒 Files selected for processing (1)

tests/dbz-twin/rac/soak.sh

📝 Walkthrough

Adds a soak orchestration script that repeatedly runs fuzz cycles, validates results with resumable cursors, aggregates per-cycle logs/metrics, and advances state until a time deadline or a cycle failure.

Changes

Soak orchestration

Layer / File(s)	Summary
Soak driver and cycle orchestration `tests/dbz-twin/rac/soak.sh`	New soak driver that runs repeated `fuzz-test.sh run` cycles, validates with `fuzz-test.sh validate` using `START_CURSOR`, extracts `final_cursor`, aggregates metrics, logs per-cycle outputs, and stops on deadline or failure.

Sequence Diagram

sequenceDiagram
  participant soak as soak.sh
  participant run as fuzz-test.sh (run)
  participant validate as fuzz-test.sh (validate) / validator.py
  participant logs as cycle-logs

  soak->>run: start run for CYCLE_MIN, write cycle-###-run.log
  run-->>logs: combined run output
  soak->>validate: start validate with START_CURSOR, write cycle-###-validate.log
  validate-->>logs: validation output + final_cursor
  soak->>soak: extract final_cursor, update cursor for next cycle
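The flow in the diagram can be modelled as a small loop. soak.sh itself is a shell script; the Python below is a hedged model of its control flow, not a port. The `soak` function, its callback parameters, and the injectable clock are hypothetical names introduced for illustration.

```python
import time


def soak(run_cycle, validate_cycle, deadline, start_cursor="", clock=time.monotonic):
    """Run cycles until the deadline passes or a cycle fails.

    run_cycle(num) returns True on success.
    validate_cycle(num, cursor) returns the new cursor, or None on failure.
    Returns (cycles_completed, last_good_cursor).
    """
    cursor = start_cursor
    cycles = 0
    while clock() < deadline:
        num = cycles + 1
        if not run_cycle(num):
            break  # stop on run failure
        new_cursor = validate_cycle(num, cursor)
        if new_cursor is None:
            break  # stop on validation failure, keep the last good cursor
        cursor = new_cursor
        cycles += 1
    return cycles, cursor


# Demo with a fake clock that advances one tick per deadline check.
ticks = iter(range(100))
seen_cursors = []


def fake_validate(num, cursor):
    seen_cursors.append(cursor)  # record the START_CURSOR each cycle received
    return f"c{num}"


cycles, cursor = soak(lambda num: True, fake_validate, deadline=3,
                      clock=lambda: next(ticks))
```

Each validation starts from the cursor returned by the previous one, which is what lets a cycle skip events that were already validated.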

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Possibly related PRs

rophy/OpenLogReplicator#13: Related fuzz-test framework changes that introduced validator and consumer adjustments referenced by this soak orchestration.
rophy/OpenLogReplicator#14: Overlapping updates to validator/fuzz-test scripts and test orchestration.
rophy/OpenLogReplicator#16: Changes to validator safe-frontier and cursor handling related to this PR's validation flow.

Poem

🐰 In midnight logs I softly creep,

I run the cycles while others sleep.
I save the cursor, march along,
prune old files and hum a song.
Hop, resume, and keep testing strong.

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 50.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title clearly and concisely describes the main addition: a new periodic soak test mode for the RAC fuzz framework, which aligns with the PR's primary objective of converting the fuzz test into a repeatable periodic soak mode.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

rophy · 2026-04-29T22:02:05Z

@coderabbitai review

coderabbitai · 2026-04-29T22:02:12Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (4)

tests/dbz-twin/rac/archive-cleanup/Dockerfile (1)

1-14: Consider adding a non-root user for security best practices.

The container runs as root by default. While this is acceptable for test infrastructure, adding a non-root user would align with container security best practices.

🛡️ Optional: Add non-root user

 RUN apk add --no-cache openssh-client curl \
  && curl -fsSL -o /usr/local/bin/supercronic "${SUPERCRONIC_URL}" \
  && echo "${SUPERCRONIC_SHA}  /usr/local/bin/supercronic" | sha1sum -c - \
  && chmod +x /usr/local/bin/supercronic \
- && apk del curl
+ && apk del curl \
+ && adduser -D -u 1000 cleanup

+USER cleanup
 ENTRYPOINT ["/usr/local/bin/supercronic", "-passthrough-logs"]

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@tests/dbz-twin/rac/archive-cleanup/Dockerfile` around lines 1 - 14, Add a
non-root user and switch to it at the end of the Dockerfile: create a group/user
(e.g., addgroup -S super && adduser -S super -G super), change ownership of
/usr/local/bin/supercronic (and any other runtime paths like /etc/crontab if
needed) to that user, and add a USER super line before ENTRYPOINT/CMD; ensure
permissions allow execution of /usr/local/bin/supercronic and that ENTRYPOINT
("/usr/local/bin/supercronic", "-passthrough-logs") continues to work under the
non-root user.

tests/dbz-twin/rac/fuzz-test.sh (1)

463-463: Quote $_SSH_OPTS to prevent word splitting.

Shellcheck SC2086 correctly identifies that $_SSH_OPTS should be quoted. While it likely works in practice, quoting is safer.
🔧 Suggested fix
-    ssh $_SSH_OPTS "${VM_USER}@${VM_HOST}" \
+    ssh "$_SSH_OPTS" "${VM_USER}@${VM_HOST}" \
Note: If $_SSH_OPTS contains multiple space-separated options, you may need to use an array instead: ssh "${_SSH_OPTS[@]}" ...
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@tests/dbz-twin/rac/fuzz-test.sh` at line 463, The ssh invocation in
fuzz-test.sh uses an unquoted variable $_SSH_OPTS which can undergo word
splitting (SC2086); update the ssh call to quote the variable (use
"${_SSH_OPTS}" or, if _SSH_OPTS is intended as an array, use "${_SSH_OPTS[@]}")
so options are preserved correctly when invoking ssh "${_SSH_OPTS}"
"${VM_USER}@${VM_HOST}" ...; ensure references to _SSH_OPTS, VM_USER, VM_HOST
and the ssh command are updated accordingly.
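The difference between quoting the options as one word and expanding them as an array shows up in the argument vector that ssh receives. The sketch below models that in Python with `shlex.split`, which approximates shell word splitting. The option string and target are hypothetical, since the actual value of `_SSH_OPTS` is not shown in this review.

```python
import shlex

# Hypothetical value; the real _SSH_OPTS is defined elsewhere in the test scripts.
ssh_opts = "-o StrictHostKeyChecking=no -o ConnectTimeout=10"
target = "user@host"  # stands in for "${VM_USER}@${VM_HOST}"

# Equivalent of: ssh "$_SSH_OPTS" target
# The whole option string arrives as a single argument, which ssh cannot parse.
quoted_argv = ["ssh", ssh_opts, target]

# Equivalent of: ssh "${_SSH_OPTS[@]}" target, with one array element per word.
array_argv = ["ssh", *shlex.split(ssh_opts), target]
```

If `_SSH_OPTS` holds more than one option, plain quoting changes behavior, so converting it to an array is the safer way to satisfy SC2086.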

tests/dbz-twin/rac/validator.py (2)

434-434: Minor: Unused variable nf in unpacking.

Prefix with underscore to indicate it's intentionally unused.

🧹 Fix unused variable warning

-            (v, m, mm, mo, ml, to_, tl, lmc, oc, nf) = result
+            (v, m, mm, mo, ml, to_, tl, lmc, oc, _nf) = result

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@tests/dbz-twin/rac/validator.py` at line 434, The tuple unpacking currently
binds an unused variable named `nf`; change the unpack target `nf` to `_nf` (or
`_`) in the unpack expression `(v, m, mm, mo, ml, to_, tl, lmc, oc, nf) =
result` so the unused value is clearly marked and silences the warning while
leaving the rest of the bindings (`v, m, mm, mo, ml, to_, tl, lmc, oc`)
unchanged.

358-361: Minor: Remove unnecessary f-string prefixes.

Lines 359 and 382 use f-strings without placeholders.

🧹 Fix linting warnings

-    print(f"\n{'='*60}", flush=True)
-    print(f"  Fuzz Test Validation Summary", flush=True)
-    print(f"{'='*60}", flush=True)
+    print(f"\n{'='*60}", flush=True)
+    print("  Fuzz Test Validation Summary", flush=True)
+    print(f"{'='*60}", flush=True)

And at line 382:

-    print(f"Validator starting", flush=True)
+    print("Validator starting", flush=True)

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@tests/dbz-twin/rac/validator.py` around lines 358 - 361, Two print
statements use f-strings without placeholders: the title print
(print(f"  Fuzz Test Validation Summary", ...)) at line 359 and the startup
print (print(f"Validator starting", ...)) at line 382. Remove the leading f
from those two only. Keep the f-prefix on prints that actually interpolate an
expression, such as the {'='*60} separator lines and the print that outputs
total_validated, so the output remains identical.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@tests/dbz-twin/rac/soak.sh`:
- Line 6: The hardcoded absolute path in the source command inside
tests/dbz-twin/rac/soak.sh should be replaced with a path computed relative to
the script location (the source invocation in soak.sh), e.g., compute the script
directory via $0 or ${BASH_SOURCE[0]} and source the vm-env.sh from
../environments/rac/vm-env.sh relative to that directory so the script is
portable across machines.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: ab34e29b-3e2d-4124-a533-7cdad113d68b

📥 Commits

Reviewing files that changed from the base of the PR and between 21955ba and d938471.

📒 Files selected for processing (8)

tests/dbz-twin/rac/archive-cleanup/Dockerfile
tests/dbz-twin/rac/archive-cleanup/crontab
tests/dbz-twin/rac/docker-compose-fuzz.yaml
tests/dbz-twin/rac/fuzz-test.sh
tests/dbz-twin/rac/kafka-consumer.py
tests/dbz-twin/rac/perf/fuzz-workload.sql
tests/dbz-twin/rac/soak.sh
tests/dbz-twin/rac/validator.py

rophy · 2026-05-14T23:01:51Z

@coderabbitai review

coderabbitai · 2026-05-14T23:01:57Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (1)

tests/dbz-twin/rac/soak.sh (1)
4-4: ⚡ Quick win

Consider enabling undefined-variable protection.

Adding set -u alongside set -e will catch typos in variable names and prevent silent failures from undefined variables.
🛡️ Proposed enhancement
-set -e
+set -euo pipefail
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/dbz-twin/rac/soak.sh` at line 4, The script currently has "set -e"
which stops on errors but doesn't catch undefined variables; update the shell
options by replacing or augmenting that line to enable undefined-variable
protection (e.g., change "set -e" to "set -eu" or add "set -u" alongside "set
-e") so typos in variable names will cause immediate failures.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@tests/dbz-twin/rac/soak.sh`:
- Around line 36-39: Detect and handle a missing cursor after validation: after
extracting new_cursor from "$LOG_DIR/cycle-${num}-validate.log" (the new_cursor
variable), check if new_cursor is empty; if it is, write an error to the
log/console including the cycle number and the validate log path (use
"$LOG_DIR/cycle-${num}-validate.log"), append the summary line as context, and
exit with a non-zero status instead of assigning cursor="$new_cursor" so the
script fails fast rather than reprocessing from the beginning.
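The fail-fast check described above can be sketched as follows. soak.sh does this in shell; the Python version is only an illustration of the logic. The `[validator] final_cursor=` prefix comes from the validator commit message, while the function name, the assumption that the cursor is a single whitespace-free token, and the sample log are hypothetical.

```python
import re

# Assumes the cursor is one whitespace-free token on its own line.
CURSOR_RE = re.compile(r"^\[validator\] final_cursor=(\S+)\s*$", re.MULTILINE)


def extract_final_cursor(log_text, cycle_num, log_path):
    """Return the last final_cursor in the validate log, or raise if none is present."""
    matches = CURSOR_RE.findall(log_text)
    if not matches:
        # Failing here avoids silently restarting validation from the beginning.
        raise RuntimeError(
            f"cycle {cycle_num}: no final_cursor found in {log_path}"
        )
    return matches[-1]


# Hypothetical validate log content.
sample_log = "Validator starting\n[validator] final_cursor=12345\n"
cursor = extract_final_cursor(sample_log, 1, "cycle-001-validate.log")

try:
    extract_final_cursor("Validator starting\n", 2, "cycle-002-validate.log")
    missing_cursor_error = None
except RuntimeError as exc:
    missing_cursor_error = str(exc)
```

Including the cycle number and log path in the error gives the operator a direct pointer to the log that lacks the cursor line.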

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: 289dd34b-533c-43a2-b1cb-222a03665a0b

📥 Commits

Reviewing files that changed from the base of the PR and between d938471 and c95225c.

📒 Files selected for processing (1)

tests/dbz-twin/rac/soak.sh

rophy added 5 commits April 29, 2026 20:45

feat: add soak.sh driver for back-to-back fuzz cycles

d938471

coderabbitai Bot reviewed Apr 29, 2026

Comment thread tests/dbz-twin/rac/soak.sh Outdated

fix: use relative path for vm-env.sh in soak.sh

c95225c

coderabbitai Bot reviewed May 14, 2026

Comment thread tests/dbz-twin/rac/soak.sh

fix: stricter error handling in soak.sh

d892bd3

- set -euo pipefail for undefined-variable protection - fail fast if validator cursor is missing

rophy merged commit a9345e3 into master May 14, 2026
1 of 2 checks passed

rophy merged 7 commits into master from feat/soak-test-periodic
