Pipeline Design 324

Now I have everything needed. Here is the ADR:

Design: feat(ruflo): close the learning feedback loop — call ruflo_learn_from_shipwright() at pipeline completion

Context

Shipwright's ruflo integration currently handles detection, MCP lifecycle, circuit-breaking, and memory indexing (ruflo-adapter.sh, 195 lines on this branch). The main branch already has 3 functions needed for learning — ruflo_store() (line 257), _ruflo_resolve_repo_hash() (line 314), and ruflo_learn_from_shipwright() (line 900) — totaling ~780 additional lines of adapter code including review/CQ stage helpers. However, none of these functions are called from the pipeline completion paths.

The pipeline has two terminal outcomes: success (end of stage_validate() at pipeline-stages-monitor.sh:115) and failure (threshold exceeded in stage_monitor() at line 254). Neither currently feeds outcome data back to ruflo's HNSW index for semantic recall on future runs. This means ruflo_recall_similar_outcomes() has no data to search — the feedback loop is open.

Constraints:

All ruflo calls must be fail-open (|| true) per adapter convention
Must guard with ruflo_available() + function-existence check (adapter may not be sourced)
Bash 3.2 compatible — no associative arrays, no ${var,,}
Outcome artifacts must go to $ARTIFACTS_DIR for pipeline traceability
Event schema (config/event-schema.json) must register new event types

Decision

Direct function calls at two pipeline exit points, with fail-open guards. Extract the three missing functions from main into this branch's ruflo-adapter.sh, then add call sites in pipeline-stages-monitor.sh.

Data Flow

stage_validate() success                   stage_monitor() threshold exceeded
        │                                           │
        ▼                                           ▼
Build outcome JSON                         Build outcome JSON
  {status:"success", issue,                  {status:"failure", issue,
   goal, task_type, duration_s}               goal, task_type, errors,
        │                                     error_threshold}
        ▼                                           │
Write to ARTIFACTS_DIR/                             ▼
  pipeline-outcome-validate.json           Write to ARTIFACTS_DIR/
        │                                    pipeline-outcome-monitor-failure.json
        ▼                                           │
ruflo_learn_from_shipwright(file)          ruflo_learn_from_shipwright(file)
        │                                           │
        ├─ _ruflo_resolve_repo_hash()      (same flow)
        ├─ jq parse task_type
        ├─ ruflo_store(key, content,
        │    "learning-<hash>",
        │    "skill-memory,outcome,<type>")
        └─ emit_event "ruflo.learn_from_shipwright"

Guard Pattern (both call sites identical)

if [[ -n "${ARTIFACTS_DIR:-}" ]]; then
    # ... build and write outcome JSON ...
    if ruflo_available && type ruflo_learn_from_shipwright >/dev/null 2>&1; then
        ruflo_learn_from_shipwright "$_outcome_file" || true
    fi
fi

Three layers of protection: (1) ARTIFACTS_DIR existence, (2) ruflo_available() boolean + function existence, (3) || true catch-all. The function itself has internal guards: ruflo_available || return 0 and _ruflo_resolve_repo_hash || return 0.

Error Handling

ruflo unavailable: ruflo_available() returns 1 → skip silently, pipeline unaffected
Function not sourced: type ruflo_learn_from_shipwright fails → skip silently
Bad JSON / jq failure: Function returns 0 (fail-open), no outcome indexed
ruflo timeout: Circuit-breaker fires (ruflo_with_timeout), disables ruflo for remainder of run
Disk write failure: echo > file 2>/dev/null || true — outcome not persisted, pipeline continues

Alternatives Considered

Inline learning code at call sites — Pros: no function extraction, self-contained / Cons: duplicates ~30 lines of jq + store + emit logic across two sites, violates DRY, breaks if learning logic evolves. Rejected.
Batch learning via post-pipeline job — Pros: decouples learning from pipeline, easier standalone testing / Cons: adds async orchestration, learning doesn't feed back to the next run's stage_plan/stage_design recall, more operational surface. Rejected.
Event-callback mechanism — Pros: flexible, could enable/disable per template / Cons: current emit_event is fire-and-forget (writes JSONL), not a callback dispatch; building callback infra for one call site is over-engineering. Rejected.

Implementation Plan

Files to modify

File	Change	Estimated lines
`scripts/lib/ruflo-adapter.sh`	Cherry-pick `ruflo_store()`, `_ruflo_resolve_repo_hash()`, `ruflo_learn_from_shipwright()` and their transitive dependencies (`_ruflo_run_quiet`, `ruflo_recall`, `_ruflo_repo_hash_candidates`, `_ruflo_shipwright_memory_dir`) from `main` (lines 230–940)	~700 lines (extract, not new code)
`scripts/lib/pipeline-stages-monitor.sh`	Add outcome JSON + `ruflo_learn_from_shipwright` call at end of `stage_validate()` (before line 115) and after `emit_event "monitor.alert"` in `stage_monitor()` (after line 264)	~30 lines
`config/event-schema.json`	Add `ruflo.learn_from_shipwright` event type	~4 lines
`scripts/sw-ruflo-adapter-test.sh`	Add unit test for `ruflo_learn_from_shipwright` + integration tests for both call sites	~80 lines

Files to create

None.

Dependencies

None (all ruflo dependencies already present via MCP and the adapter's fail-open pattern).

Risk Areas

Risk	Mitigation
Cherry-pick from main brings unrelated code	Only extract the 3 target functions + their direct helpers; verify with `grep` that no new globals or side-effects leak
`now_epoch` / `PIPELINE_START_EPOCH` not available in validate context	Both are set by `pipeline-stages.sh` before any stage runs; verify with read of pipeline-stages.sh
Outcome JSON missing `task_type`	Function handles this: falls back to `issue_type`, then `"unknown"` (line 917 on main)
`jq` not installed	`jq` is already a hard dependency of the pipeline (used in every stage config read); not a new requirement
Monitor failure path has complex branching (rollback, hotfix issue, etc.)	Insert call before rollback logic so outcome is captured regardless of rollback success

Pipeline Design 324

Design: feat(ruflo): close the learning feedback loop — call ruflo_learn_from_shipwright() at pipeline completion

Context

Decision

Data Flow

Guard Pattern (both call sites identical)

Error Handling

Alternatives Considered

Implementation Plan

Files to modify

Files to create

Dependencies

Risk Areas

Validation Criteria

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Clone this wiki locally