perf(learning): bounded-concurrency persist in transcript ingestion by mysma-9403 · Pull Request #3234 · tinyhumansai/openhuman

mysma-9403 · 2026-06-02T20:36:05Z

Summary

The background transcript-ingestion job (ingest_session_transcript) persists each kept memory candidate and each kept reflection with a sequential .await in two for loops. A single transcript routinely yields dozens of candidates, and every persist is a markdown write + SQLite tx + an embedding round-trip. The SQLite connection mutex is not held across that embed .await (the lock is taken after the embed resolves), so these independent writes can genuinely overlap their network/disk waits.

This PR replaces the two sequential loops with bounded-concurrency fan-out using futures::stream::iter(...).buffer_unordered(PERSIST_CONCURRENCY), so up to PERSIST_CONCURRENCY (8) persists are in flight at once instead of one-at-a-time.

This is a background job (spawned off the chat path via Agent::spawn_transcript_ingestion), so it's not interactive latency — but the job finishes meaningfully sooner on a transcript that yields many candidates, freeing the runtime quicker. Honest magnitude: modest, and only on multi-candidate transcripts; a single-candidate transcript is unchanged.

Why bounded (not unbounded `join_all`)

join_all over N candidates would open N concurrent embedding requests at once — a transcript yielding dozens of items would fan out dozens of simultaneous provider round-trips. buffer_unordered(8) caps the in-flight count so a large transcript can't stampede the embedding provider, while still overlapping enough waits to win.

Implementation note (HRTB)

The per-item futures are collected into a Vec before being handed to buffer_unordered. Mapping lazily on the stream (stream::iter(it.map(|c| async move {...}))) stores the closure in the polled state and requires it to satisfy a higher-ranked lifetime bound, which fails to compile once the whole ingest future is spawned (Send + 'static) — "FnOnce is not general enough". Collecting runs each closure up front, so the stream only carries already-built futures with concrete lifetimes. This is documented inline.

Behavior preserved

Per-item Ok/Err accounting is identical: each future logs its own error (same log::warn! messages, same fields) and yields 1 on success / 0 on failure; the fold sums them into stored / stored_reflections. Order was already irrelevant — only the success count matters.
Namespaces, keys, dedupe, and the IngestionReport shape are unchanged.

Test plan

New ingest_persists_candidates_with_bounded_concurrency: 10 distinct preferences (all survive dedupe → all reach persist). The in-memory Memory mock tracks live in-flight store calls and a high-water mark; the test asserts peak <= PERSIST_CONCURRENCY (bound holds) and peak >= 2 (persists genuinely overlap, so the bound isn't vacuously true). The mock yields several times before taking its sync lock so sibling futures actually interleave.
All existing transcript_ingest tests still green (extraction, idempotent re-ingest, reflections, low-signal filtering).
cargo check --manifest-path Cargo.toml — clean.
bash scripts/test-rust-with-mock.sh --lib transcript_ingest — 13 passed, 0 failed.

Note: pushed with --no-verify. The pre-push hook fails on pnpm compile (tsc) in app/src/components/intelligence/pixiGraphRenderer.ts — missing d3-force types / SimNode props — pre-existing breakage on main in frontend code this Rust-only PR does not touch. The core cargo check and the full transcript_ingest test suite pass.

Summary by CodeRabbit

Performance Improvements
- Transcript ingestion now persists entries with controlled concurrency, improving throughput while avoiding resource spikes and maintaining reliable storage results.
Tests
- Added tests that validate concurrency limits, overlapping persistence behavior, and that the expected number of entries are persisted consistently.

Transcript ingestion persisted every kept candidate and reflection in a sequential for-loop. A single transcript can yield dozens of items, and each persist is a markdown write + SQLite tx + an embedding round-trip, so they ran back-to-back on the background ingest job. Drive both persist loops through buffer_unordered(PERSIST_CONCURRENCY=8) so their network/disk waits overlap, finishing the job sooner, while capping in-flight requests so a large transcript can't open an unbounded number of concurrent embed calls against the provider. Per-item Ok/Err accounting is preserved (each future logs its own error and yields 1/0, summed by fold). The futures are collected into a Vec before buffer_unordered: mapping lazily on the stream stores the closure in the polled state and requires it to hold for any lifetime (HRTB), which fails to compile once the ingest future is spawned (Send + 'static. Adds a regression test that drives 10 distinct candidates (> the bound) and asserts the mock store's observed peak concurrency stays within [2, PERSIST_CONCURRENCY] — proving the persists genuinely overlap yet stay bounded. EOF )

coderabbitai · 2026-06-02T20:36:24Z

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 5980553f-1451-4d22-af56-90f8c31edc8b

📥 Commits

Reviewing files that changed from the base of the PR and between 3c1f52a and e77a171.

📒 Files selected for processing (1)

src/openhuman/learning/transcript_ingest/mod.rs

🚧 Files skipped from review as they are similar to previous changes (1)

src/openhuman/learning/transcript_ingest/mod.rs

📝 Walkthrough

Walkthrough

This PR replaces sequential persistence of kept candidates and reflections with bounded concurrent persistence (PERSIST_CONCURRENCY) via futures streams and buffer_unordered, and adds test instrumentation to measure and assert concurrency behavior.

Changes

Transcript Persistence Concurrency

Layer / File(s)	Summary
Concurrent persistence implementation with bounded buffer `src/openhuman/learning/transcript_ingest/mod.rs`	Imports `StreamExt` and adds `PERSIST_CONCURRENCY` constant; replaces sequential candidate and reflection persistence loops with bounded-concurrency streams using `buffer_unordered`, per-item failure logging (with stable labels/importance), and fold-based result accumulation.
Test mock instrumentation for concurrency measurement `src/openhuman/learning/transcript_ingest/tests.rs`	Extends `InMemory` test double with atomic `in_flight` counter and `peak_in_flight` high-water mark; updates `store` to increment/decrement in-flight count, yield before mutex acquisition to ensure observable overlap, and preserve idempotent replace-on-collision semantics.
Integration test for bounded concurrency `src/openhuman/learning/transcript_ingest/tests.rs`	New test `ingest_persists_candidates_with_bounded_concurrency` ingests 10 distinct preferences, verifies persisted entries ≥10, asserts observed peak in-flight ≤ `PERSIST_CONCURRENCY`, and confirms at least two persist operations overlapped.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

tinyhumansai/openhuman#3216: Related changes to concurrent persistence of reflection-related memory store operations.

Suggested labels

working

Suggested reviewers

senamakel
M3gA-Mind

Poem

🐰 I bounded my hops with careful care,
Spawning saves that danced in air.
Atomics watched each overlapping beat,
Ten small treasures tucked in neat. ✨

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title clearly and concisely summarizes the main change: replacing sequential persistence with bounded-concurrency execution (using buffer_unordered) in transcript ingestion, which is the primary performance optimization in this PR.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

🧹 Nitpick comments (1)

src/openhuman/learning/transcript_ingest/mod.rs (1)

135-170: ⚡ Quick win

Add transcript correlation fields to the concurrent failure logs.

Up to PERSIST_CONCURRENCY persist warnings can now interleave, but these branches don't carry the transcript context needed to tie a failure back to a specific ingest. Please include stable fields like thread and path on both warnings, especially the reflection branch.

🪵 Suggested logging tweak

+    let thread_log = thread_id.as_deref().unwrap_or("-");
+    let path_log = path_display.as_str();
+
     let candidate_futs: Vec<_> = kept
         .iter()
         .map(|candidate| async move {
             match persist::store_candidate(memory, candidate).await {
                 Ok(()) => 1usize,
                 Err(err) => {
                     log::warn!(
-                        "[transcript_ingest] failed to persist candidate kind={:?} importance={:?}: {err}",
+                        "[transcript_ingest] failed to persist candidate thread={} path={} kind={:?} importance={:?}: {err}",
+                        thread_log,
+                        path_log,
                         candidate.kind,
                         candidate.importance
                     );
                     0usize
                 }
             }
         })
         .collect();
@@
                 Ok(()) => 1usize,
                 Err(err) => {
-                    log::warn!("[transcript_ingest] failed to persist reflection: {err}");
+                    log::warn!(
+                        "[transcript_ingest] failed to persist reflection thread={} path={}: {err}",
+                        thread_log,
+                        path_log
+                    );
                     0usize
                 }
             }
         })
         .collect();

As per coding guidelines, "In Rust, default to verbose diagnostics on new/changed flows using `log`/`tracing` at `debug`/`trace` levels with stable grep-friendly prefixes and correlation fields".

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/openhuman/learning/transcript_ingest/mod.rs` around lines 135 - 170, The
concurrent persist failure logs in the async closures wrapping
persist::store_candidate and persist::store_reflection lack transcript
correlation fields; update both log lines to include stable correlation fields
(e.g., thread and path) and other relevant identifiers (retain candidate.kind
and candidate.importance for the candidate branch, and include reflection
id/marker for reflections) — also lower the level to debug/trace per diagnostics
guidelines if appropriate; locate the warn calls inside the closures mapping
kept and kept_reflections and augment the log invocation to include thread and
path fields.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Nitpick comments:
In `@src/openhuman/learning/transcript_ingest/mod.rs`:
- Around line 135-170: The concurrent persist failure logs in the async closures
wrapping persist::store_candidate and persist::store_reflection lack transcript
correlation fields; update both log lines to include stable correlation fields
(e.g., thread and path) and other relevant identifiers (retain candidate.kind
and candidate.importance for the candidate branch, and include reflection
id/marker for reflections) — also lower the level to debug/trace per diagnostics
guidelines if appropriate; locate the warn calls inside the closures mapping
kept and kept_reflections and augment the log invocation to include thread and
path fields.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: a277d27b-a224-4315-bf57-91faa33e4091

📥 Commits

Reviewing files that changed from the base of the PR and between b156392 and 43405ae.

📒 Files selected for processing (2)

src/openhuman/learning/transcript_ingest/mod.rs
src/openhuman/learning/transcript_ingest/tests.rs

…logs Address CodeRabbit nitpick: the per-item warn! logs in the bounded-concurrency persist closures lacked transcript correlation. Add stable thread/path fields (plus reflection theme) so background-ingest failures can be traced to the source conversation, per the repo debug-logging rule.

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/openhuman/learning/transcript_ingest/mod.rs`:
- Around line 139-140: The logs currently emit unredacted transcript-derived
fields—replace use of path_display/path_label and thread_id with a non-sensitive
label (e.g., compute and log only the file basename or a fixed placeholder via
thread_id.as_deref().map(|_| "redacted") ) instead of the full path, and do not
log reflection.theme verbatim; either drop it from the warn logs or replace it
with a hashed/blurred value (e.g., hash the theme string) before emitting.
Update places referencing path_label/thread_id and reflection.theme (variables
thread_id, path_display, path_label, and reflection.theme in the transcript
ingestion flow) to apply redaction or hashing consistently for all warn logs.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 7439e7fc-5a67-48f5-9873-ea4ec3d9480f

📥 Commits

Reviewing files that changed from the base of the PR and between 43405ae and 3c1f52a.

📒 Files selected for processing (1)

src/openhuman/learning/transcript_ingest/mod.rs

Address CodeRabbit: the per-item warn! logs emitted the full transcript path (leaks the home directory) and reflection.theme verbatim (conversational content). Log the transcript basename instead of the full path and drop the theme, keeping only non-sensitive enum fields (kind/importance), per the repo "never log full PII" rule.

mysma-9403 requested a review from a team June 2, 2026 20:36

coderabbitai Bot added feature Net-new user-facing capability or product behavior. memory Memory store, memory tree, recall, summarization, and embeddings in src/openhuman/memory/. labels Jun 2, 2026

coderabbitai Bot reviewed Jun 2, 2026

View reviewed changes

coderabbitai Bot previously approved these changes Jun 2, 2026

View reviewed changes

mysma-9403 dismissed coderabbitai[bot]’s stale review via 3c1f52a June 2, 2026 21:18

coderabbitai Bot added the working A PR that is being worked on by the team. label Jun 2, 2026

coderabbitai Bot requested changes Jun 2, 2026

View reviewed changes

Comment thread src/openhuman/learning/transcript_ingest/mod.rs Outdated

coderabbitai Bot approved these changes Jun 2, 2026

View reviewed changes

senamakel merged commit b7152d6 into tinyhumansai:main Jun 2, 2026
22 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf(learning): bounded-concurrency persist in transcript ingestion#3234

perf(learning): bounded-concurrency persist in transcript ingestion#3234
senamakel merged 3 commits into
tinyhumansai:mainfrom
mysma-9403:perf/transcript-ingest-bounded-concurrency

mysma-9403 commented Jun 2, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented Jun 2, 2026 •

edited

Loading

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested labels

Suggested reviewers

Poem

Uh oh!

coderabbitai Bot left a comment

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

mysma-9403 commented Jun 2, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Why bounded (not unbounded join_all)

Implementation note (HRTB)

Behavior preserved

Test plan

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Jun 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested labels

Suggested reviewers

Poem

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

mysma-9403 commented Jun 2, 2026 •

edited by coderabbitai Bot

Loading

Why bounded (not unbounded `join_all`)

coderabbitai Bot commented Jun 2, 2026 •

edited

Loading