Harden rollout JSONL writes by etraut-openai · Pull Request #24471 · openai/codex

etraut-openai · 2026-05-25T18:16:29Z

Why

Mitigates #24425

A rollout write can fail after partially appending a JSONL record. The retry path then reopens the same file and writes the pending item again, which can splice a partial record and a retried full record onto the same physical line. That leaves the rollout unreadable even though the retried item itself is valid JSON.

What changed

Roll back a failed JSONL line write to the file length captured before the write started.
Repair the final rollout line before appending: preserve valid JSONL that is missing only its trailing newline, and truncate malformed trailing fragments.
Open rollout files with read/write access so repair and rollback can truncate on platforms where append-only handles are not sufficient.
Log ErrorKind and raw OS errno for rollout writer failures so future filesystem failures are easier to diagnose.
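The final-line repair described in the list above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `repair_final_line` and the `is_valid_record` callback are hypothetical names, the callback stands in for whatever JSON validation the real writer uses, and the whole file is read for simplicity where a real implementation would likely inspect only the tail.

```rust
use std::fs::File;
use std::io::{self, Read, Seek, SeekFrom, Write};

/// Hypothetical helper: repairs the final line of a JSONL file in place.
/// `is_valid_record` decides whether a byte slice is one complete record
/// (for example by running a JSON parser); the caller supplies it.
/// `file` is assumed to be opened with read and write access.
pub fn repair_final_line(
    file: &mut File,
    is_valid_record: impl Fn(&[u8]) -> bool,
) -> io::Result<()> {
    let mut contents = Vec::new();
    file.seek(SeekFrom::Start(0))?;
    file.read_to_end(&mut contents)?;

    // Nothing to repair if the file is empty or already newline-terminated.
    if contents.is_empty() || contents.ends_with(b"\n") {
        return Ok(());
    }

    // The unterminated tail starts just after the last newline, or at 0.
    let tail_start = contents
        .iter()
        .rposition(|&b| b == b'\n')
        .map_or(0, |i| i + 1);

    if is_valid_record(&contents[tail_start..]) {
        // Complete record missing only its newline: add the terminator.
        file.seek(SeekFrom::End(0))?;
        file.write_all(b"\n")?;
    } else {
        // Malformed fragment: drop it so the next append starts a fresh line.
        file.set_len(tail_start as u64)?;
        file.seek(SeekFrom::End(0))?;
    }
    file.flush()
}
```

Passing the validity check as a closure keeps the sketch free of external crates; in practice it would wrap a real JSON parser.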

Fixes #24425

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: e7581cbdd6


chatgpt-codex-connector · 2026-05-25T18:20:40Z

+    let mut file = std::fs::OpenOptions::new()
+        .read(true)
+        .write(true)
        .create(true)
-        .open(path)
+        .truncate(false)


Preserve append mode for live rollout writers

When a live thread has been persisted and then metadata changes, update_thread_metadata appends a SessionMeta line through append_rollout_item_to_path while this live writer handle stays open. Because this handle is now plain read/write instead of append(true), its cursor remains at the old EOF; the next live rollout item writes from that stale offset and overwrites the metadata append, corrupting the JSONL history. Keep append semantics on the recorder handle while still allowing the read/truncate operations used for repair.
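The stale-offset hazard described above can be reproduced with plain `std::fs` handles. The sketch below is illustrative only: `interleaved_writes` is a hypothetical function, the second handle stands in for the metadata append path, and the short strings stand in for JSONL records.

```rust
use std::fs::{self, OpenOptions};
use std::io::{self, Seek, SeekFrom, Write};
use std::path::Path;

/// Writes "a\n" and later "b\n" through a long-lived "recorder" handle while
/// a second, append-only handle adds "meta\n" in between. Returns the final
/// file contents. `recorder_appends` selects whether the recorder handle is
/// opened in append mode.
pub fn interleaved_writes(path: &Path, recorder_appends: bool) -> io::Result<String> {
    fs::write(path, b"")?; // start from an empty file

    let mut recorder = if recorder_appends {
        OpenOptions::new().read(true).append(true).open(path)?
    } else {
        let mut f = OpenOptions::new().read(true).write(true).open(path)?;
        f.seek(SeekFrom::End(0))?; // position once at the current end of file
        f
    };
    recorder.write_all(b"a\n")?;

    // A separate append-only handle, standing in for the metadata append.
    let mut metadata = OpenOptions::new().append(true).open(path)?;
    metadata.write_all(b"meta\n")?;

    // Non-append: this lands at the recorder's stale cursor (offset 2).
    // Append: the OS places the write at the current end of file.
    recorder.write_all(b"b\n")?;

    fs::read_to_string(path)
}
```

With `recorder_appends` set to `false`, the file ends up as `"a\nb\nta\n"`: the second recorder write overwrites the start of the metadata line. With `true`, the result is `"a\nmeta\nb\n"` and all three records survive.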


chatgpt-codex-connector · 2026-05-25T18:20:40Z

+                let mut file = std::fs::OpenOptions::new()
+                    .read(true)
+                    .write(true)
+                    .open(&path)?;


Preserve append mode for resumed rollout writers

The same stale-offset corruption affects resumed live threads: after RolloutRecorder::new(...Resume...) opens this non-append handle and later update_thread_metadata appends a metadata line via a separate append-only handle, subsequent resumed-thread writes use this handle's old EOF position and overwrite that appended record. The resumed writer should retain append/O_APPEND behavior in addition to the read/write permissions needed for final-line repair.


chatgpt-codex-connector · 2026-05-25T18:20:40Z

+        let start_len = self.file.metadata().await?.len();
+        if let Err(err) = async {
+            self.file.write_all(json.as_bytes()).await?;
+            self.file.flush().await
+        }
+        .await
+        {
+            if let Err(rollback_err) = self.file.set_len(start_len).await {


Avoid truncating records from other appenders

If another writer appends to the same rollout after this start_len is captured and this writer then hits a write/flush error, set_len(start_len) rolls the entire file back to the stale length and deletes the other writer's successfully appended records. This can happen with the separate metadata append path (append_rollout_item_to_path) or another resumed process, so the rollback needs to be guarded to only remove bytes written by this attempt, or skipped when the file has grown beyond this write's range.
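One way to express the guard suggested above is a small decision function that is consulted before calling `set_len`. This is a sketch under assumptions, not the PR's code: `rollback_target` is a hypothetical name, and the check is only a heuristic. It cannot detect another appender whose bytes happen to fit inside this attempt's range, so a complete fix would still need locking or a single-writer design.

```rust
/// Hypothetical guard: returns the length to truncate to, or `None` when
/// truncating is unnecessary or could delete bytes this attempt did not write.
///
/// * `start_len`: file length captured before the write started
/// * `attempted_len`: number of bytes this attempt tried to write
/// * `current_len`: file length observed after the write failed
pub fn rollback_target(start_len: u64, attempted_len: u64, current_len: u64) -> Option<u64> {
    if current_len <= start_len {
        // Nothing was appended (or the file shrank underneath us):
        // there is nothing of ours to remove.
        return None;
    }
    if current_len > start_len.saturating_add(attempted_len) {
        // More bytes arrived than this attempt could have written, so
        // another appender is active: skip the rollback.
        return None;
    }
    // The growth fits within this attempt's range: roll back to the start.
    Some(start_len)
}
```

In the hunk above, the failure branch would re-read the length with `self.file.metadata().await?.len()` and call `set_len` only when this function returns `Some`.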


Harden rollout JSONL writes

e7581cb

Fixes #24425

chatgpt-codex-connector Bot reviewed May 25, 2026


etraut-openai closed this May 25, 2026

EthanSK mentioned this pull request May 26, 2026

code-mode rollout: exec_command stdout framing leaks into input_image base64, corrupting Responses API requests #24582

Open

Harden rollout JSONL writes #24471
etraut-openai wants to merge 1 commit into main from etraut/rollout-jsonl-repair
