flake: TestOpenAIResponsesFullReplayPairsReasoningAndWebSearch #1508

@flake-investigator

CI Run: https://github.com/coder/coder/actions/runs/25131965591
Failing Job: https://github.com/coder/coder/actions/runs/25131965591/job/73662589528 (test-go-pg (ubuntu-latest))
Commit: 70d46943db313d3b60a5166186002d5e743ac225 (Asher)

Failure:

  • coderd/x/chatd TestOpenAIResponsesFullReplayPairsReasoningAndWebSearch
  • Error: expected 2 OpenAI requests, but the recorder had only 1.
  • Log snippet:
=== FAIL: coderd/x/chatd TestOpenAIResponsesFullReplayPairsReasoningAndWebSearch (0.31s)
    integration_responses_test.go:180:
        Error: "[{[map[content:You are the Coder agent — ... [map[text:search for the latest Coder docs type:input_text]] role:user]] ...}]" should have 2 item(s), but has 1
  • Nearby context shows the chat being interrupted mid-processing:
processor: chat interrupted  chat_id=c7cb92e5-ac19-44a5-8e08-3661813052da

Root Cause Assessment:

  • Likely a flaky chatd integration test: the follow-up message does not trigger the second OpenAI Responses request, so the recorder holds only the initial request.
  • Searched logs for WARNING: DATA RACE, panic:, and OOM indicators; none found.
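
If the cause turns out to be a timing gap between sending the follow-up message and the second request reaching the recorder (not confirmed by the logs above), one possible mitigation is to poll the recorder until it holds the expected number of requests instead of asserting on its length immediately. The sketch below is illustrative only: requestRecorder and waitForRequests are hypothetical names, not the types or helpers used in coderd/x/chatd.

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// requestRecorder is a hypothetical stand-in for the test's OpenAI request
// recorder; it is not the type used in coderd/x/chatd.
type requestRecorder struct {
	mu       sync.Mutex
	requests []string
}

// record appends one captured request body under the lock.
func (r *requestRecorder) record(body string) {
	r.mu.Lock()
	defer r.mu.Unlock()
	r.requests = append(r.requests, body)
}

// count returns how many requests have been captured so far.
func (r *requestRecorder) count() int {
	r.mu.Lock()
	defer r.mu.Unlock()
	return len(r.requests)
}

// waitForRequests polls until the recorder holds at least want requests or
// the timeout elapses, and reports whether the target was reached.
func waitForRequests(r *requestRecorder, want int, timeout, tick time.Duration) bool {
	deadline := time.Now().Add(timeout)
	for {
		if r.count() >= want {
			return true
		}
		if time.Now().After(deadline) {
			return false
		}
		time.Sleep(tick)
	}
}

func main() {
	rec := &requestRecorder{}
	rec.record("initial request")

	// Simulate the follow-up request arriving slightly later.
	go func() {
		time.Sleep(20 * time.Millisecond)
		rec.record("follow-up request")
	}()

	reached := waitForRequests(rec, 2, time.Second, 5*time.Millisecond)
	fmt.Println("reached 2 requests:", reached, "count:", rec.count())
}
```

Polling only helps if the second request is eventually sent. If the "chat interrupted" path means the follow-up is dropped entirely, the wait would time out and the underlying ordering problem would still need to be fixed.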

Assignment Analysis:

  • Failing test location: coderd/x/chatd/integration_responses_test.go (TestOpenAIResponsesFullReplayPairsReasoningAndWebSearch, around line 180).
  • Tooling does not expose git blame -L directly; used github_list_commits on the test file as a proxy for recent ownership.
  • Most recent meaningful change in this file is commit 62e9752acd3212991de1c4a5da3d43bdf6da1fb9 by @ibetitsmike ("fix: prevent malformed OpenAI Responses continuations"), which introduced/updated these OpenAI Responses replay tests.
  • Assigning to @ibetitsmike based on most recent test-file changes in this area.

Related Issues:

Reproduction:

  • GOWORK=off go test ./coderd/x/chatd -run TestOpenAIResponsesFullReplayPairsReasoningAndWebSearch -count=1
