Skip to content

Conversation

@benjamin-awd
Copy link
Contributor

@benjamin-awd benjamin-awd commented Oct 24, 2025

Summary

This PR fixes Vector entering a "zombie" state when the connect_timeout_secs threshold is reached with multiple sources running at the same time. The previous behavior was to retry until connect_timeout_secs runs out, but it probably makes more sense to apply the connect timeout per connect attempt and retry indefinitely.

How did you test this PR?

Tested using example from #23992

Change Type

  • Bug fix
  • New feature
  • Non-functional (chore, refactoring, docs)
  • Performance

Is this a breaking change?

  • Yes
  • No

Does this PR include user facing changes?

  • Yes. Please add a changelog fragment based on our guidelines.
  • No. A maintainer will apply the no-changelog label to this PR.

References

Closes #23992

@benjamin-awd benjamin-awd requested a review from a team as a code owner October 24, 2025 16:12
@github-actions github-actions bot added the domain: sources Anything related to the Vector's sources label Oct 24, 2025
@thomasqueirozb
Copy link
Contributor

Hi @benjamin-awd, thanks for this. Since this alters Vector's behavior and has been part of a previous release this is considered user facing and therefore needs a changelog. I'll take a look again after another commit is pushed

@thomasqueirozb thomasqueirozb added meta: awaiting author Pull requests that are awaiting their author. source: websocket Anything `websocket` source related labels Oct 24, 2025
@github-actions github-actions bot removed the meta: awaiting author Pull requests that are awaiting their author. label Oct 25, 2025
@pront pront enabled auto-merge October 30, 2025 18:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

domain: sources Anything related to the Vector's sources source: websocket Anything `websocket` source related

Projects

None yet

Development

Successfully merging this pull request may close these issues.

WebSocket source reconnect failure doesn't cause process termination

3 participants