Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

etcd client outCh blocking too long when pause changefeed #10629

Closed
CharlesCheung96 opened this issue Feb 20, 2024 · 1 comment · Fixed by #10686
Closed

etcd client outCh blocking too long when pause changefeed #10629

CharlesCheung96 opened this issue Feb 20, 2024 · 1 comment · Fixed by #10686
Assignees
Labels
affects-7.5 area/ticdc Issues or PRs related to TiCDC. severity/moderate This is a moderate bug. type/bug This is a bug.

Comments

@CharlesCheung96
Copy link
Contributor

CharlesCheung96 commented Feb 20, 2024

What did you do?

  1. Create changefeed with pulsar sink
  2. Stop the pulsar server
  3. Delete some data from upstream
  4. Pause the changefeed

What did you expect to see?

No response

What did you see instead?

[2024/02/20 20:21:00.048 +08:00] [WARN] [client.go:272] ["etcd client outCh blocking too long, the etcdWorker may be stuck"] [duration=14m2.000284532s] [role=processor]
[2024/02/20 20:21:00.406 +08:00] [WARN] [client.go:272] ["etcd client outCh blocking too long, the etcdWorker may be stuck"] [duration=8m16.000233667s] [role=owner]

ticdc.log

goroutine.log

Versions of the cluster

Upstream TiDB cluster version (execute SELECT tidb_version(); in a MySQL client):

(paste TiDB cluster version here)

Upstream TiKV version (execute tikv-server --version):

(paste TiKV version here)

TiCDC version (execute cdc version):

2eadc08f4bd64d00250e9ce6f7c69eda5498464c
@CharlesCheung96 CharlesCheung96 added type/bug This is a bug. area/ticdc Issues or PRs related to TiCDC. labels Feb 20, 2024
@github-actions github-actions bot added this to Need Triage in Question and Bug Reports Feb 20, 2024
@asddongmen asddongmen added severity/moderate This is a moderate bug. affects-7.5 labels Feb 21, 2024
@asddongmen asddongmen moved this from Need Triage to In Progress in Question and Bug Reports Feb 21, 2024
@asddongmen
Copy link
Contributor

This is because the close method of pulsar client and pulsar producer is blocked when downstream pulsar server is down.
A possible fix is to spawn a goroutine to close pulsar client and producer.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
affects-7.5 area/ticdc Issues or PRs related to TiCDC. severity/moderate This is a moderate bug. type/bug This is a bug.
2 participants