disk_snapshot_backup: init pod may get stuck due to concurrency call to Send
#52049
Labels
affects-6.5
affects-7.1
affects-7.5
component/br
This issue is related to BR of TiDB.
severity/major
type/bug
This issue is a bug.
Bug Report
Please answer these questions before submitting your issue. Thanks!
1. Minimal reproduce step (Required)
Run a disk snapshot backup in a cluster that huge enough.
(Or just injecting some failpoints that make sometimes renewing the lease and sending wait apply happens concurrently.
2. What did you expect to see? (Required)
It should success to prepare -- nothing wrong happens.
3. What did you see instead (Required)
We were stuck at sending wait apply. When applying for quotas.
4. What is your TiDB version? (Required)
v6.5.x
But this may still happen in master.
NOTE
We call
Send
concurrently over a stream. Which isn't safe according to the document ofClientStream
:The text was updated successfully, but these errors were encountered: