New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-33049][CORE] Decommission shuffle block test is flaky #29929
[SPARK-33049][CORE] Decommission shuffle block test is flaky #29929
Conversation
Kubernetes integration test starting |
Kubernetes integration test status success |
Test build #129328 has finished for PR 29929 at commit
|
jenkins retest this please |
Kubernetes integration test starting |
Kubernetes integration test status success |
Test build #129334 has finished for PR 29929 at commit
|
Jenkins failure is unrelated. |
Retest this please. |
Thank you, @holdenk . Merged to master for Apache Spark 3.1.0. |
Test build #129386 has finished for PR 29929 at commit
|
What changes were proposed in this pull request?
Increase the listener bus event length, syncrhonize the addition of blocks modified to the array list.
Why are the changes needed?
This test appears flaky in Jenkins (can not repro locally). Given that the index file made it through and the index file is only transferred after the data file, the only two reasons I could come up with an interminentent failure here are with the listenerbus dropping a message or the two block change messages being received at the same time.
Does this PR introduce any user-facing change?
No (test only).
How was this patch tested?
The tests still pass on my machine but they did before. We'll need to run it through jenkins a few times first.