-
Notifications
You must be signed in to change notification settings - Fork 2.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SUPPORT] Hudi Application getting stuck when Async cleaner is spawned #7364
Labels
priority:major
degraded perf; unable to move forward; potential bugs
release-0.12.2
Patches targetted for 0.12.2
table-service
Comments
xushiyan
added
priority:major
degraded perf; unable to move forward; potential bugs
table-service
labels
Dec 6, 2022
@nsivabalan have you already looked into this? related to rollback and cleaner entanglement under lazy mode. |
any news on this? I am using hudi 0.14.1 on aws glue and getting from time to time the following error that seems to be related to this issue:
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
priority:major
degraded perf; unable to move forward; potential bugs
release-0.12.2
Patches targetted for 0.12.2
table-service
Tips before filing an issue
Have you gone through our FAQs?
Join the mailing list to engage in conversations and get faster support at dev-subscribe@hudi.apache.org.
If you have triaged this as a bug, then file an issue directly.
Describe the problem you faced
After some failed commits, Hudi application thread-pool is stuck while acquiring the lock without any progress.
A clear and concise description of the problem.
When Hudi application is stopped with some error or ungracefully for some time, when it is restarted it is getting stuck with the below logs without any progress for ~115 minutes.
To Reproduce
Steps to reproduce the behavior:
Expected behavior
Pipeline should not stuck and run without error.
A clear and concise description of what you expected to happen.
Environment Description
Hudi version : 0.12.1
Spark version : 3.2.2
Hive version : 2.3.5
Hadoop version : 2.7.7
Storage (HDFS/S3/GCS..) : GCS
Running on Docker? (yes/no) : yes
Additional context
config:
cleaner config
Add any other context about the problem here.
Stacktrace
The text was updated successfully, but these errors were encountered: