[hive] Fix hive catalog lock may encounter deadlock. #6783
Merged
Purpose
Linked issue: close #6782
“Hive lock may encounter deadlock” can have several causes, for example:
1. The first table task is still running, so subsequent tasks cannot acquire the Hive table lock and time out.
2. Acquiring the Hive metastore lock is delayed, which also leads to a timeout.
In my latest changes:
1. Added detailed logs that show whether a lock acquisition failure was caused by a timeout or by another lock state.
paimon/paimon-hive/paimon-hive-catalog/src/main/java/org/apache/paimon/hive/HiveCatalogLock.java
Lines 116 to 120 in 3342ef9
2. Fixed an issue where lockResponse = clients.run(client -> client.checkLock(lockId)); could throw an exception without the lock being released, which prevented subsequent tasks from acquiring the lock.
paimon/paimon-hive/paimon-hive-catalog/src/main/java/org/apache/paimon/hive/HiveCatalogLock.java
Lines 95 to 112 in 3342ef9
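For reviewers who want the idea without opening the diff, here is a minimal sketch of the two changes. It is not the code in HiveCatalogLock.java: LockSketch, State, LockClient, acquire and releaseQuietly are hypothetical stand-ins for the metastore client and its lock states, and the retry parameters are made up for illustration.

```java
final class LockSketch {

    /** Stand-in for the metastore lock states; not the real Hive enum. */
    enum State { ACQUIRED, WAITING, NOT_ACQUIRED }

    /** Hypothetical minimal client; the real code goes through a client pool. */
    interface LockClient {
        State checkLock(long lockId) throws Exception;
        void unlock(long lockId) throws Exception;
    }

    /** Polls until the lock is acquired, releasing it on every failure path. */
    static void acquire(LockClient client, long lockId, int maxChecks, long sleepMillis)
            throws Exception {
        State state = State.WAITING;
        int checks = 0;
        try {
            while (state == State.WAITING && checks < maxChecks) {
                Thread.sleep(sleepMillis);
                state = client.checkLock(lockId);
                checks++;
            }
        } catch (Exception e) {
            // Change 2: checkLock (or the sleep) threw, so release the
            // pending lock before rethrowing; otherwise later tasks stay blocked.
            releaseQuietly(client, lockId, e);
            throw e;
        }
        if (state != State.ACQUIRED) {
            // Change 1: say whether this was a timeout or another lock state.
            String reason = state == State.WAITING
                    ? "timed out after " + checks + " checks"
                    : "ended in state " + state;
            IllegalStateException failure =
                    new IllegalStateException("Could not acquire lock " + lockId + ": " + reason);
            releaseQuietly(client, lockId, failure);
            throw failure;
        }
    }

    /** Unlocks and attaches any unlock failure to the original cause. */
    static void releaseQuietly(LockClient client, long lockId, Throwable cause) {
        try {
            client.unlock(lockId);
        } catch (Exception unlockFailure) {
            cause.addSuppressed(unlockFailure);
        }
    }
}
```

The sketch attaches an unlock failure to the original exception as a suppressed exception, so the first error is the one the caller sees. That is a design choice of this sketch, not necessarily what the PR does.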
Tests
API and Format
Documentation