[CARBONDATA-3920]Fix compaction failure issue for SI table and metadata mismatch in concurrency #3854

akashrn5 · 2020-07-20T09:21:47Z

Why is this PR needed?

When load and compaction happen concurrently (as seen in reliability tests), segment data gets deleted from the SI table, which leads to exceptions and failures.
During compaction, pre-priming was happening for the SI table segment before the SI segment was marked as success.

What changes were proposed in this PR?

Remove the unnecessary cleaning API call from the SI flow. Segment locks for SI were getting released before compaction success; handle that.
Refactor the SI load after main table compaction so that pre-priming happens only after the segments are marked as success.

Does this PR introduce any user interface change?

No

Is any new testcase added?

No (tested in a cluster with a concurrency of 10 and around 1000 loads)

CarbonDataQA1 · 2020-07-20T11:40:16Z

Build Failed with Spark 2.3.4, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/3439/

CarbonDataQA1 · 2020-07-20T11:40:44Z

Build Success with Spark 2.4.5, Please check CI http://121.244.95.60:12545/job/ApacheCarbon_PR_Builder_2.4.5/1697/

CarbonDataQA1 · 2020-07-22T15:10:29Z

Build Success with Spark 2.4.5, Please check CI http://121.244.95.60:12545/job/ApacheCarbon_PR_Builder_2.4.5/1729/

CarbonDataQA1 · 2020-07-22T15:28:02Z

Build Success with Spark 2.3.4, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/3471/

ajantha-bhat · 2020-07-23T14:06:37Z

.../src/main/scala/org/apache/spark/sql/secondaryindex/events/CleanFilesPostEventListener.scala

+    var mainTableLocked = false
+    var indexTableLocked = false
+    try {
+      mainTableLocked = mainTableStatusLock.lockWithRetries()


If we are unable to get the lock in a concurrent scenario, would it be better to throw an exception so that the clean files command is retried?

Since it's clean files, there is no need to throw an exception; it can retry next time.

At least add an error log saying the lock could not be acquired, so that the user knows something happened and needs to retry.

Otherwise the user may try multiple times in a concurrent scenario, nothing gets cleaned because of the lock issue, and they will never know why.
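
The outcome of this thread (skip quietly instead of throwing, but leave an error log) can be sketched as below. This is a minimal illustration only: `StatusLock`, `FakeLock`, `errors`, and `cleanWithLock` are hypothetical names, and only the `lockWithRetries()` call and the `locked` flag pattern mirror the diff above. It is not the actual listener code.

```scala
import scala.collection.mutable.ListBuffer

object CleanFilesLockSketch {
  // Hypothetical stand-in for a table status lock (not the real CarbonData lock type).
  trait StatusLock {
    def lockWithRetries(): Boolean
    def unlock(): Boolean
  }

  // Test double: the lock is either always available or never available.
  final class FakeLock(available: Boolean) extends StatusLock {
    var held = false
    def lockWithRetries(): Boolean = { held = available; held }
    def unlock(): Boolean = { val wasHeld = held; held = false; wasHeld }
  }

  // Collected messages stand in for a real logger (assumption for the sketch).
  val errors = ListBuffer.empty[String]

  // Returns true if the cleanup ran, false if it was skipped because the lock was unavailable.
  def cleanWithLock(tableName: String, lock: StatusLock)(cleanup: => Unit): Boolean = {
    var locked = false
    try {
      locked = lock.lockWithRetries()
      if (locked) {
        cleanup
        true
      } else {
        // No exception: the command can be retried, but the user is told why nothing was cleaned.
        errors += s"Unable to get table status lock for $tableName; clean files skipped, please retry"
        false
      }
    } finally {
      // Unlock only what was actually acquired.
      if (locked) lock.unlock()
    }
  }
}
```

With this shape a failed lock attempt is visible in the log while the command itself still completes.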

ajantha-bhat · 2020-07-23T14:16:29Z

.../src/main/scala/org/apache/spark/sql/secondaryindex/events/CleanFilesPostEventListener.scala

+          detail.setSegmentStatus(segToStatusMap(detail.getLoadName))
+          detail.setVisibility("false")
+        }
+        indexTableStatusLock.unlock()


Release the lock after updating the SI table status. Right now it is released before the update, which can impact concurrent scenarios.
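
The ordering asked for here can be sketched as follows: modify the details, write the status, and only then unlock, with the unlock in a `finally` so it also runs when the write fails. `IndexStatusLock`, `events`, and `updateIndexTableStatus` are hypothetical names used for illustration, not the code in this PR.

```scala
import scala.collection.mutable.ListBuffer

object UnlockOrderSketch {
  // Records the order of operations so the sequence can be inspected.
  val events = ListBuffer.empty[String]

  // Hypothetical lock that logs when it is taken and released.
  object IndexStatusLock {
    def lockWithRetries(): Boolean = { events += "lock"; true }
    def unlock(): Unit = events += "unlock"
  }

  // writeStatus stands in for persisting the SI table status file.
  def updateIndexTableStatus(writeStatus: () => Unit): Unit = {
    var locked = false
    try {
      locked = IndexStatusLock.lockWithRetries()
      if (locked) {
        events += "modify-details" // e.g. set segment status and visibility
        writeStatus()              // the write happens while the lock is still held
      }
    } finally {
      // Released only after the status has been written (or the write has failed).
      if (locked) IndexStatusLock.unlock()
    }
  }
}
```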

ajantha-bhat · 2020-07-23T15:25:28Z

.../src/main/scala/org/apache/spark/sql/secondaryindex/events/CleanFilesPostEventListener.scala

+          detail.setSegmentStatus(segToStatusMap(detail.getLoadName))
+          detail.setVisibility("false")
+        }
+        CarbonInternalLoaderUtil.recordLoadMetadata(


This will fail, as it will try to acquire the lock and we haven't released it.

Here we need to call writeLoadDetailsIntoFile directly, as we already hold the lock.
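
A small sketch of why the nested call fails, assuming the status lock is not reentrant (as the comment above implies). `NonReentrantLock`, `recordWithLock`, and `writeDirectly` are hypothetical stand-ins for a lock-acquiring helper and a plain write; they do not reproduce the signatures of `recordLoadMetadata` or `writeLoadDetailsIntoFile`.

```scala
object HeldLockSketch {
  // Hypothetical non-reentrant lock: a second acquire fails while it is held.
  final class NonReentrantLock {
    private var held = false
    def tryLock(): Boolean = if (held) false else { held = true; true }
    def unlock(): Unit = held = false
  }

  // Last details written; stands in for the table status file.
  var written: List[String] = Nil

  // Writes without touching the lock; the caller must already hold it.
  def writeDirectly(details: List[String]): Unit = written = details

  // Acquires the lock itself, so it returns false if the caller already holds it.
  def recordWithLock(lock: NonReentrantLock, details: List[String]): Boolean =
    if (lock.tryLock()) {
      try { writeDirectly(details); true } finally lock.unlock()
    } else false
}
```

When the caller already holds the lock, the lock-acquiring helper cannot proceed, so the plain write is the one to call inside the locked region.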

CarbonDataQA1 · 2020-07-23T16:51:51Z

Build Success with Spark 2.3.4, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/3483/

CarbonDataQA1 · 2020-07-23T16:56:11Z

Build Success with Spark 2.4.5, Please check CI http://121.244.95.60:12545/job/ApacheCarbon_PR_Builder_2.4.5/1741/

CarbonDataQA1 · 2020-07-23T19:42:18Z

Build Success with Spark 2.4.5, Please check CI http://121.244.95.60:12545/job/ApacheCarbon_PR_Builder_2.4.5/1743/

CarbonDataQA1 · 2020-07-23T19:44:03Z

Build Success with Spark 2.3.4, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/3485/

akashrn5 · 2020-07-24T04:57:48Z

@ajantha-bhat please review and merge

ajantha-bhat · 2020-07-24T12:31:17Z

LGTM

akashrn5 force-pushed the SI_concurrent branch from fb107e0 to 45aac81 Compare July 22, 2020 12:20

akashrn5 changed the title ~~[WIP]Fix compaction failure issue for SI table and metadata mismatch in concurrency~~ [CARBONDATA-3920]Fix compaction failure issue for SI table and metadata mismatch in concurrency Jul 22, 2020

ajantha-bhat reviewed Jul 23, 2020

akashrn5 force-pushed the SI_concurrent branch from 45aac81 to 251b08e Compare July 23, 2020 14:25

ajantha-bhat reviewed Jul 23, 2020

Fix compaction failure issue for SI table and metadata mismatch in concurrency

12efda9

akashrn5 force-pushed the SI_concurrent branch from 251b08e to 12efda9 Compare July 23, 2020 17:14

asfgit closed this in 30eefe3 Jul 24, 2020
