[HOTFIX] Fix task id in FileFormat write #3324
Conversation
Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/3810/ |
Build Failed with Spark 2.3.2, Please check CI http://136.243.101.176:8080/job/carbondataprbuilder2.3/12083/ |
Build Failed with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/4015/ |
Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/3811/ |
/**
 * counter used for generating unique task numbers.
 */
val counter = new AtomicLong()
Can we use UUID instead of this shared variable to generate the id? In practice, UUID is better than a timestamp.
@jackylk: ok, I will check.
@ravipesala: I have used logic similar to SparkCarbonTableFormat. Is there any issue that kept UUID from being used there?
@jackylk: in carbon, the task id is treated as a number, not a string, so queries might fail. Changing the number to a string would need more changes. So here the unique id is generated from the task id and the counter.
understood.
@@ -49,7 +49,7 @@ public CarbonThreadFactory(String name, boolean withTime) {
   @Override public Thread newThread(Runnable r) {
     final Thread thread = defaultFactory.newThread(r);
     if (withTime) {
-      thread.setName(name + "_" + System.currentTimeMillis());
+      thread.setName(name + "_" + System.nanoTime());
Here it would be better to use the generated task number as the thread id, so that debugging is easier.
I just made it more accurate by changing from milliseconds to nanoseconds.
This is not directly linked with the Spark task number, so we cannot use it here; this thread pool is used internally in many places.
I will analyse and handle it in another PR if required.
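For reference, a self-contained factory in the shape of the patched `CarbonThreadFactory` (a simplified sketch with hypothetical names, not the actual CarbonData class) that appends `System.nanoTime()` to the pool name, as the diff above does:

```java
import java.util.concurrent.Executors;
import java.util.concurrent.ThreadFactory;

// Sketch of a named thread factory; names and structure are illustrative,
// not copied from CarbonData.
public class NamedThreadFactory implements ThreadFactory {
    private final ThreadFactory defaultFactory = Executors.defaultThreadFactory();
    private final String name;
    private final boolean withTime;

    public NamedThreadFactory(String name, boolean withTime) {
        this.name = name;
        this.withTime = withTime;
    }

    @Override
    public Thread newThread(Runnable r) {
        final Thread thread = defaultFactory.newThread(r);
        // nanoTime gives a finer-grained suffix than currentTimeMillis when
        // many pools start in the same millisecond, though uniqueness is
        // still not guaranteed by the JVM.
        thread.setName(withTime ? name + "_" + System.nanoTime() : name);
        return thread;
    }

    public static void main(String[] args) {
        Thread t = new NamedThreadFactory("CarbonWriter", true).newThread(() -> {});
        System.out.println(t.getName().startsWith("CarbonWriter_")); // prints "true"
    }
}
```

Such a factory would typically be passed to `Executors.newFixedThreadPool(n, factory)` so every worker thread in the pool carries the pool's name plus the timestamp suffix.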
Build Failed with Spark 2.3.2, Please check CI http://136.243.101.176:8080/job/carbondataprbuilder2.3/12084/ |
LGTM |
retest this please |
Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/3812/ |
Build Success with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/4017/ |
Build Success with Spark 2.3.2, Please check CI http://136.243.101.176:8080/job/carbondataprbuilder2.3/12085/ |
@ajantha-bhat I feel concurrent loading may still have the issue, as we don't have the segment id in it. It would be safer to use UUID, but the changes required are a little large. |
Scenario is handled in #3325 |
Problem: in FileFormat write, carbon uses System.nanoTime() as the task id.
Cause: when multiple tasks are launched concurrently, there is a rare chance that two tasks get the same id. When that happens, two Spark tasks launched for one insert produce the same carbondata file name, and when both tasks write to that one file it is likely to be corrupted, which leads to query failures.
Solution: use our own unique task id instead of nanoseconds.
Here, the Spark task id combined with a global counter generates a task id that is unique across jobs.
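A minimal sketch of that scheme (hypothetical class and constant; not the actual CarbonData implementation): combine the Spark task id with a process-wide `AtomicLong` so that even two tasks started in the same nanosecond get distinct numeric ids.

```java
import java.util.concurrent.atomic.AtomicLong;

// Illustrative sketch of "spark task id + global counter"; the scaling
// factor and names are assumptions, not taken from the PR.
public class UniqueTaskId {
    // Process-wide counter shared by all write tasks in this JVM.
    private static final AtomicLong COUNTER = new AtomicLong();

    /**
     * Builds a numeric id: the Spark task id occupies the high digits and
     * the monotonically increasing counter the low digits, so the result
     * stays a number (carbon treats task ids as numbers, not strings).
     */
    public static long generate(long sparkTaskId) {
        return sparkTaskId * 100000L + COUNTER.incrementAndGet();
    }

    public static void main(String[] args) {
        long a = generate(7);
        long b = generate(7); // same Spark task id, still a distinct id
        System.out.println(a != b); // prints "true"
    }
}
```

Because `AtomicLong.incrementAndGet()` is atomic, concurrent tasks on the same executor can never observe the same counter value, which is exactly the collision `System.nanoTime()` could not rule out.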
Be sure to do all of the following checklist to help us incorporate
your contribution quickly and easily:
Any interfaces changed? NA
Any backward compatibility impacted? NA
Document update required? NA
Testing done
Done. Attached the report:
testReport.txt
For large changes, please consider breaking it into sub-tasks under an umbrella JIRA. [NA]