New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
KYLIN-4035 Calculate column cardinality by using spark engine #680
Conversation
Can one of the admins verify this patch? |
1 similar comment
Can one of the admins verify this patch? |
Codecov Report
@@ Coverage Diff @@
## master #680 +/- ##
========================================
Coverage ? 25.7%
Complexity ? 6011
========================================
Files ? 1386
Lines ? 82510
Branches ? 11568
========================================
Hits ? 21207
Misses ? 59258
Partials ? 2045
Continue to review full report at Codecov.
|
}) | ||
.sortByKey(true, 1); | ||
|
||
if (resultRdd.count() == 0) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Both count
and saveAsNewAPIHadoopFile
are action of RDD
, I think here resultRdd
should be cached to avoid recompute, am I right?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, It's a good point. I forgot to cache it.
I will add cache, Thank you !
@@ -1430,6 +1430,10 @@ public boolean isSparkFactDistinctEnable() { | |||
return Boolean.parseBoolean(getOptional("kylin.engine.spark-fact-distinct", "false")); | |||
} | |||
|
|||
public boolean isSparkCardinalityEnabled(){ | |||
return Boolean.parseBoolean(getOptional("kylin.engin.spark-cardinality", "false")); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"engin" should be "engine"
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Fine to me
link to https://issues.apache.org/jira/browse/KYLIN-4035
Support calculating column cardinality by using spark engine