Skip to content

Conversation

@wzhfy
Copy link
Contributor

@wzhfy wzhfy commented Dec 28, 2017

What changes were proposed in this pull request?

For empty/null column, the result of ApproximatePercentile is null. Then in ApproxCountDistinctForIntervals, a MatchError (for endpoints) will be thrown if we try to generate histogram for that column. Besides, there is no need to generate histogram for such column. In this patch, we exclude such column when generating histogram.

How was this patch tested?

Enhanced test cases for empty/null columns.

@SparkQA
Copy link

SparkQA commented Dec 28, 2017

Test build #85467 has finished for PR 20102 at commit 9617c2d.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@wzhfy
Copy link
Contributor Author

wzhfy commented Dec 28, 2017

cc @cloud-fan

@cloud-fan
Copy link
Contributor

LGTM, merging to master!

@asfgit asfgit closed this in 2877817 Dec 28, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants