Skip to content

Conversation

@yanboliang
Copy link
Contributor

When sampling and then filtering DataFrame, the SQL Optimizer will push down filter into sample and produce wrong result. This is due to the sampler is calculated based on the original scope rather than the scope after filtering.

@marmbrus
Copy link
Contributor

LGTM

@SparkQA
Copy link

SparkQA commented Oct 27, 2015

Test build #44412 has finished for PR 9294 at commit ef16f3d.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@marmbrus
Copy link
Contributor

Thanks, merged to master

@asfgit asfgit closed this in 360ed83 Oct 27, 2015
@yanboliang yanboliang deleted the spark-11303 branch October 27, 2015 10:35
@marmbrus
Copy link
Contributor

I'm going to pick this into branch-1.5 too.

asfgit pushed a commit that referenced this pull request Oct 28, 2015
When sampling and then filtering DataFrame, the SQL Optimizer will push down filter into sample and produce wrong result. This is due to the sampler is calculated based on the original scope rather than the scope after filtering.

Author: Yanbo Liang <ybliang8@gmail.com>

Closes #9294 from yanboliang/spark-11303.
markhamstra pushed a commit to markhamstra/spark that referenced this pull request Nov 3, 2015
When sampling and then filtering DataFrame, the SQL Optimizer will push down filter into sample and produce wrong result. This is due to the sampler is calculated based on the original scope rather than the scope after filtering.

Author: Yanbo Liang <ybliang8@gmail.com>

Closes apache#9294 from yanboliang/spark-11303.
markhamstra pushed a commit to markhamstra/spark that referenced this pull request Nov 3, 2015
When sampling and then filtering DataFrame, the SQL Optimizer will push down filter into sample and produce wrong result. This is due to the sampler is calculated based on the original scope rather than the scope after filtering.

Author: Yanbo Liang <ybliang8@gmail.com>

Closes apache#9294 from yanboliang/spark-11303.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants