Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-18507][SQL] HiveExternalCatalog.listPartitions should only call getTable once #15978

Closed
wants to merge 1 commit into from

Conversation

cloud-fan
Copy link
Contributor

What changes were proposed in this pull request?

HiveExternalCatalog.listPartitions should only call getTable once, instead of calling it for every partitions.

How was this patch tested?

N/A

@cloud-fan
Copy link
Contributor Author

cc @yhuai @mallman @ericl

@SparkQA
Copy link

SparkQA commented Nov 22, 2016

Test build #68993 has finished for PR 15978 at commit 1b4785a.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@mallman
Copy link
Contributor

mallman commented Nov 22, 2016

I'll give this patch a try. Thanks.

@mallman
Copy link
Contributor

mallman commented Nov 22, 2016

This patch resolves the performance problem I was seeing. Thank you!

@ericl
Copy link
Contributor

ericl commented Nov 22, 2016

Lgtm. Should we also add a metric on the number if hive calls and check that in the table perf stats suite to prevent future regression?

@andrewor14
Copy link
Contributor

Oops, that was my fault. Thanks merging into master 2.1

asfgit pushed a commit that referenced this pull request Nov 22, 2016
…l getTable once

## What changes were proposed in this pull request?

HiveExternalCatalog.listPartitions should only call `getTable` once, instead of calling it for every partitions.

## How was this patch tested?

N/A

Author: Wenchen Fan <wenchen@databricks.com>

Closes #15978 from cloud-fan/perf.

(cherry picked from commit 702cd40)
Signed-off-by: Andrew Or <andrewor14@gmail.com>
@andrewor14
Copy link
Contributor

@cloud-fan can you make a patch for 2.0?

@asfgit asfgit closed this in 702cd40 Nov 22, 2016
@andrewor14
Copy link
Contributor

andrewor14 commented Nov 22, 2016

(Actually never mind, not my fault! :p)

robert3005 pushed a commit to palantir/spark that referenced this pull request Dec 2, 2016
…l getTable once

## What changes were proposed in this pull request?

HiveExternalCatalog.listPartitions should only call `getTable` once, instead of calling it for every partitions.

## How was this patch tested?

N/A

Author: Wenchen Fan <wenchen@databricks.com>

Closes apache#15978 from cloud-fan/perf.
uzadude pushed a commit to uzadude/spark that referenced this pull request Jan 27, 2017
…l getTable once

## What changes were proposed in this pull request?

HiveExternalCatalog.listPartitions should only call `getTable` once, instead of calling it for every partitions.

## How was this patch tested?

N/A

Author: Wenchen Fan <wenchen@databricks.com>

Closes apache#15978 from cloud-fan/perf.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
5 participants