Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-6120] [mllib] Warnings about memory in tree, ensemble model save #4864

Closed
wants to merge 2 commits into from

Conversation

jkbradley
Copy link
Member

Issue: When the Python DecisionTree example in the programming guide is run, it runs out of Java Heap Space when using the default memory settings for the spark shell.

This prints a warning.

CC: @mengxr

@SparkQA
Copy link

SparkQA commented Mar 3, 2015

Test build #28211 has started for PR 4864 at commit 7ecb1ed.

  • This patch merges cleanly.

// SPARK-6120: We do a hacky check here so users understand why save() is failing
// when they run the ML guide example.
// TODO: Fix this issue for real.
val driverMemory = sc.getConf.getOption("spark.driver.memory")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Move this into the the if (sc.isLocal) branch.

@SparkQA
Copy link

SparkQA commented Mar 3, 2015

Test build #28213 has started for PR 4864 at commit 02e8daf.

  • This patch merges cleanly.

@SparkQA
Copy link

SparkQA commented Mar 3, 2015

Test build #28213 has finished for PR 4864 at commit 02e8daf.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins
Copy link

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/28213/
Test FAILed.

@SparkQA
Copy link

SparkQA commented Mar 3, 2015

Test build #28211 has finished for PR 4864 at commit 7ecb1ed.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins
Copy link

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/28211/
Test PASSed.

asfgit pushed a commit that referenced this pull request Mar 3, 2015
Issue: When the Python DecisionTree example in the programming guide is run, it runs out of Java Heap Space when using the default memory settings for the spark shell.

This prints a warning.

CC: mengxr

Author: Joseph K. Bradley <joseph@databricks.com>

Closes #4864 from jkbradley/dt-save-heap and squashes the following commits:

02e8daf [Joseph K. Bradley] fixed based on code review
7ecb1ed [Joseph K. Bradley] Added warnings about memory when calling tree and ensemble model save with too small a Java heap size

(cherry picked from commit c2fe3a6)
Signed-off-by: Xiangrui Meng <meng@databricks.com>
@mengxr
Copy link
Contributor

mengxr commented Mar 3, 2015

LGTM. Merged into master and branch-1.3. Thanks!

@asfgit asfgit closed this in c2fe3a6 Mar 3, 2015
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
4 participants