
[SPARK-12251] Document and improve off-heap memory configurations #10237

Closed
JoshRosen wants to merge 5 commits into master from JoshRosen:SPARK-12251

Conversation

JoshRosen
Contributor

This patch adds documentation for Spark configurations that affect off-heap memory and makes some naming and validation improvements for those configs.

  • Change `spark.memory.offHeapSize` to `spark.memory.offHeap.size`. This is fine because this configuration has not shipped in any Spark release yet (it's new in Spark 1.6).
  • Deprecate `spark.unsafe.offHeap` in favor of a new `spark.memory.offHeap.enabled` configuration. The motivation behind this change is to gather all memory-related configurations under the same prefix.
  • Add a check which prevents users from setting `spark.memory.offHeap.enabled=true` when `spark.memory.offHeap.size == 0`. After SPARK-11389 ([SPARK-11389][CORE] Add support for off-heap memory to MemoryManager #9344), which was committed in Spark 1.6, Spark enforces a hard limit on the amount of off-heap memory that it will allocate to tasks. As a result, enabling off-heap execution memory without setting `spark.memory.offHeap.size` will lead to immediate OOMs. The new configuration validation makes this scenario easier to diagnose, helping to avoid user confusion (see the example sketch after this list).
  • Document these configurations on the configuration page.
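
For reference, a minimal sketch of how these settings fit together on a `SparkConf`. The app name and the 512 MB budget are illustrative values, not part of this patch; only the two `spark.memory.offHeap.*` keys and the validation behavior come from the changes above:

```scala
import org.apache.spark.{SparkConf, SparkContext}

// Enable off-heap execution memory and give it an explicit budget.
// With this patch, setting spark.memory.offHeap.enabled=true while leaving
// spark.memory.offHeap.size at its default of 0 fails configuration
// validation up front instead of causing immediate OOMs at runtime.
val conf = new SparkConf()
  .setAppName("offheap-example")                 // illustrative app name
  .set("spark.memory.offHeap.enabled", "true")   // replaces the deprecated spark.unsafe.offHeap
  .set("spark.memory.offHeap.size", "536870912") // 512 MB of off-heap memory, in bytes

val sc = new SparkContext(conf)
```

The same keys can equally be supplied via `--conf` on `spark-submit`.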

@JoshRosen
Contributor Author

Any preferences re: `offHeap` vs. `offheap`?

@SparkQA

SparkQA commented Dec 10, 2015

Test build #47469 has finished for PR 10237 at commit 2dd2d9e.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@JoshRosen
Contributor Author

Jenkins, retest this please.

@SparkQA

SparkQA commented Dec 10, 2015

Test build #47482 has finished for PR 10237 at commit 216fc46.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA

SparkQA commented Dec 10, 2015

Test build #47486 has finished for PR 10237 at commit 216fc46.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@andrewor14
Contributor

LGTM, will merge once it passes tests.

@SparkQA

SparkQA commented Dec 10, 2015

Test build #2200 has finished for PR 10237 at commit 216fc46.

  • This patch fails Spark unit tests.
  • This patch does not merge cleanly.
  • This patch adds the following public classes (experimental):
    * public class JavaBinarizerExample
    * public class JavaBucketizerExample
    * public class JavaDCTExample
    * public class JavaElementwiseProductExample
    * public class JavaMinMaxScalerExample
    * public class JavaNGramExample
    * public class JavaNormalizerExample
    * public class JavaOneHotEncoderExample
    * public class JavaPCAExample
    * public class JavaPolynomialExpansionExample
    * public class JavaRFormulaExample
    * public class JavaStandardScalerExample
    * public class JavaStopWordsRemoverExample
    * public class JavaStringIndexerExample
    * public class JavaTokenizerExample
    * public class JavaVectorAssemblerExample
    * public class JavaVectorIndexerExample
    * public class JavaVectorSlicerExample

@JoshRosen
Contributor Author

Haha, because of the Spark PRs dashboard update issues / downtime, your NewSparkPullRequestBuilder runs were triggered against an older commit rather than the latest commit, which contains the fix for that test failure.

@SparkQA

SparkQA commented Dec 10, 2015

Test build #2199 has finished for PR 10237 at commit 216fc46.

  • This patch fails Spark unit tests.
  • This patch does not merge cleanly.
  • This patch adds the following public classes (experimental):
    * public class JavaBinarizerExample
    * public class JavaBucketizerExample
    * public class JavaDCTExample
    * public class JavaElementwiseProductExample
    * public class JavaMinMaxScalerExample
    * public class JavaNGramExample
    * public class JavaNormalizerExample
    * public class JavaOneHotEncoderExample
    * public class JavaPCAExample
    * public class JavaPolynomialExpansionExample
    * public class JavaRFormulaExample
    * public class JavaStandardScalerExample
    * public class JavaStopWordsRemoverExample
    * public class JavaStringIndexerExample
    * public class JavaTokenizerExample
    * public class JavaVectorAssemblerExample
    * public class JavaVectorIndexerExample
    * public class JavaVectorSlicerExample
    * logInfo(s"HBase class not found $e")
    * logDebug("HBase class not found", e)

@SparkQA

SparkQA commented Dec 10, 2015

Test build #2201 has finished for PR 10237 at commit 216fc46.

  • This patch fails Spark unit tests.
  • This patch does not merge cleanly.
  • This patch adds no public classes.

@SparkQA

SparkQA commented Dec 10, 2015

Test build #47541 has finished for PR 10237 at commit 12ffce3.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@andrewor14
Contributor

Merged into master and 1.6.

asfgit pushed a commit that referenced this pull request Dec 10, 2015

Author: Josh Rosen <joshrosen@databricks.com>

Closes #10237 from JoshRosen/SPARK-12251.

(cherry picked from commit 23a9e62)
Signed-off-by: Andrew Or <andrew@databricks.com>
@asfgit asfgit closed this in 23a9e62 Dec 10, 2015
@JoshRosen JoshRosen deleted the SPARK-12251 branch August 29, 2016 19:28