
[SPARK-26080][PYTHON] Skips Python resource limit on Windows in Python worker #23055

Closed
wants to merge 6 commits

Conversation

@HyukjinKwon (Member) commented Nov 16, 2018

What changes were proposed in this pull request?

The `resource` package is a Unix-specific package; see https://docs.python.org/2/library/resource.html and https://docs.python.org/3/library/resource.html.

Note that we document Windows support:

Spark runs on both Windows and UNIX-like systems (e.g. Linux, Mac OS).

This should be backported into branch-2.4 to restore Windows support in Spark 2.4.1.

How was this patch tested?

Manually, by mocking the changed logic.
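
For readers following along, here is a minimal sketch of the pattern (illustrative only, not the actual patch; the function name and numbers are assumptions): import `resource` defensively and skip the limit where the module is unavailable, e.g. on Windows.

```python
# Minimal sketch, assuming a worker that wants to cap its own memory.
# `resource` only exists on Unix-like systems, so guard the import and
# silently skip the limit when it is missing (e.g. on Windows).
try:
    import resource
    has_resource_module = True
except ImportError:
    has_resource_module = False


def maybe_set_memory_limit(limit_mb):
    """Best-effort RLIMIT_AS cap; a no-op when `resource` is unavailable."""
    if not has_resource_module:
        return
    soft, hard = resource.getrlimit(resource.RLIMIT_AS)
    new_soft = limit_mb * 1024 * 1024
    # Only tighten the limit; -1 means RLIM_INFINITY.
    if soft < 0 or new_soft < soft:
        resource.setrlimit(resource.RLIMIT_AS, (new_soft, hard))
```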

@HyukjinKwon (Member Author)

cc @rdblue, @vanzin and @haydenjeune

@HyukjinKwon HyukjinKwon changed the title [SPARK-26080][SQL] Disable 'spark.executor.pyspark.memory' always on Windows [SPARK-26080][PYTHON] Disable 'spark.executor.pyspark.memory' always on Windows Nov 16, 2018
Review thread on python/pyspark/worker.py (outdated, resolved)
@rdblue (Contributor) commented Nov 16, 2018

Thanks for fixing this so quickly, @HyukjinKwon! I'd like a couple of changes, but overall it is going in the right direction.

We should also plan on porting this to the 2.4 branch when it is committed since it is a regression.

@SparkQA commented Nov 16, 2018

Test build #98892 has finished for PR 23055 at commit 2d3315a.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA commented Nov 17, 2018

Test build #98952 has finished for PR 23055 at commit 52a91cc.

  • This patch fails due to an unknown error code, -9.
  • This patch merges cleanly.
  • This patch adds no public classes.

@HyukjinKwon (Member Author)

retest this please

@SparkQA commented Nov 17, 2018

Test build #98962 has finished for PR 23055 at commit 52a91cc.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@HyukjinKwon (Member Author)

adding @BryanCutler and @ueshin as well FYI.

@viirya (Member) commented Nov 18, 2018

By the way, there is a typo in the PR description: "a Unit specific package".

@dongjoon-hyun (Member)

Retest this please.

@SparkQA commented Nov 21, 2018

Test build #99120 has finished for PR 23055 at commit 52a91cc.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

Review thread on docs/configuration.md (outdated, resolved)
Review thread on python/pyspark/worker.py (outdated, resolved)
@HyukjinKwon (Member Author)

Sorry, may I ask for another look, please? I thought it was quite a straightforward fix, done in a consistent way.

@SparkQA commented Nov 22, 2018

Test build #99148 has finished for PR 23055 at commit fd92a4e.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@HyukjinKwon (Member Author)

retest this please

@HyukjinKwon (Member Author)

Let me get this in in a few days if there are no more comments.

@SparkQA commented Nov 26, 2018

Test build #99278 has finished for PR 23055 at commit fd92a4e.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@HyukjinKwon HyukjinKwon changed the title [SPARK-26080][PYTHON] Disable 'spark.executor.pyspark.memory' always on Windows [SPARK-26080][PYTHON] Skips Python resource limit on Windows in Python worker Nov 30, 2018
@SparkQA commented Nov 30, 2018

Test build #99490 has finished for PR 23055 at commit 741d64a.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA commented Nov 30, 2018

Test build #99491 has finished for PR 23055 at commit bd1acf2.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

Review thread on docs/configuration.md (outdated, resolved)
@rdblue (Contributor) commented Nov 30, 2018

+1 once the docs are updated to note that resource requests still include Python memory, even on Windows.

@HyukjinKwon (Member Author) commented Dec 1, 2018

@vanzin and @rdblue, I updated the doc because the suggestion doesn't sound wrong to me. But, for clarification, we shouldn't really document support for something that isn't tested (in particular in a case like this one, where a failure was actually found). Also, IMHO, it's better to keep things simple when there is a Windows issue, in terms of maintenance, since not many Windows maintainers exist in Spark.

The main functionality of that configuration is limiting resource usage, if I'm not mistaken; the allocation behavior in some modes is secondary. Ideally someone should test it and fix the doc to show how it works in another PR.

@SparkQA commented Dec 1, 2018

Test build #99531 has finished for PR 23055 at commit 5fe1c09.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@rdblue (Contributor) commented Dec 1, 2018

+1 with the latest changes. Thanks for taking care of this, @HyukjinKwon!

The functionality is in two parts: changing the resource requests (which doesn't change here) and limiting memory use in Python.

It is too bad that this broke, but I'm not sure how to deal with a platform that, as you say, has few contributors. I certainly wouldn't want to gate a feature like this on making sure someone tested it in Windows, unless we have CI set up for Windows builds.
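
For context, the configuration in question is `spark.executor.pyspark.memory`, set like any other Spark conf; a minimal sketch with illustrative values (not from this PR) is below. After this change, the rlimit enforcement inside the Python worker applies only on Unix-like systems, while the resource-request side is unchanged.

```python
from pyspark.sql import SparkSession

# Hypothetical example of setting the limit discussed above. The value still
# feeds into executor resource requests; the rlimit enforcement inside the
# Python worker is skipped on Windows after this fix.
spark = (
    SparkSession.builder
    .appName("pyspark-memory-demo")  # illustrative app name
    .config("spark.executor.pyspark.memory", "2g")
    .getOrCreate()
)
```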

@HyukjinKwon (Member Author)

Thanks, @rdblue.

Merged to master and branch-2.4.

asfgit pushed a commit that referenced this pull request Dec 2, 2018
…n worker

## What changes were proposed in this pull request?

`resource` package is a Unix specific package. See https://docs.python.org/2/library/resource.html and https://docs.python.org/3/library/resource.html.

Note that we document Windows support:

> Spark runs on both Windows and UNIX-like systems (e.g. Linux, Mac OS).

This should be backported into branch-2.4 to restore Windows support in Spark 2.4.1.

## How was this patch tested?

Manually mocking the changed logics.

Closes #23055 from HyukjinKwon/SPARK-26080.

Lead-authored-by: hyukjinkwon <gurwls223@apache.org>
Co-authored-by: Hyukjin Kwon <gurwls223@apache.org>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
(cherry picked from commit 9cda9a8)
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
@asfgit asfgit closed this in 9cda9a8 Dec 2, 2018
@rdblue (Contributor) commented Dec 3, 2018

@HyukjinKwon, for the future, I should note that I'm not a committer so my +1 for a PR is not binding. I'm fairly sure @vanzin would +1 this commit as well, but it's best not to merge based on my approval.

@vanzin (Contributor) commented Dec 3, 2018

(Belated +1.) The doc update looks fine. The previous one was misleading for reasons that Ryan explains above; it has nothing to do with whether it's Windows or not.

@HyukjinKwon (Member Author)

The last comment was a minor one about the doc - actually the whole point was a minor one. It is related to Windows, though.

jackylee-ch pushed a commit to jackylee-ch/spark that referenced this pull request Feb 18, 2019
@manojfaria commented Feb 22, 2019

I am new to this, so this may be a simple question. When is this fix getting released? I suppose it is planned for Apache Spark 2.4.1, for which pre-built binaries are not yet available for download (does that sound right?). If I need this fix (even if it is not from a stable release), how do I get pre-built binaries for the 2.4.1 release?

@BryanCutler (Member)

@manojfaria, this fix will be in the 2.4.1 and 3.0.0 releases; see SPARK-26080. If you want it before then, you will have to apply the patch and build Spark yourself. The patch can be downloaded by adding a .diff to the pull request link. Hope that helps!

@parshvms

@HyukjinKwon thanks for the fix. Workers run well with 2.4.0 now.

kai-chi pushed a commit to kai-chi/spark that referenced this pull request Jul 23, 2019
kai-chi pushed a commit to kai-chi/spark that referenced this pull request Aug 1, 2019
@HyukjinKwon HyukjinKwon deleted the SPARK-26080 branch March 3, 2020 01:20
arjunshroff pushed a commit to arjunshroff/spark that referenced this pull request Nov 24, 2020
* [SPARK-26709][SQL] OptimizeMetadataOnlyQuery does not handle empty records correctly

## What changes were proposed in this pull request?

When reading from empty tables, the optimization `OptimizeMetadataOnlyQuery` may return wrong results:
```
sql("CREATE TABLE t (col1 INT, p1 INT) USING PARQUET PARTITIONED BY (p1)")
sql("INSERT INTO TABLE t PARTITION (p1 = 5) SELECT ID FROM range(1, 1)")
sql("SELECT MAX(p1) FROM t")
```
The result is supposed to be `null`. However, with the optimization the result is `5`.

The rule is originally ported from https://issues.apache.org/jira/browse/HIVE-1003 in apache#13494. In Hive, the rule is disabled by default in a later release(https://issues.apache.org/jira/browse/HIVE-15397), due to the same problem.

It is hard to completely avoid the correctness issue, because data sources like Parquet can answer such queries from metadata only, and Spark can't tell whether a table is empty without actually reading it. This PR disables the optimization by default.
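
For reference, a sketch of how the optimization can still be re-enabled explicitly (the config key is assumed from the SPARK-26709 discussion; verify it against your Spark version):

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Assumed config key for the metadata-only optimization; re-enabling trades
# correctness on empty partitions for the speedup described above.
spark.conf.set("spark.sql.optimizer.metadataOnly", "true")
```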

## How was this patch tested?

Unit test

Closes apache#23635 from gengliangwang/optimizeMetadata.

Lead-authored-by: Gengliang Wang <gengliang.wang@databricks.com>
Co-authored-by: Xiao Li <gatorsmile@gmail.com>
Signed-off-by: gatorsmile <gatorsmile@gmail.com>
(cherry picked from commit f5b9370)
Signed-off-by: gatorsmile <gatorsmile@gmail.com>

* [SPARK-26080][PYTHON] Skips Python resource limit on Windows in Python worker


* [SPARK-26873][SQL] Use a consistent timestamp to build Hadoop Job IDs.

## What changes were proposed in this pull request?

Updates FileFormatWriter to create a consistent Hadoop Job ID for a write.

## How was this patch tested?

Existing tests for regressions.

Closes apache#23777 from rdblue/SPARK-26873-fix-file-format-writer-job-ids.

Authored-by: Ryan Blue <blue@apache.org>
Signed-off-by: Marcelo Vanzin <vanzin@cloudera.com>
(cherry picked from commit 33334e2)
Signed-off-by: Marcelo Vanzin <vanzin@cloudera.com>

* [SPARK-26745][SPARK-24959][SQL][BRANCH-2.4] Revert count optimization in JSON datasource by

## What changes were proposed in this pull request?

This PR reverts JSON count optimization part of apache#21909.

We cannot distinguish the cases below without parsing:

```
[{...}, {...}]
```

```
[]
```

```
{...}
```

```bash
# empty string
```

when we `count()`. One line of input (`IN`) can produce 0 records, 1 record, or multiple records, depending on the input.

See also apache#23665 (comment).

## How was this patch tested?

Manually tested.

Closes apache#23708 from HyukjinKwon/SPARK-26745-backport.

Authored-by: Hyukjin Kwon <gurwls223@apache.org>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>

* [SPARK-26677][BUILD] Update Parquet to 1.10.1 with notEq pushdown fix.

## What changes were proposed in this pull request?

Update to Parquet Java 1.10.1.

## How was this patch tested?

Added a test from HyukjinKwon that validates the notEq case from SPARK-26677.

Closes apache#23704 from rdblue/SPARK-26677-fix-noteq-parquet-bug.

Lead-authored-by: Ryan Blue <blue@apache.org>
Co-authored-by: Hyukjin Kwon <gurwls223@apache.org>
Co-authored-by: Ryan Blue <rdblue@users.noreply.github.com>
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
(cherry picked from commit f72d217)
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>

* [SPARK-26677][FOLLOWUP][BRANCH-2.4] Update Parquet manifest with Hadoop-2.6

## What changes were proposed in this pull request?

While merging the Parquet upgrade PR, the `hadoop-2.6` profile dependency manifest was missed.

## How was this patch tested?

Manual.

```
./dev/test-dependencies.sh
```

Also, this will recover the `branch-2.4` build with `hadoop-2.6`.
- https://amplab.cs.berkeley.edu/jenkins/view/Spark%20QA%20Test%20(Dashboard)/job/spark-branch-2.4-test-sbt-hadoop-2.6/281/

Closes apache#23738 from dongjoon-hyun/SPARK-26677-2.

Authored-by: Dongjoon Hyun <dongjoon@apache.org>
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>

* [SPARK-26708][SQL][BRANCH-2.4] Incorrect result caused by inconsistency between a SQL cache's cached RDD and its physical plan

## What changes were proposed in this pull request?

When performing non-cascading cache invalidation, `recache` is called on the other cache entries that depend on the cache being invalidated, which leads to the physical plans of those cache entries being re-compiled. For those cache entries, if the cached RDD has already been persisted, chances are there will be an inconsistency between the data and the new plan. It can cause a correctness issue if the new plan's `outputPartitioning` or `outputOrdering` is different from that of the actual data, and meanwhile the cache is used by another query that asks for a specific `outputPartitioning` or `outputOrdering` which happens to match the new plan but not the actual data.

The fix is to keep the cache entry as it is if the data has been loaded, otherwise re-build the cache entry, with a new plan and an empty cache buffer.

## How was this patch tested?

Added UT.

Closes apache#23678 from maryannxue/spark-26708-2.4.

Authored-by: maryannxue <maryannxue@apache.org>
Signed-off-by: Takeshi Yamamuro <yamamuro@apache.org>

* [SPARK-26267][SS] Retry when detecting incorrect offsets from Kafka (2.4)

## What changes were proposed in this pull request?

Backport apache#23324 to branch-2.4.

## How was this patch tested?

Jenkins

Closes apache#23365 from zsxwing/SPARK-26267-2.4.

Authored-by: Shixiong Zhu <zsxwing@gmail.com>
Signed-off-by: Shixiong Zhu <zsxwing@gmail.com>

* [SPARK-26706][SQL] Fix `illegalNumericPrecedence` for ByteType

This PR contains a minor change in `Cast$mayTruncate` that fixes its logic for bytes.

Right now, `mayTruncate(ByteType, LongType)` returns `false` while `mayTruncate(ShortType, LongType)` returns `true`. Consequently, `spark.range(1, 3).as[Byte]` and `spark.range(1, 3).as[Short]` behave differently.

Potentially, this bug can silently corrupt someone's data.
```scala
// executes silently even though Long is converted into Byte
spark.range(Long.MaxValue - 10, Long.MaxValue).as[Byte]
  .map(b => b - 1)
  .show()
+-----+
|value|
+-----+
|  -12|
|  -11|
|  -10|
|   -9|
|   -8|
|   -7|
|   -6|
|   -5|
|   -4|
|   -3|
+-----+
// throws an AnalysisException: Cannot up cast `id` from bigint to smallint as it may truncate
spark.range(Long.MaxValue - 10, Long.MaxValue).as[Short]
  .map(s => s - 1)
  .show()
```

This PR comes with a set of unit tests.

Closes apache#23632 from aokolnychyi/cast-fix.

Authored-by: Anton Okolnychyi <aokolnychyi@apple.com>
Signed-off-by: DB Tsai <d_tsai@apple.com>

* [SPARK-26078][SQL][BACKPORT-2.4] Dedup self-join attributes on IN subqueries

## What changes were proposed in this pull request?

When there is a self-join as a result of an IN subquery, the join condition may be invalid, resulting in trivially true predicates and wrong results.

The PR deduplicates the subquery output in order to avoid the issue.
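
As an illustration only (not the regression test from that PR), the affected pattern is an IN subquery whose inner and outer sides read the same table, which the optimizer rewrites into a self-join:

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Hypothetical shape of the affected query: both sides of the rewritten join
# come from the same table, so the join condition can become trivially true
# unless the subquery output attributes are deduplicated first.
spark.range(5).withColumnRenamed("id", "a").createOrReplaceTempView("t")
spark.sql("SELECT a FROM t WHERE a IN (SELECT a FROM t WHERE a > 2)").show()
```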

## How was this patch tested?

added UT

Closes apache#23449 from mgaido91/SPARK-26078_2.4.

Authored-by: Marco Gaido <marcogaido91@gmail.com>
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>

* [SPARK-26233][SQL][BACKPORT-2.4] CheckOverflow when encoding a decimal value

## What changes were proposed in this pull request?

When we encode a Decimal from an external source, we don't check for overflow. That check is useful not only to enforce that we can represent the correct value in the specified range, but also because it changes the underlying data to the right precision/scale. Since our code generation assumes that a decimal has exactly the same precision and scale as its data type, failing to enforce this can lead to corrupted output/results in subsequent transformations.

## How was this patch tested?

added UT

Closes apache#23232 from mgaido91/SPARK-26233_2.4.

Authored-by: Marco Gaido <marcogaido91@gmail.com>
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>

* [SPARK-27097][CHERRY-PICK 2.4] Avoid embedding platform-dependent offsets literally in whole-stage generated code

## What changes were proposed in this pull request?

Spark SQL performs whole-stage code generation to speed up query execution. There are two steps to it:
- Java source code is generated from the physical query plan on the driver. A single version of the source code is generated from a query plan, and sent to all executors.
  - It's compiled to bytecode on the driver to catch compilation errors before sending to executors, but currently only the generated source code gets sent to the executors. The bytecode compilation is for fail-fast only.
- Executors receive the generated source code and compile to bytecode, then the query runs like a hand-written Java program.

In this model, there's an implicit assumption about the driver and executors being run on similar platforms. Some code paths accidentally embedded platform-dependent object layout information into the generated code, such as:
```java
Platform.putLong(buffer, /* offset */ 24, /* value */ 1);
```
This code expects a field to be at offset +24 of the `buffer` object, and sets a value to that field.
But whole-stage code generation generally uses platform-dependent information from the driver. If the object layout is significantly different on the driver and executors, the generated code can be reading/writing to wrong offsets on the executors, causing all kinds of data corruption.

One code pattern that leads to such problem is the use of `Platform.XXX` constants in generated code, e.g. `Platform.BYTE_ARRAY_OFFSET`.

Bad:
```scala
val baseOffset = Platform.BYTE_ARRAY_OFFSET
// codegen template:
s"Platform.putLong($buffer, $baseOffset, $value);"
```
This will embed the value of `Platform.BYTE_ARRAY_OFFSET` on the driver into the generated code.

Good:
```scala
val baseOffset = "Platform.BYTE_ARRAY_OFFSET"
// codegen template:
s"Platform.putLong($buffer, $baseOffset, $value);"
```
This will generate the offset symbolically -- `Platform.putLong(buffer, Platform.BYTE_ARRAY_OFFSET, value)`, which will be able to pick up the correct value on the executors.

Caveat: these offset constants are declared as runtime-initialized `static final` in Java, so they're not compile-time constants from the Java language's perspective. It does lead to a slightly increased size of the generated code, but this is necessary for correctness.

NOTE: there can be other patterns that generate platform-dependent code on the driver which is invalid on the executors. e.g. if the endianness is different between the driver and the executors, and if some generated code makes strong assumption about endianness, it would also be problematic.

## How was this patch tested?

Added a new test suite `WholeStageCodegenSparkSubmitSuite`. This test suite needs to set the driver's extraJavaOptions to force the driver and executor use different Java object layouts, so it's run as an actual SparkSubmit job.

Authored-by: Kris Mok <kris.mok@databricks.com>

Closes apache#24032 from gatorsmile/testFailure.

Lead-authored-by: Kris Mok <kris.mok@databricks.com>
Co-authored-by: gatorsmile <gatorsmile@gmail.com>
Signed-off-by: DB Tsai <d_tsai@apple.com>

* [SPARK-26188][SQL] FileIndex: don't infer data types of partition columns if user specifies schema

## What changes were proposed in this pull request?

This PR is to fix a regression introduced in: https://github.com/apache/spark/pull/21004/files#r236998030

If the user specifies a schema, Spark doesn't need to infer the data types of partition columns; otherwise the inferred data type might not match the one the user provided.
E.g. for partition directory `p=4d`, after data type inference the column value will be `4.0`.
See https://issues.apache.org/jira/browse/SPARK-26188 for more details.

Note that user specified schema **might not cover all the data columns**:
```
val schema = new StructType()
  .add("id", StringType)
  .add("ex", ArrayType(StringType))
val df = spark.read
  .schema(schema)
  .format("parquet")
  .load(src.toString)

assert(df.schema.toList === List(
  StructField("ex", ArrayType(StringType)),
  StructField("part", IntegerType), // inferred partitionColumn dataType
  StructField("id", StringType))) // used user provided partitionColumn dataType
```
For the columns missing from the user-specified schema, Spark still needs to infer their data types if `partitionColumnTypeInferenceEnabled` is enabled.

To implement the partial inference, refactor `PartitioningUtils.parsePartitions` and pass the user-specified schema as a parameter to cast partition values.

## How was this patch tested?

Add unit test.

Closes apache#23165 from gengliangwang/fixFileIndex.

Authored-by: Gengliang Wang <gengliang.wang@databricks.com>
Signed-off-by: Wenchen Fan <wenchen@databricks.com>
(cherry picked from commit 9cfc3ee)
Signed-off-by: Wenchen Fan <wenchen@databricks.com>

* [SPARK-25921][PYSPARK] Fix barrier task run without BarrierTaskContext while python worker reuse

## What changes were proposed in this pull request?

Running a barrier job after a normal Spark job causes the barrier job to run without a BarrierTaskContext. This is because, with Python worker reuse, BarrierTaskContext._getOrCreate() will still return a plain TaskContext after a normal Spark job has been submitted first, and we'll get an `AttributeError: 'TaskContext' object has no attribute 'barrier'`. Fix this by adding check logic in BarrierTaskContext._getOrCreate() to make sure it returns a BarrierTaskContext in this scenario.
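
A rough sketch of the kind of check described (assumed shape, not the actual PySpark implementation):

```python
# Illustrative only: the cached context must actually be a BarrierTaskContext
# before it is handed back, since a reused worker may still hold a plain
# TaskContext from an earlier non-barrier job.
class TaskContext(object):
    _taskContext = None


class BarrierTaskContext(TaskContext):
    @classmethod
    def _getOrCreate(cls):
        # Replace the cached context if it is missing or of the wrong type.
        if not isinstance(cls._taskContext, BarrierTaskContext):
            cls._taskContext = BarrierTaskContext()
        return cls._taskContext
```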

## How was this patch tested?

Add new UT in pyspark-core.

Closes apache#22962 from xuanyuanking/SPARK-25921.

Authored-by: Yuanjian Li <xyliyuanjian@gmail.com>
Signed-off-by: Wenchen Fan <wenchen@databricks.com>
(cherry picked from commit c00e72f)
Signed-off-by: Wenchen Fan <wenchen@databricks.com>