Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-3114] [PySpark] bugfix: disable compression of command #2026

Closed
wants to merge 1 commit into from

Conversation

davies
Copy link
Contributor

@davies davies commented Aug 19, 2014

compressed commands break Python UDF.

compressed commands break Python UDF.
@SparkQA
Copy link

SparkQA commented Aug 19, 2014

QA tests have started for PR 2026 at commit e6a5630.

  • This patch merges cleanly.

@davies davies changed the title bugfix: disable compression of command [SPARK-3114] [PySpark] bugfix: disable compression of command Aug 19, 2014
@SparkQA
Copy link

SparkQA commented Aug 19, 2014

QA tests have finished for PR 2026 at commit e6a5630.

  • This patch passes unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@JoshRosen
Copy link
Contributor

Do you think it's better to disable compression of the command or to update the Spark SQL code to use the compressing serializer? The latter might buy us a small benefit if we expect task closures to be large; on the other hand, maybe the compression wouldn't be worth it for small closures (will it add a lot of time overhead)?

@JoshRosen
Copy link
Contributor

Opened a new PR that includes this fix, plus a commit to re-enable the sql.py tests.

@davies
Copy link
Contributor Author

davies commented Aug 19, 2014

There are some test cases in Scala needed to fix before enable compress. We can do it later.

@davies davies closed this Aug 19, 2014
asfgit pushed a commit that referenced this pull request Aug 19, 2014
This fixes SPARK-3114, an issue where we inadvertently broke Python UDFs in Spark SQL.

This PR modifiers the test runner script to always run the PySpark SQL tests, irrespective of whether SparkSQL itself has been modified.  It also includes Davies' fix for the bug.

Closes #2026.

Author: Josh Rosen <joshrosen@apache.org>
Author: Davies Liu <davies.liu@gmail.com>

Closes #2027 from JoshRosen/pyspark-sql-fix and squashes the following commits:

9af2708 [Davies Liu] bugfix: disable compression of command
0d8d3a4 [Josh Rosen] Always run Python Spark SQL tests.

(cherry picked from commit 1f1819b)
Signed-off-by: Josh Rosen <joshrosen@apache.org>
asfgit pushed a commit that referenced this pull request Aug 19, 2014
This fixes SPARK-3114, an issue where we inadvertently broke Python UDFs in Spark SQL.

This PR modifiers the test runner script to always run the PySpark SQL tests, irrespective of whether SparkSQL itself has been modified.  It also includes Davies' fix for the bug.

Closes #2026.

Author: Josh Rosen <joshrosen@apache.org>
Author: Davies Liu <davies.liu@gmail.com>

Closes #2027 from JoshRosen/pyspark-sql-fix and squashes the following commits:

9af2708 [Davies Liu] bugfix: disable compression of command
0d8d3a4 [Josh Rosen] Always run Python Spark SQL tests.
xiliu82 pushed a commit to xiliu82/spark that referenced this pull request Sep 4, 2014
This fixes SPARK-3114, an issue where we inadvertently broke Python UDFs in Spark SQL.

This PR modifiers the test runner script to always run the PySpark SQL tests, irrespective of whether SparkSQL itself has been modified.  It also includes Davies' fix for the bug.

Closes apache#2026.

Author: Josh Rosen <joshrosen@apache.org>
Author: Davies Liu <davies.liu@gmail.com>

Closes apache#2027 from JoshRosen/pyspark-sql-fix and squashes the following commits:

9af2708 [Davies Liu] bugfix: disable compression of command
0d8d3a4 [Josh Rosen] Always run Python Spark SQL tests.
@davies davies deleted the fix_compress branch September 15, 2014 22:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants