[SPARK-13594][SQL] remove typed operations(e.g. map, flatMap) from python DataFrame #11445

cloud-fan · 2016-03-01T13:59:11Z

What changes were proposed in this pull request?

Remove map, flatMap, mapPartitions from python DataFrame, to prepare for Dataset API in the future.

How was this patch tested?

existing tests

cloud-fan · 2016-03-01T13:59:31Z

cc @rxin @yhuai

SparkQA · 2016-03-01T14:15:08Z

Test build #52244 has finished for PR 11445 at commit 86ec0ff.

This patch fails PySpark unit tests.
This patch merges cleanly.
This patch adds no public classes.

yhuai · 2016-03-01T19:25:04Z

python/pyspark/sql/dataframe.py

-        4
-        """
-        return self.rdd.mapPartitions(f, preservesPartitioning)
-


Should we also remove foreach and foreachPartition?

those are fine, since they don't return anything.

SparkQA · 2016-03-02T01:03:37Z

Test build #52273 has finished for PR 11445 at commit bf4b9d5.

This patch fails PySpark unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-03-02T03:09:59Z

Test build #52285 has finished for PR 11445 at commit d0a69fa.

This patch fails PySpark unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-03-02T04:22:52Z

Test build #52287 has finished for PR 11445 at commit 5e711e3.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

rxin · 2016-03-02T23:25:35Z

Thanks - merging this in master.

mydpy · 2016-03-15T14:31:39Z

This change surprised me as a user of Pyspark on the 2.0.0-Snapshot. Thanks for documenting this well. Since I usually use the Scala API, it was not clear to me that Pyspark didn't support the Datasets API yet (i.e., df.rdd.flatMap(...) returns a PythonRDD as-opposed to a Dataset)

…thon DataFrame ## What changes were proposed in this pull request? Remove `map`, `flatMap`, `mapPartitions` from python DataFrame, to prepare for Dataset API in the future. ## How was this patch tested? existing tests Author: Wenchen Fan <wenchen@databricks.com> Closes apache#11445 from cloud-fan/python-clean.

maver1ck · 2016-07-19T09:22:10Z

@rxin
As we're not planning to implement DataSets in Python is there a plan to revert this PR?

remove typed operations from python DataFrame

86ec0ff

yhuai reviewed Mar 1, 2016
View reviewed changes

more clean

bf4b9d5

update

5e711e3

cloud-fan force-pushed the python-clean branch from d0a69fa to 5e711e3 Compare March 2, 2016 04:00

asfgit closed this in 4dd2481 Mar 2, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-13594][SQL] remove typed operations(e.g. map, flatMap) from python DataFrame #11445

[SPARK-13594][SQL] remove typed operations(e.g. map, flatMap) from python DataFrame #11445

Uh oh!

cloud-fan commented Mar 1, 2016

Uh oh!

cloud-fan commented Mar 1, 2016

Uh oh!

SparkQA commented Mar 1, 2016

Uh oh!

yhuai Mar 1, 2016

Uh oh!

rxin Mar 1, 2016

Uh oh!

SparkQA commented Mar 2, 2016

Uh oh!

SparkQA commented Mar 2, 2016

Uh oh!

SparkQA commented Mar 2, 2016

Uh oh!

rxin commented Mar 2, 2016

Uh oh!

mydpy commented Mar 15, 2016

Uh oh!

maver1ck commented Jul 19, 2016 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

[SPARK-13594][SQL] remove typed operations(e.g. map, flatMap) from python DataFrame #11445

[SPARK-13594][SQL] remove typed operations(e.g. map, flatMap) from python DataFrame #11445

Uh oh!

Conversation

cloud-fan commented Mar 1, 2016

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

cloud-fan commented Mar 1, 2016

Uh oh!

SparkQA commented Mar 1, 2016

Uh oh!

yhuai Mar 1, 2016

Choose a reason for hiding this comment

Uh oh!

rxin Mar 1, 2016

Choose a reason for hiding this comment

Uh oh!

SparkQA commented Mar 2, 2016

Uh oh!

SparkQA commented Mar 2, 2016

Uh oh!

SparkQA commented Mar 2, 2016

Uh oh!

rxin commented Mar 2, 2016

Uh oh!

mydpy commented Mar 15, 2016

Uh oh!

maver1ck commented Jul 19, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

maver1ck commented Jul 19, 2016 •

edited

Loading