Allow sequences (tuples and lists) as pivot values argument in PySpark. #33083

wrobell · 2021-06-25T11:16:13Z

Both tuples and lists are accepted by PySpark on runtime.

holdenk · 2021-06-25T18:10:57Z

Jenkins ok to test

holdenk · 2021-06-25T18:11:54Z

This looks reasonable to me, I'm not very familiar with the typing code for Python yet so cc @zero323

SparkQA · 2021-06-25T19:06:45Z

Test build #140338 has finished for PR 33083 at commit 8187108.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2021-06-25T19:19:32Z

Kubernetes integration test unable to build dist.

exiting with code: 1
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/44869/

zero323 · 2021-06-25T19:35:01Z

Technically speaking, Sequence is more than Tuple or List (I mention that, because we have quite a few cases where we explicitly restrict inputs to these two), but Py4J should be able map any Sequence in the same way (I've done some rough testing on custom Sequence implementation to be sure, and it seems to work fine).

zero323

LGTM, subject to passing tests.

AmplabJenkins · 2021-06-25T23:54:55Z

Can one of the admins verify this patch?

HyukjinKwon · 2021-06-28T03:03:53Z

@wrobell, can you file a JIRA (see https://spark.apache.org/contributing.html), and enable GitHub Actions in your fork repo (see https://github.com/apache/spark/pull/33083/checks?check_run_id=2913538608)?

Also please keep the GIthub PR template (https://github.com/apache/spark/blob/master/.github/PULL_REQUEST_TEMPLATE) and format PR title properly.

HyukjinKwon · 2021-06-28T03:04:44Z

Otherwise, looks fine to me too. I'll leave it to him.

zero323 · 2021-07-08T20:06:30Z

Otherwise, looks fine to me too. I'll leave it to him.

Sure, I'll handle this once pending comments are addressed.

zero323 · 2021-08-04T12:34:22Z

Gentle ping @wrobell

wrobell · 2021-08-13T09:36:26Z

Sorry, but due to personal circumstances I will not be able to help with this for next couple of weeks.

dchvn · 2021-10-26T04:31:02Z

any update here? python/pyspark/sql/group.pyi has been removed by #34197, so can I create a JIRA ticket and a PR for this issue? @HyukjinKwon @zero323 @wrobell
Thanks!

Allow sequences (tuples and lists) as pivot values argument in PySpark.

8187108

Both tuples and lists are accepted by PySpark on runtime.

github-actions bot added PYTHON SQL labels Jun 25, 2021

zero323 approved these changes Jun 25, 2021

View reviewed changes

HyukjinKwon closed this Oct 26, 2021

dchvn mentioned this pull request Oct 26, 2021

[SPARK-37116][PYTHON] Allow sequences (tuples and lists) as pivot values argument in PySpark #34392

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow sequences (tuples and lists) as pivot values argument in PySpark. #33083

Allow sequences (tuples and lists) as pivot values argument in PySpark. #33083

wrobell commented Jun 25, 2021

holdenk commented Jun 25, 2021

holdenk commented Jun 25, 2021

SparkQA commented Jun 25, 2021

SparkQA commented Jun 25, 2021

zero323 commented Jun 25, 2021 •

edited

zero323 left a comment

AmplabJenkins commented Jun 25, 2021

HyukjinKwon commented Jun 28, 2021

HyukjinKwon commented Jun 28, 2021

zero323 commented Jul 8, 2021

zero323 commented Aug 4, 2021

wrobell commented Aug 13, 2021

dchvn commented Oct 26, 2021

Allow sequences (tuples and lists) as pivot values argument in PySpark. #33083

Allow sequences (tuples and lists) as pivot values argument in PySpark. #33083

Conversation

wrobell commented Jun 25, 2021

holdenk commented Jun 25, 2021

holdenk commented Jun 25, 2021

SparkQA commented Jun 25, 2021

SparkQA commented Jun 25, 2021

zero323 commented Jun 25, 2021 • edited

zero323 left a comment

Choose a reason for hiding this comment

AmplabJenkins commented Jun 25, 2021

HyukjinKwon commented Jun 28, 2021

HyukjinKwon commented Jun 28, 2021

zero323 commented Jul 8, 2021

zero323 commented Aug 4, 2021

wrobell commented Aug 13, 2021

dchvn commented Oct 26, 2021

zero323 commented Jun 25, 2021 •

edited