Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-21727][R] Allow multi-element atomic vector as column type in SparkR DataFrame #20352

Closed
wants to merge 3 commits into from

Conversation

neilalex
Copy link
Contributor

What changes were proposed in this pull request?

A fix to https://issues.apache.org/jira/browse/SPARK-21727, "Operating on an ArrayType in a SparkR DataFrame throws error"

How was this patch tested?

  • Ran tests at R\pkg\tests\run-all.R (see below attached results)
  • Tested the following lines in SparkR, which now seem to execute without error:
indices <- 1:4
myDf <- data.frame(indices)
myDf$data <- list(rep(0, 20))
mySparkDf <- as.DataFrame(myDf)
collect(mySparkDf)

2018-01-22 SPARK-21727 Test Results.txt

@felixcheung @yanboliang @sun-rui @shivaram

The contribution is my original work and I license the work to the project under the project’s open source license

@shivaram
Copy link
Contributor

Jenkins, ok to test

@shivaram
Copy link
Contributor

@neilalex Can you add the code snippet in the PR description as a new test case ? That way we will ensure this behavior is tested going forward

@neilalex
Copy link
Contributor Author

sure

@SparkQA
Copy link

SparkQA commented Jan 22, 2018

Test build #86494 has finished for PR 20352 at commit f8ae698.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@neilalex
Copy link
Contributor Author

@shivaram alright, should be good with the tests now -- let me know how it seems

@SparkQA
Copy link

SparkQA commented Jan 23, 2018

Test build #86500 has finished for PR 20352 at commit 01fc9e1.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@shivaram
Copy link
Contributor

Thanks @neilalex - Change LGTM. Lets also see if @felixcheung has any comments.

@felixcheung
Copy link
Member

felixcheung commented Jan 23, 2018 via email

asfgit pushed a commit that referenced this pull request Jan 24, 2018
…SparkR DataFrame

## What changes were proposed in this pull request?

A fix to https://issues.apache.org/jira/browse/SPARK-21727, "Operating on an ArrayType in a SparkR DataFrame throws error"

## How was this patch tested?

- Ran tests at R\pkg\tests\run-all.R (see below attached results)
- Tested the following lines in SparkR, which now seem to execute without error:

```
indices <- 1:4
myDf <- data.frame(indices)
myDf$data <- list(rep(0, 20))
mySparkDf <- as.DataFrame(myDf)
collect(mySparkDf)
```

[2018-01-22 SPARK-21727 Test Results.txt](https://github.com/apache/spark/files/1653535/2018-01-22.SPARK-21727.Test.Results.txt)

felixcheung yanboliang sun-rui shivaram

_The contribution is my original work and I license the work to the project under the project’s open source license_

Author: neilalex <neil@neilalex.com>

Closes #20352 from neilalex/neilalex-sparkr-arraytype.

(cherry picked from commit f54b65c)
Signed-off-by: Felix Cheung <felixcheung@apache.org>
@asfgit asfgit closed this in f54b65c Jan 24, 2018
@felixcheung
Copy link
Member

merged to master/2.3. we could revisit migration guide if necessary.
thanks!

@neilalex
Copy link
Contributor Author

@felixcheung @yanboliang @shivaram thank you for your guidance!

@neilalex neilalex deleted the neilalex-sparkr-arraytype branch January 24, 2018 14:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
4 participants