-
Notifications
You must be signed in to change notification settings - Fork 28.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-12645] [SparkR] SparkR support hash function #10597
Conversation
Test build #48757 has finished for PR 10597 at commit
|
cc @sun-rui |
c41eb1f
to
995fd06
Compare
Test build #48820 has finished for PR 10597 at commit
|
LGTM |
looks good. no conflict with base/stats |
LGTM. Thanks @yanboliang - Merging this to master and |
Add ```hash``` function for SparkR ```DataFrame```. Author: Yanbo Liang <ybliang8@gmail.com> Closes #10597 from yanboliang/spark-12645. (cherry picked from commit 3d77cff) Signed-off-by: Shivaram Venkataraman <shivaram@cs.berkeley.edu>
It breaks hadoop 1 test (https://amplab.cs.berkeley.edu/jenkins/job/spark-branch-1.6-test-sbt-hadoop-1.0/30/console). Can you take a look? |
Hmm is |
looks like 2.2 tests are broken as well. How about we revert it from 1.6 for now? If it is good, can you do the revert? Thanks! |
sure - sounds good to me. Can you open a JIRA and cc me and @yanboliang on it ? |
let's just reuse the original jira. I will reopen it. |
Actually, I am not sure if we should merge this to branch 1.6 since it is a new feature. |
The SparkR module is still considered alpha, so we do include new features in minor updates when it is appropriate. In this case it was a small new function, so it seemed fine to me. |
it looks like
so shouldn't be in 1.6.0 |
@shivaram I will revert it from 1.6 branch. |
reverted from 1.6 |
@yanboliang Could you test why this doesn't with branch 1.6 ? |
@shivaram in my comment above, hash was added only in 2.0.0. |
@shivaram Just like @felixcheung commented, the |
Ah I didn't notice @felixcheung earlier comment. Thanks for clarifying. I guess there is nothing to do here as the JIRA rightly says this feature is fixed in 2.0.0 |
Add
hash
function for SparkRDataFrame
.