Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SW-631] Ensure that we do not access RDDs in pipelines ( to unblock … #511

Merged
merged 3 commits into from
Dec 27, 2017

Conversation

jakubhava
Copy link
Contributor

@jakubhava jakubhava commented Dec 21, 2017

…the deployment)

This changes follows the code from H2O world you implemented to by-pass using RDDs as part of the pipelines. I have also written test for it where we load pipeline model trained in Pysparkling, run predictions on it (mojos) and then we train pipeline model right away in scala, do predictions and compare the outputs.

@jakubhava jakubhava merged commit 8d6d8f6 into master Dec 27, 2017
@jakubhava jakubhava deleted the jh/jira/sw-631 branch December 27, 2017 14:14
jakubhava added a commit that referenced this pull request Jan 3, 2018
jakubhava added a commit that referenced this pull request Jan 3, 2018
jakubhava added a commit that referenced this pull request Jan 3, 2018
jakubhava added a commit that referenced this pull request Jan 4, 2018
…unblock … (#511)"

This reverts commit 0b7d0de.

On Spark 2.0 this works, however Spark is giving wrong results for
predictions. This is tracked under [SW-663]
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant