New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-23120][PYSPARK][ML] Add basic PMML export support to PySpark #21172
[SPARK-23120][PYSPARK][ML] Add basic PMML export support to PySpark #21172
Conversation
…nearRegressionModel as the first implementation
Test build #89899 has finished for PR 21172 at commit
|
Test build #89900 has finished for PR 21172 at commit
|
Fix sc ref
9b14a1c
to
64d6db8
Compare
Test build #89902 has finished for PR 21172 at commit
|
… have useful error messages
Test build #90234 has finished for PR 21172 at commit
|
Test build #90236 has finished for PR 21172 at commit
|
…been checking for PMML string
Test build #90515 has finished for PR 21172 at commit
|
Merged to master |
Here's a pointer to another PySpark-to-PMML conversion tool: https://github.com/jpmml/pyspark2pmml |
What changes were proposed in this pull request?
Adds basic PMML export support for Spark ML stages to PySpark as was previously done in Scala. Includes LinearRegressionModel as the first stage to implement.
How was this patch tested?
Doctest, the main testing work for this is on the Scala side. (TODO holden add the unittest once I finish locally).