New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-7000] [ml] Refactor prediction and tree abstractions to be under ml.prediction subpackage #5585
Conversation
Test build #30574 has finished for PR 5585 at commit
|
@jkbradley I don't quite understand the purpose of this PR, e.g., why |
It's mainly to reduce clutter in the spark.ml namespace. We'll get more and more items shared between classification and regression:
Once the prediction Dev APIs are made public (Predictor, etc.), then we'll have a spark.ml.prediction subpackage anyways. At that point, tree and ensemble abstractions seem like they would belong in that subpackage, rather than in the spark.ml namespace. I'm OK if you prefer to keep these items in the .ml namespace, but if you're ambivalent, then I'd prefer fewer subpackages under spark.ml |
Oh, I misread one thing you wrote: |
One more thought: Later on, I could imagine us having other types of trees, such as for hierarchical clustering. Those would live under the |
|
Ok, so you'd vote for having separate subpackages for each type of classification/prediction abstraction?
|
What do we want to put under |
I'm not sure what else would go under |
Closing this pending discussions |
I'm copying the 2 small edits to [https://github.com//pull/5567] |
From JIRA:
I refactored using IntelliJ.
The only additional changes I made were:
CC: @mengxr