-
Notifications
You must be signed in to change notification settings - Fork 325
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Integrate Mars DataFrame with Ray MLDataset #2294
Conversation
Initially implemented mldataset api commit before deep dive Initially tested mars with xgboost Code refactoring Initially implemented ray MLDataset integration
Is |
Code refactoring and bug fixes
611a2ca
to
f7f6d4e
Compare
f7f6d4e
to
cc60a01
Compare
4fdd0d8
to
b463133
Compare
b5ede98
to
ca125fe
Compare
ca125fe
to
9cb3faf
Compare
5424542
to
63ba725
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM overall, left some comments.
db74157
to
71e9ac6
Compare
71e9ac6
to
5fd5a80
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Rename all filters
as field
as filters
imply query conditions.
35d2720
to
7556758
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
What do these changes do?
In order that Mars can support converting Mars DataFrame to Ray MLDataset, the following changes were made.
RayMLDataset
to take over the entire process where Ray MLDataset is created from ParallelIterator.fetch
-like universal procedure namefetch_infos
so thatray.ObjectRef
is now reachable.Furthermore,
xgboost_ray
is initially tested working with Mars DataFrame. Integration with Ray Dataset will come in a future PR.Related issue number
Fixes #2230