
[REP] Refining the Ray AIR Surface API #36

Merged: 12 commits merged into main on Aug 7, 2023
Conversation

@pcmoritz (Collaborator) commented Jul 9, 2023

This REP proposes to remove the ray.air namespace and put the functionality into the respective libraries Ray Data, Ray Serve and Ray Train.

reps/2023-07-08-air-surface-syntax.md

## Open Questions

We are likely going to remove `PredictorWrapper` and `PredictorDeployment` and migrate the examples to use Ray Serve deployments
Contributor:

Could we flesh out an example of this migration in the REP? It may make the implications of this change clearer.

Collaborator Author:

Yes, I will do that, great point.

Collaborator Author:

Added now.

@krfricke (Contributor) left a comment:

I'm fine with this proposal and only have detail questions left.

Just for completeness (as it's not mentioned in the REP): an alternative resolution is to move everything into AIR instead. We would then have `ray.air.training`, `ray.air.tuning`, etc. In such a world we would soften the boundaries between the libraries and keep shared modules in the AIR namespace. By moving the libraries "one level down" we would enforce a separation from Ray Core and double down on AIR as an umbrella for our downstream libraries.

It also avoids the question of where to put which modules. Integrations and callbacks can quite naturally remain in `ray.air`, and both the training and tuning modules can access them.

reps/2023-07-08-air-surface-syntax.md

## Open Questions

We are likely going to remove `PredictorWrapper` and `PredictorDeployment` and migrate the examples to use Ray Serve deployments directly, and we are also likely going to move `air.integrations` to `train.integrations` and tentatively the predictors to `ray.train`.
Contributor:

There are a few more:

  • `air.callbacks` --> also train?
  • `air.examples` for cross-library examples --> docs? (related question: how do we restructure the docs? I guess out of scope for this REP...)
  • `air.execution` --> tentatively train?
  • `air.util` -- looks like mostly data?

Collaborator Author:

Thanks for bringing these up :)

`air.callbacks` has already been moved to `air.integrations`, which will move to `train.integrations`, right?

`air.examples`: these should not be living in the source tree, so yes, docs would probably be the right place. I don't consider `air.examples` to be part of the AIR API. On the docs: @richardliaw is owning the docs restructuring :)

`air.execution`: these are internal APIs, so they are not a concern for this REP. We should feel free to put them into some utility namespace of either `train` or `tune`, depending on what makes the most sense going forward.

`air.util`: the tensor extension stuff should go into `ray.data` and the torch stuff into `ray.train.torch` (I'm a bit confused about whether the latter is a public API or not -- it seems to be used in the OPT example at the moment -- we should clarify that and handle it appropriately). Is there anything else here we need to decide about?
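The module moves discussed in this thread could be summarized as a mapping. A minimal sketch (the targets for `air.execution` and `air.util` are tentative per the discussion above, and the `migrate_import` helper is purely illustrative, not a real Ray utility):

```python
from typing import Optional

# Tentative mapping of ray.air submodules to their proposed new homes, as
# discussed in this thread. Entries marked tentative are assumptions made
# for illustration, not final decisions; None means "leaves the source tree".
AIR_MIGRATION_MAP = {
    "ray.air.integrations": "ray.train.integrations",  # agreed above
    "ray.air.callbacks": "ray.train.integrations",     # already folded into integrations
    "ray.air.examples": None,                          # move to docs, not the source tree
    "ray.air.execution": "ray.train",                  # tentative (internal APIs)
    "ray.air.util.tensor_extensions": "ray.data",      # tentative
    "ray.air.util.torch": "ray.train.torch",           # tentative
}

def migrate_import(module_path: str) -> Optional[str]:
    """Return the proposed new module path, or None if the module leaves
    the source tree entirely (e.g. examples moving to the docs)."""
    for old, new in AIR_MIGRATION_MAP.items():
        if module_path == old or module_path.startswith(old + "."):
            return None if new is None else new + module_path[len(old):]
    return module_path  # not under ray.air: unchanged
```

For example, `migrate_import("ray.air.integrations.wandb")` would yield `"ray.train.integrations.wandb"` under this tentative map.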

Contributor:

Would it make sense to capture all these in the REP to make it more comprehensive, i.e. to describe what it takes to fully "Disband the ray.air namespace"?

@pcmoritz (Collaborator Author):
On your higher-level comment: not only would introducing `ray.air.training` and `ray.air.tuning` (and we would also need `ray.air.serving`) require far more API changes, it would also lead to a worse outcome -- everything actually fits neatly into one of the libraries, and there is really no need for a `ray.air` namespace. The libraries `ray.train`, `ray.tune`, `ray.serve`, `ray.data`, and `ray.rllib` are already well separated from Ray Core, so I don't think there is danger of confusion here :)


@amogkam (Contributor) commented Jul 12, 2023

The only concern: are we OK with this type of usage for the Ray Tune-only use case?

```python
from ray import tune
from ray import train

def objective(config):
    score = config["a"] ** 2 + config["b"]
    train.report({"score": score})

search_space = {
    "a": tune.grid_search([0.001, 0.01, 0.1, 1.0]),
    "b": tune.choice([1, 2, 3]),
}

tuner = tune.Tuner(objective, param_space=search_space)

results = tuner.fit()
print(results.get_best_result(metric="score", mode="min").config)
```

This exposes the `ray.train` namespace to Tune-only users, which may lead to confusion (why do I need to import Ray Train if I am just using Ray Tune?). This was one of the motivations for consolidating under `ray.air`, so I just want to make sure we are considering this case.

```python
@serve.deployment
class XGBoostService:
    def __init__(self, checkpoint):
        self.predictor = XGBoostPredictor.from_checkpoint(checkpoint)
```
Contributor:

Suggested change:

```python
# Before:
self.predictor = XGBoostPredictor.from_checkpoint(checkpoint)

# After:
local_dir = checkpoint.to_directory()
with open(local_dir + "/saved_booster.pkl", "rb") as f:
    self.model: xgboost.Booster = pickle.load(f)
```

Since we are de-emphasizing predictors, shall we also change the example to show something like this instead?
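The suggestion above relies on the checkpoint-as-directory pattern: the training side pickles the raw model into the checkpoint directory, and the deployment side unpickles it without going through a Predictor. A minimal stdlib-only sketch of that round trip (the file name `saved_booster.pkl` is taken from the suggestion; the stand-in model class is hypothetical, since running real XGBoost is out of scope here):

```python
import os
import pickle
import tempfile

# Hypothetical stand-in for an xgboost.Booster; any picklable
# model object works the same way.
class FakeBooster:
    def predict(self, xs):
        return [x * 2 for x in xs]

def save_to_checkpoint_dir(model, directory: str) -> str:
    """Mimics the training side: serialize the raw model into the checkpoint dir."""
    path = os.path.join(directory, "saved_booster.pkl")
    with open(path, "wb") as f:
        pickle.dump(model, f)
    return path

def load_from_checkpoint_dir(directory: str):
    """Mimics the deployment side (Checkpoint.to_directory() yields such a dir)."""
    with open(os.path.join(directory, "saved_booster.pkl"), "rb") as f:
        return pickle.load(f)

with tempfile.TemporaryDirectory() as d:
    save_to_checkpoint_dir(FakeBooster(), d)
    model = load_from_checkpoint_dir(d)
    preds = model.predict([1, 2, 3])  # [2, 4, 6]
```

In a real deployment the `load_from_checkpoint_dir` step would live in the Serve deployment's `__init__`, as in the suggested change above.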

@krfricke (Contributor) left a comment:

Thanks for the updates!

pcmoritz added a commit to ray-project/ray that referenced this pull request Jul 29, 2023
@pcmoritz (Collaborator Author):

@amogkam We decided that it is OK for Tune to depend on Train, since tuning training runs is going to be the most important use case of Ray Tune, and that is also how most machine learning engineers and practitioners think about the relationship between training and tuning :)

@matthewdeng (Contributor) left a comment:

Looks great and feels a lot cleaner!

pcmoritz and others added 12 commits July 31, 2023 16:47

Signed-off-by: Philipp Moritz <pcmoritz@gmail.com>
Co-authored-by: Eric Liang <ekhliang@gmail.com>
@ericl (Contributor) commented Aug 4, 2023

@zhe-thoughts this can be merged

@zhe-thoughts merged commit abb378e into main on Aug 7, 2023
1 check passed
@zhe-thoughts (Collaborator):

Thanks for the work @pcmoritz @ericl @krfricke @matthewdeng @amogkam

NripeshN pushed a commit to NripeshN/ray that referenced this pull request Aug 15, 2023
…ect#37906)

This is bringing the API up-to-date with ray-project/enhancements#36

Signed-off-by: NripeshN <nn2012@hw.ac.uk>
harborn pushed a commit to harborn/ray that referenced this pull request Aug 17, 2023
…ect#37906)

This is bringing the API up-to-date with ray-project/enhancements#36

Signed-off-by: harborn <gangsheng.wu@intel.com>
arvind-chandra pushed a commit to lmco/ray that referenced this pull request Aug 31, 2023
…ect#37906)

This is bringing the API up-to-date with ray-project/enhancements#36

Signed-off-by: e428265 <arvind.chandramouli@lmco.com>
vymao pushed a commit to vymao/ray that referenced this pull request Oct 11, 2023
…ect#37906)

This is bringing the API up-to-date with ray-project/enhancements#36

Signed-off-by: Victor <vctr.y.m@example.com>