[air] Add batch predictor class #23808

krfricke · 2022-04-08T23:33:28Z

Why are these changes needed?

What: This class adds a generic BatchPredictor class that offers an interface to run batch inference on Ray datasets. It takes a Predictor class and checkpoint as an input, and provides a predict(dataset) method to run scalable scoring inference.

Why: Currently users have to implement scorers themselves. This is mostly boilerplate and prone to errors, so we should provide a simple solution instead.

Note that this predictor also implements the Predictor interface.

Related issue number

Checks

I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Yard1 · 2022-04-09T00:04:54Z

I don't really like the score nomenclature here. Scoring implies that you will receive a score - a single value. Can we do BatchPredictor, or something similar?

clarkzinzow · 2022-04-08T23:48:19Z

python/ray/ml/scorer.py

+            batch_size: Split dataset into batches of this size for prediction.
+            max_scoring_actors: If set, specify the maximum number of scoring actors.
+            ray_remote_args: Additional resource requirements to request from
+                ray (e.g., num_gpus=1 to request GPUs for the map tasks).


I'm wondering if num_gpus should be a top-level arg, since GPU-based batch inference is pretty fundamental. 🤔

Agreed, this should be easily accessible. I've added it to the top level API

clarkzinzow · 2022-04-09T01:36:00Z

python/ray/ml/tests/test_predictor.py

+    scorer = BatchScorer(DummyPredictor, Checkpoint.from_dict({"factor": 2.0}))
+
+    test_dataset = ray.data.from_items([1.0, 2.0, 3.0, 4.0])
+    assert scorer.score(test_dataset).to_pandas().to_numpy().squeeze().tolist() == [


If looking for a more concise conversion + comparison, content should be assertable without this chain, something like:

assert scorer.score(test_dataset).take() == [2.0, 4.0, 6.0, 8.0]

This leads to

E AssertionError: assert [{'value': 8.0}, {'value': 6.0}, {'value': 4.0}, {'value': 2.0}] == [2.0, 4.0, 6.0, 8.0]

how do I get them as a series and not dicts?

Also, the order problem seems to remain, any insights on that?

krfricke · 2022-04-11T08:36:40Z

Regarding the naming, I don't think that "scoring" implies it's only 1 measure, but yeah, it's not ideal because we're not actually scoring a dataset, but perform inference on it.

I wouldn't want to go with BatchPredictor or anything *Predictor, as we do have a top-level Predictor interface and this wrapper does not implement it.

Maybe BatchInference?

Yard1 · 2022-04-11T16:46:44Z

BatchInference is better, yeah.

krfricke · 2022-04-11T16:59:38Z

Actually, we can go with BatchPredictor if we implement the Predictor interface - which works quite well here tbh. I've changed this in the last commit, let me know what you think

Yard1 · 2022-04-11T17:03:49Z

BatchPredictor is even better :)

python/ray/ml/predictor.py

python/ray/ml/tests/test_predictor.py

python/ray/ml/predictor.py

ericl · 2022-04-12T22:29:59Z

doc/source/ray-air/getting-started.rst

@@ -89,6 +89,9 @@ Predictors
 .. autoclass:: ray.ml.predictor.Predictor
    :members:

+.. autoclass:: ray.ml.batch_predictor.BatchPredictor


[air/wip] Add batch scorer class

d728efd

krfricke assigned matthewdeng and clarkzinzow Apr 8, 2022

krfricke requested review from matthewdeng and clarkzinzow April 8, 2022 23:34

typo

5a5fbe7

Merge remote-tracking branch 'upstream/master' into air/scorer

c512c6f

clarkzinzow reviewed Apr 9, 2022

View reviewed changes

Kai Fricke added 2 commits April 11, 2022 09:30

Merge remote-tracking branch 'upstream/master' into air/scorer

fce740d

Add num_gpus as top level arg

8e1f7f0

BatchScorer --> BatchPredictor

09fc165

krfricke marked this pull request as ready for review April 11, 2022 16:59

krfricke requested review from ericl and richardliaw April 11, 2022 16:59

krfricke assigned ericl and richardliaw Apr 11, 2022

krfricke changed the title ~~[air/wip] Add batch scorer class~~ [air/wip] Add batch predictor class Apr 11, 2022

Yard1 reviewed Apr 11, 2022

View reviewed changes

python/ray/ml/predictor.py Outdated Show resolved Hide resolved

Kai Fricke added 2 commits April 11, 2022 18:14

num_cpus

17d2d1e

Batch prediction test

286b5dc

krfricke commented Apr 11, 2022

View reviewed changes

python/ray/ml/tests/test_predictor.py Outdated Show resolved Hide resolved

ericl reviewed Apr 11, 2022

View reviewed changes

python/ray/ml/predictor.py Outdated Show resolved Hide resolved

ericl reviewed Apr 11, 2022

View reviewed changes

python/ray/ml/predictor.py Outdated Show resolved Hide resolved

ericl reviewed Apr 11, 2022

View reviewed changes

python/ray/ml/predictor.py Outdated Show resolved Hide resolved

ericl added the @author-action-required The PR author is responsible for the next step. Remove tag to send back to the reviewer. label Apr 11, 2022

Kai Fricke added 2 commits April 12, 2022 08:43

Merge remote-tracking branch 'upstream/master' into air/scorer

eb4a075

Move into separate file

3b7e64e

krfricke removed the @author-action-required The PR author is responsible for the next step. Remove tag to send back to the reviewer. label Apr 12, 2022

Fix docs

a44c4f5

ericl approved these changes Apr 12, 2022

View reviewed changes

ericl added the @author-action-required The PR author is responsible for the next step. Remove tag to send back to the reviewer. label Apr 12, 2022

ericl reviewed Apr 12, 2022

View reviewed changes

krfricke merged commit 40d3a62 into ray-project:master Apr 13, 2022

krfricke deleted the air/scorer branch April 13, 2022 07:58

amogkam changed the title ~~[air/wip] Add batch predictor class~~ [air] Add batch predictor class Apr 20, 2022

amogkam mentioned this pull request Apr 20, 2022

[air] LightGBM Trainer should use num_cpus_per_actor=2 by default #23449

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[air] Add batch predictor class #23808

[air] Add batch predictor class #23808

krfricke commented Apr 8, 2022 •

edited

Yard1 commented Apr 9, 2022

clarkzinzow Apr 8, 2022

krfricke Apr 11, 2022

clarkzinzow Apr 9, 2022

krfricke Apr 11, 2022

krfricke commented Apr 11, 2022

Yard1 commented Apr 11, 2022

krfricke commented Apr 11, 2022

Yard1 commented Apr 11, 2022

ericl Apr 12, 2022

[air] Add batch predictor class #23808

[air] Add batch predictor class #23808

Conversation

krfricke commented Apr 8, 2022 • edited

Why are these changes needed?

Related issue number

Checks

Yard1 commented Apr 9, 2022

clarkzinzow Apr 8, 2022

Choose a reason for hiding this comment

krfricke Apr 11, 2022

Choose a reason for hiding this comment

clarkzinzow Apr 9, 2022

Choose a reason for hiding this comment

krfricke Apr 11, 2022

Choose a reason for hiding this comment

krfricke commented Apr 11, 2022

Yard1 commented Apr 11, 2022

krfricke commented Apr 11, 2022

Yard1 commented Apr 11, 2022

ericl Apr 12, 2022

Choose a reason for hiding this comment

krfricke commented Apr 8, 2022 •

edited