
DOC, TST: Wrapping of PyTorch models #699

Merged: 43 commits merged into dask:master on Jul 29, 2020

Conversation

@stsievert (Member) commented Jul 15, 2020:

What does this PR implement?
It provides wrappers for Keras and PyTorch models, primarily aimed at model selection, by relying on SciKeras and Skorch.

Of course, this is a very thin wrapper. I think it's warranted for the following reasons:

  • PyTorch and Keras/TensorFlow are very popular. No other deep learning library comes close (e.g., Chainer or MXNet), and both PyTorch and Keras are about 15x more popular than Scikit-learn on Google Trends.
  • Out of the box, Skorch is not suited for model selection: it prints too much, performs a validation split by default, and runs too many epochs per partial_fit call.

References issues/PRs

This PR will be a WIP until adriangb/scikeras#19 is resolved.

Edit: Now the Dask-ML documentation shows the following in the sidebar:
[Screenshot: Dask-ML documentation sidebar]
A "Keras" bullet will be added when #713 is merged.

@stsievert changed the title from "ENH: Wrap PyTorch/Keras models" to "WIP: ENH: Wrap PyTorch/Keras models" on Jul 15, 2020
@TomAugspurger (Member) left a comment:

Can you expand a bit more on the value added by our wrappers? In particular, to what extent can the changes here be pushed upstream? For example

Out of the box, Skorch is not suited for model selection: it prints too much, performs a validation split by default, and runs too many epochs per partial_fit call.

Could the Skorch defaults be changed? Could we document appropriate defaults to use instead?
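
For reference, here is a minimal sketch of what model-selection-friendly, non-default Skorch settings might look like (TinyModule is a placeholder, not part of this PR):

import torch.nn as nn
from skorch import NeuralNetClassifier

class TinyModule(nn.Module):
    # Placeholder network; any torch.nn.Module would do here.
    def __init__(self, hidden=4):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(10, hidden), nn.ReLU(), nn.Linear(hidden, 2), nn.Softmax(dim=-1)
        )

    def forward(self, X):
        return self.net(X)

net = NeuralNetClassifier(
    TinyModule,
    verbose=0,         # silence Skorch's per-epoch printing
    train_split=None,  # disable the internal validation split
    max_epochs=1,      # one epoch per call to (partial_)fit
)

With settings like these, each partial_fit call makes roughly one pass over the data it receives, which suits adaptive searches that call partial_fit repeatedly.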

Regardless of how we handle the wrappers, I think there is value in ensuring that our model selection algorithms work with these estimators. So having the tests, and ideally examples at dask-examples, will be valuable.

@@ -30,3 +30,7 @@ dependencies:
   - pip
   - pip:
     - pytest-azurepipelines
+    - tensorflow
Member commented:

Can you keep as much as possible in the conda section? Especially for things like tensorflow & torch.

Also... I don't feel great about including these huge dependencies just for a small subset of the library. I like how dask-gateway does things https://github.com/dask/dask-gateway/blob/master/.travis.yml#L37. That's using Travis. Can you see if something similar is possible for azure-pipelines?

@stsievert (Member, Author) commented Jul 15, 2020:

Would it be okay to only run the PyTorch/Keras tests on the master branch? I'm not sure I can get a commit message trigger working, but a branch trigger looks straightforward.

Member commented:

Yeah I think that's fine.
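
For illustration, a branch-gated install step in Azure Pipelines YAML might look roughly like this (the step name and package list are assumptions, not the final ci/posix.yaml):

- script: pip install torch skorch
  displayName: Install deep learning test dependencies
  condition: eq(variables['Build.SourceBranchName'], 'master')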

Member commented:

Will probably need to remove these here, and add a secondary conda / pip install that only runs on certain jobs.

@stsievert (Member, Author) commented:

Thanks. That's in posix.yaml. This PR isn't ready for merge; SciKeras should have a new release soon that incorporates the recent changes.

@stsievert (Member, Author) commented:

Azure Pipelines is installing SciKeras from master right now.

(Resolved review threads on dask_ml/wrappers.py)
@mrocklin (Member) commented:

What would be necessary to support PyTorch/Keras natively? Could we provide a function that called fit/partial_fit on Scikit-Learn estimators and something else on torch/keras models?

@TomAugspurger (Member) commented Jul 15, 2020 via email

@mrocklin (Member) commented Jul 15, 2020 via email

@stsievert (Member, Author) commented:

Can you expand a bit more on the value added by our wrappers? In particular, to what extent can the changes here be pushed upstream? ... having the tests, and ideally examples at dask-examples will be valuable

That's part of why I opened this PR. I think having tests, an example at dask-examples and another Dask-ML documentation page would suffice to meet my goals. I aim for these implementations to be easy to discover and work well with model selection. I think those needs are met without adding an implementation in dask_ml.wrappers.

I'll rework this PR to be focused on documentation/testing, and open a PR in dask-examples.

@stsievert changed the title from "WIP: ENH: Wrap PyTorch/Keras models" to "WIP: DOC, TST: Wrapping of PyTorch/Keras models" on Jul 15, 2020
@stsievert changed the title from "WIP: DOC, TST: Wrapping of PyTorch/Keras models" to "DOC, TST: Wrapping of PyTorch/Keras models" on Jul 15, 2020
@stsievert (Member, Author) commented:

Could we provide a function that called fit/partial_fit on Scikit-Learn estimators and something else on torch/keras models?

For model selection, not really; @TomAugspurger is right. I suppose it would be possible to hack together a solution that runs arbitrary training functions using distributed Variables:

import time
from typing import Any, Dict

from distributed import Variable, get_client
from sklearn.base import BaseEstimator


def train_model(hidden=4, epochs=10, data: Variable = None, stop: Variable = None):
    model = create_model(hidden=hidden)  # create_model builds the Keras model (not shown)
    X_train, y_train, X_test, y_test = ...  # load and split the data (not shown)
    for epoch in range(epochs):
        model.fit(X_train, y_train, epochs=1)
        score = model.evaluate(X_test, y_test)  # Keras's evaluate returns the loss; it stands in for a score here
        datum = {"score": score, "pf_calls": epoch}
        d = data.get()
        data.set(d + [datum])
        if stop.get():
            break


class FunctionTrainer(BaseEstimator):
    def __init__(self, fn, **kwargs):
        vars(self).update(kwargs)
        self.fn = fn
        self._data = Variable("_data")
        self._stop = Variable("_stop")
        self._pf_calls = 0

    def _wait_for_training_to_complete(self) -> Dict[str, Any]:
        # Block until the training loop has published a datum for the epoch
        # this estimator is waiting on.
        while True:
            data = self._data.get()
            pf_calls = {d["pf_calls"] for d in data}
            if self._pf_calls in pf_calls:
                break
            time.sleep(0.1)
        return data[-1]

    def _initialize(self):
        self._data.set([])
        self._stop.set(False)
        # get_params can't see parameters hidden behind **kwargs, so collect
        # the public attributes that __init__ set instead.
        kwargs = {k: v for k, v in vars(self).items()
                  if not k.startswith("_") and k != "fn"}
        client = get_client()
        # Keep a reference to the future so Dask doesn't cancel the task.
        self._future = client.submit(self.fn, data=self._data, stop=self._stop, **kwargs)

    def _stop_training(self):
        self._stop.set(True)

    def partial_fit(self, X, y):
        if self._pf_calls == 0:
            self._initialize()
        self._wait_for_training_to_complete()
        self._pf_calls += 1
        return self

    def score(self, X, y):
        datum = self._wait_for_training_to_complete()
        return datum["score"]

We could make this work with Dask-ML's model selection; it'd have to call FunctionTrainer._stop_training when it kills a model.

Training a single model is easier. The PyTorch/Keras implementation would be similar to dask-glm: the model and the optimization would reside client-side, and the Dask workers would be tasked with computing gradients.
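
A rough sketch of that pattern, assuming a hypothetical make_model factory and a list of preloaded (X, y) tensor chunks called data_chunks; the optimizer stays on the client while workers compute per-chunk gradients:

import torch
from distributed import Client

def chunk_gradient(state_dict, X, y):
    # Rebuild the model on the worker and compute this chunk's gradients.
    model = make_model()  # hypothetical model factory
    model.load_state_dict(state_dict)
    loss = torch.nn.functional.mse_loss(model(X), y)
    loss.backward()
    return [p.grad for p in model.parameters()]

client = Client()
model = make_model()
opt = torch.optim.SGD(model.parameters(), lr=0.1)
for step in range(10):
    futures = [
        client.submit(chunk_gradient, model.state_dict(), X, y)
        for X, y in data_chunks  # hypothetical (X, y) chunks
    ]
    per_chunk = client.gather(futures)
    # Average the per-chunk gradients into the client-side model, then step.
    for param, grads in zip(model.parameters(), zip(*per_chunk)):
        param.grad = torch.stack(grads).mean(dim=0)
    opt.step()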

(Resolved review threads on docs/source/keras.rst)

@TomAugspurger (Member) left a comment:

Looking pretty good @stsievert. I think there's a -y missing in the conda install.

I'll also try this out locally a little later.

(Resolved review threads on ci/posix.yaml)
stsievert and others added 3 commits July 24, 2020 14:47
Co-authored-by: Tom Augspurger <TomAugspurger@users.noreply.github.com>
@stsievert changed the title from "DOC, TST: Wrapping of PyTorch/Keras models" to "DOC, TST: Wrapping of PyTorch models" on Jul 26, 2020
@stsievert (Member, Author) commented:

I've put Keras in a separate PR, #713. I think it needs some more work, and I don't think it should block this PR. For more detail on the issues, see #713 (comment).

Now this PR focuses on documenting a PyTorch wrapper and reorganizing the documentation.

@stsievert (Member, Author) commented:

Are we particularly tied to isort 4.3.21? I cannot get pytest.importorskip followed by the torch imports to work with it. It works under isort >= 5, which supports action comments like "isort: skip" or "isort: split".
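
For example, with isort >= 5 an action comment keeps the torch imports below the importorskip call (a sketch of the pattern, not the actual test file):

import pytest

pytest.importorskip("torch")

# isort sorts everything below this action comment as a separate section,
# so these imports stay after the importorskip call.
# isort: split
import torch
import skorch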

@TomAugspurger (Member) commented:

Are we particularly tied to isort 4.3.21?

No, but I don't think we'd want those anyway. 5b6e20c will hopefully work.

@TomAugspurger (Member) left a comment:

(Review threads on ci/posix.yaml)
@TomAugspurger (Member) commented:

Ah @stsievert, I think torch & skorch are being installed into the base conda environment, but the tests are run in the dask-ml-test env.

@stsievert (Member, Author) commented:

The Windows CI is failing on the model selection tests test_small and test_warns_scores_per_fit. The traceback reports TimeoutError or CancelledError; both tests pass on my non-Windows machine.

@TomAugspurger (Member) left a comment:

Great, thanks. I see that error on 25-50% of the CI runs.

I'll push a commit uncommenting the condition.

(Resolved review thread on ci/posix.yaml)
@TomAugspurger merged commit 5c3179e into dask:master on Jul 29, 2020
@TomAugspurger (Member) commented:

Thanks @stsievert!
