Develop #127

alattner · 2020-05-07T12:19:38Z

No description provided.

Fix Dask hyperparam search example to work with latest sklearn&dask

Update example on how to use Dask to do grid search

An attempt to fix an issue on Travis where rpy2 is installed with pip (in addition to conda) and appears to be buggy.

Remove julia and rpy2 from docs extra requirements

The idea is that we sometimes want to attach files to models, such as HTML reports or the like, and in the backend, these files should be stored separately, to allow easy access. Here we're implemeting this idea for the 'FileLike' model persister and testing it for the 'File' subclass. This should work for 'Rest' and 'S3' as well, but I thought it's best to add tests when we all agreed on the idea. Usage is demonstrated in 'TestFileAttachments'. The contract is as follows: Use 'palladium.util.annotate' to add an arbitrary number of attachments to the model, like so: ``` annotate(model1, {'attachments/myatt.txt': 'aGV5', 'attachments/my2ndatt.txt': 'aG8='}) ``` Note that the keys of such attachments must start with 'attachments/', with the rest indicating a filename. The values must be base64 encoded but converted from bytes to strings. This is arguably a bit awkward, but we do this because the attachments dictionary must in general be JSON serializable, and using bytes would violate this. When 'model1' is persisted, 'FileLike' will create one file for each attachment and call them 'model-1-myatt.txt' and 'model-1-my2ndatt.txt'. The implementation chooses to use flat files rather than a folder to hold all attachments for a given model. This is done so that we do not need to add extra methods to 'FileLikeIO' (such as mkdir), which means we should get support for other 'FileLike' implementations such as 'Rest' and 'S3' for free. Moreover, the attachments will be removed from the model's pickle and from the metadata files, in order not to blow up the size of those. When the model is loaded back through the model persister, the attachments are loaded and put back into the model's metadata dictionary. What's a good time to add the attachments to the model? Use the 'write_model_decorators' pluggable decorator hook to add a decorator that adds your attachment just before it's persisted. A toy example: ``` def my_write_model_decorator(self, model): report = my_make_report(model) # assume returns an HTML string report_encoded = b64encode(report.encode('utf-8')).decode('ascii') annotate(model, {'attachments/report.html': report_encoded}) ``` Let me know what you think. Once we've settled on the right way to do this, we'll put this into proper docs and examples.

…k-as-factory In configuration, use exclamation mark '!' instead of '__factory__'

…ples Examples on how to use Keras and XGBoost with Palladium

Proposal implementation for handling model attachments

avoid loading stale metadata in S3 persister

As suggested by yv in #124

Used https://pypi.org/project/pur/ for update for requirements.

Feature/update dependencies

coveralls · 2020-05-07T12:24:49Z

Coverage decreased (-9.9%) to 89.847% when pulling 99bf061 on develop into 20e369b on master.

dnouri and others added 23 commits August 29, 2019 14:32

Update example on how to use Dask to do grid search

9f5ef37

Fix Dask hyperparam search example to work with latest sklearn&dask

a2b5760

Merge pull request #117 from ottogroup/bugfix/docs-parallelism

a69e18a

Fix Dask hyperparam search example to work with latest sklearn&dask

Merge pull request #116 from ottogroup/features/docs-dask-hyperparameter

fd9d57c

Update example on how to use Dask to do grid search

Back merge master

e3e931c

Bumped version

336466b

Remove julia and rpy2 from docs extra requirements

2a03acb

An attempt to fix an issue on Travis where rpy2 is installed with pip (in addition to conda) and appears to be buggy.

Merge pull request #120 from ottogroup/bugfix/travis-docs-require

28fa35d

Remove julia and rpy2 from docs extra requirements

Examples on how to use Keras and XGBoost with Palladium

9207271

In configuration, use exclamation mark '!' instead of '__factory__'

22939c5

Merge pull request #122 from ottogroup/feature/config-exclamation-mar…

6a363c8

…k-as-factory In configuration, use exclamation mark '!' instead of '__factory__'

Merge pull request #121 from ottogroup/feature/keras-and-xgboost-exam…

1be16ce

…ples Examples on how to use Keras and XGBoost with Palladium

Merge pull request #119 from ottogroup/feature/model-attachments

a4efa8a

Proposal implementation for handling model attachments

avoid loading stale metadata in S3 persister

ed6842d

Merge pull request #125 from yv/issue/fix-invalid-json

312d341

avoid loading stale metadata in S3 persister

Added path to error message when no active model can be found

41fd2a2

As suggested by yv in #124

Bump psutil version to 5.7.0 to fix security vulnerability

f29a3ce

Fixed no active model exists test

0dfcb12

Updated dependencies

e235059

Used https://pypi.org/project/pur/ for update for requirements.

Fixed flaky tests

304dad3

Bumped version

8ddf698

Merge pull request #126 from ottogroup/feature/update-dependencies

99bf061

Feature/update dependencies

alattner merged commit 3e7bd7d into master May 7, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Develop #127

Develop #127

alattner commented May 7, 2020

coveralls commented May 7, 2020

Develop #127

Develop #127

Conversation

alattner commented May 7, 2020

coveralls commented May 7, 2020