Allow user to specify a selection heuristic #75

ablaom · 2020-09-11T00:55:26Z

Context: JuliaAI/MLJ.jl#487 #40
Replaces: #74

Currently the implementer of a new strategy implements a best method for deciding on how to extract the history entry corresponding to the "best" (optimal) model. All the current strategies apply the "naive" heuristic which simply miminizes (or maximises) the user-specified measure evaluations, aggregated over all samples (folds) in resampling.

This PR adds an interface point for the user to specify the "selection heuristic", with the tuning strategy surrendering that responsibility. The idea is that most "selection heuristics" would apply generically (ie, to any tuning strategy) but the PR leaves open the possibility that some future heuristics might be strategy-specific.

Formally, a selection heuristic is a subtype of a new abstract type SelectionHeuristic and a new concrete subtype implements a best method (and a trait supports_heuristic to indicate if it is generic or strategy-specific). An moderately sophisticated user could add their own custom heuristics; see here.

Only a single selection heuristic NaiveSelection is introduced in the current PR.

This PR will break the externally implemented strategy TreeParzen. I will be happy to make the necessary PR to TreeParzen.jl to fix this. The built-in strategies Grid and RandomSearch are fixed in this PR.

edit The models! method is replaced with non-mutating models method to make the whole interface "functional" (state objects can now be immutable, and models adds new_state to its return value).

New for users

Adds keyword selection_heuristic=... to constructor of TunedModel instances. Default is NaiveSelection(), which gives the existing behaviour (with an option to specify weights for multiple measures).

Breaking for users

This PR also tweaks the reporting for TunedModel instances. If mach is a machine trained on a TunedModel instance, and r = report(mach) then:

r.best_result no longer exists (replaced with r.best_history_entry)
r.history is now a vector of named tuples (true also for the internal representation of the history)
r.history excludes implementation-specific "model metadata" (necessarily included in the internal history)

Main changes for the tuning strategy API

No best method to implement
result method is renamed extras method with a simplified role (no need to handle "model metadata", which is automatically include in the history under the hood)
tuning_report has a simplified role (no need to generate the user form of history)
edit models! is replaced by a non-mutating models method with identical signature but newstate added to return value (now a tuple, (vector_of_models, new_state)) instead of vector_of_models).

For details refer to the revised README.md

To do before merging

Check that TreeParzen.jl can be adapted to new API (I will do this)

cc @yalwan-iqvia

more changes typos docs: changes to the form of history and output of report typos more typos minor typos implement changes in docs - first attempt with passing tests add forgotten file typo rename the default selection heurisic; add weights to it

ablaom · 2020-09-11T03:44:34Z

cc @azev77

yalwan-iqvia · 2020-09-11T06:38:54Z

I'm not going to straight up say it is definitely ok for us (because I didn't do anything yet more than a cursory look), however, should be ok. Most of our implementation is in models! only, we would have to redo result (extras) but its such a puny method I can't really see it being a massive deal provided all of the same information is still there.

As I said, an early "quick glance" conclusion based on what you've written here.

ablaom · 2020-09-13T21:26:26Z

Thanks for the feedback @iqml . As I say, I'm happy to check this out myself and make the necessary PR. Working on this today.

ablaom · 2020-09-14T04:41:42Z

Okay, it's possible to implement the new API at TreeParzen. This branch passes tests in conjunction with the current PR on my machine under Julia 1.4.

Below is a sketch of the necessary changes to TreeParzen.jl

@yalwan-iqvia I can give more details in the final PR if you are happy with the general direction.

results can be dumped. There is no need to implement extras because I am not including anything extra in the history that is intended for the user to inspect
instead of "completing" trial objects in results I added a trial history to state; the models method now begins by updating this trial history as follows:
- look at each entry in the new part of MLJ history, which includes a performance estimate measurement and a trial object
- complete a deep copy of the trial object by injecting measurement using tell!(not sure if a deep copy is really required - but this was the status quo)
- append the new completed trial objects to the existing trial history in state

The rest of models is the same as before, except it now needs to include the updated state in its return value because models! is now models.

Previously there was not separate trial history in state, but one need to create one every call to models using the MLJ history. It seems to me the new approach is equivalent to the old approach at worst and slightly more efficient at best, but I have not benchmarked this.

A difference to note is that the trial history is no-longer exposed to the user in the reported history. If desired, this could be remedied either by implemented extras (or by overloading report_history) but I'm not sure this is needed, as the trial objects are implentation-specific, yes?. The user still has access to the the MLJ representation of the model and corresponding performance estimates (and indeed for all specified measures - not just the first one, as currently implemented).

@yalwan-iqvia If you are happy with this I will merge the current PR, and later make a formal PR to TreeParzen.

yalwan-iqvia · 2020-09-14T07:06:02Z

@ablaom I've opened what you did as a PR agains treeparzen, a few comments there but nothing to do with the direction of the work. This looks fine to me.

whitespace

update API #40

68d3f4b

more changes typos docs: changes to the form of history and output of report typos more typos minor typos implement changes in docs - first attempt with passing tests add forgotten file typo rename the default selection heurisic; add weights to it

ablaom added breaking next minor release labels Sep 11, 2020

This was referenced Sep 11, 2020

For a 0.5 release #76

Closed

New API to accomodate different selection heuristics #74

Closed

ablaom mentioned this pull request Sep 13, 2020

Extend [compat] MLJTuning = "^0.3,^0.4" IQVIA-ML/TreeParzen.jl#44

Merged

ablaom added 3 commits September 14, 2020 15:02

models! -> models (make models non-mutating returning new state)

2fcf079

models! stub replaced with model stub

b3f688c

doc tweak

5b665c8

ablaom mentioned this pull request Sep 14, 2020

MLJTuning PR 75 (API Changes) IQVIA-ML/TreeParzen.jl#46

Merged

4 tasks

mv utility function signature() into utilities.jl; ditto test

52f0d2d

whitespace

ablaom merged commit 774cf17 into dev Sep 15, 2020

This was referenced Sep 15, 2020

For a 0.5 release #77

Merged

For a 0.15 release JuliaAI/MLJ.jl#659

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow user to specify a selection heuristic #75

Allow user to specify a selection heuristic #75

ablaom commented Sep 11, 2020 •

edited

ablaom commented Sep 11, 2020

yalwan-iqvia commented Sep 11, 2020

ablaom commented Sep 13, 2020

ablaom commented Sep 14, 2020 •

edited

yalwan-iqvia commented Sep 14, 2020

Allow user to specify a selection heuristic #75

Allow user to specify a selection heuristic #75

Conversation

ablaom commented Sep 11, 2020 • edited

ablaom commented Sep 11, 2020

yalwan-iqvia commented Sep 11, 2020

ablaom commented Sep 13, 2020

ablaom commented Sep 14, 2020 • edited

yalwan-iqvia commented Sep 14, 2020

ablaom commented Sep 11, 2020 •

edited

ablaom commented Sep 14, 2020 •

edited