
Feature request: Optimizing hyperparameters under unknown constraints #884

sethaxen opened this issue Oct 12, 2022 · 7 comments

@sethaxen

Description

The ability to optimize a univariate hyperparameter under an unknown/hidden constraint on that parameter, where invalid parameter values are identified by failed function evaluations.

This may be related to #403.

Specific application

For the simplest case I have, we can assume:

  • the hyperparameter $x$ is independent of all other hyperparameters
  • all valid values occur below some threshold $x_c$. All $x \ge x_c$ are invalid.
  • so once an invalid value is encountered, we never want to test a value larger than it.
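As a purely illustrative sketch (the threshold value 0.7 and the sine objective below are made up, not from any real model), a target with this structure could look like:

```python
import numpy as np

X_C = 0.7  # hidden threshold; the optimizer never sees this value directly


def target(x: float) -> float:
    """Toy objective: every evaluation with x >= X_C simply fails."""
    if x >= X_C:
        raise RuntimeError("evaluation failed: x is in the invalid region")
    return float(np.sin(5 * x) + 0.5 * x)  # arbitrary smooth objective
```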

An initial idea

The KI Campus MOOC on AutoML mentioned that the Expected Constrained Improvement acquisition function, given in Eq. 4 of https://engineering.ucsc.edu/sites/default/files/technical-reports/UCSC-SOE-10-10.pdf as
$$x' = \arg\max_{x \in \mathcal{X}} \mathbb{E}[I(x)] \, h(x),$$
is one approach that could work for something like this, where $I$ is an improvement statistic and $h(x)$ is the predicted probability that the parameter $x$ is valid, for which they use a random forest classifier.

I suppose a more general approach would allow a user to provide

  1. a function $h: \mathcal{X} \to [0, 1]$, which could be multiplied by any acquisition function
  2. a method to update $h$ given a new evaluation $x_k$.
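A minimal sketch of the interface implied by (1) and (2), with placeholder names that are not tied to SMAC's API, might be:

```python
from typing import Protocol

import numpy as np


class ValidityModel(Protocol):
    """Hypothetical interface for the two user-supplied pieces above."""

    def h(self, X: np.ndarray) -> np.ndarray:
        """Return P(x is valid) in [0, 1] for each row of X."""
        ...

    def update(self, x_k: np.ndarray, valid: bool) -> None:
        """Incorporate the success/failure outcome of a new evaluation x_k."""
        ...
```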
@sethaxen changed the title from "Optimizing hyperparameters under unknown constraints" to "Feature request: Optimizing hyperparameters under unknown constraints" on Oct 12, 2022
@sethaxen
Author

sethaxen commented Dec 9, 2022

@mlindauer we spoke about this briefly at the AutoML fall school, and you said it might be straightforward to add. If so, I could try to contribute this feature, but it would be helpful if someone familiar with the codebase could point me in the right direction.

@mlindauer
Contributor

Hi,
Thanks for pinging us again. We are focusing all the manpower on the next major SMAC release right now.
Nevertheless, here are some pointers:

  1. You need to build a binary classification dataset of successful vs. unsuccessful runs, extracted from the runhistory: https://github.com/automl/SMAC3/tree/main/smac/runhistory/encoder
  2. You need to feed that data to a model (https://github.com/automl/SMAC3/tree/main/smac/model), but doing probabilistic classification instead of regression. You can, e.g., slightly modify the RF for that, since it also supports classification and probabilistic predictions.
  3. Last but not least, you need to implement a new acquisition function that takes that into account (e.g., in the same way as you wrote above): https://github.com/automl/SMAC3/tree/main/smac/acquisition/function
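To make the three steps concrete, here is a stand-alone sketch with scikit-learn's random forest as a stand-in for a classification-mode SMAC RF; the data and all names are illustrative, not SMAC API:

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

# Step 1: binary dataset of successful (1) vs. unsuccessful (0) configurations,
# which in SMAC would be extracted via the runhistory encoder.
X_train = np.array([[0.10], [0.35], [0.55], [0.80], [0.95]])
y_train = np.array([1, 1, 1, 0, 0])

# Step 2: a probabilistic classifier predicting P(configuration is valid).
clf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X_train, y_train)


def h(X: np.ndarray) -> np.ndarray:
    """Predicted probability that each configuration is valid."""
    return clf.predict_proba(X)[:, 1]  # column 1 = the "successful" class


# Step 3: a new acquisition function would multiply an improvement statistic
# by h, e.g. ei_values * h(X_candidates).
```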

Does that help you?
I hope that either Rene (only available next week) or Difan can provide more pointers in case you need more detailed help.

Best,
Marius

PS: I point you directly to the new code base since it would be wasted effort to do it in the old code base now.

@sethaxen
Author

Thanks @mlindauer for the pointers! These were very helpful. I was able to put together an example that seems to work well on some toy target functions with unknown constraints: https://gist.github.com/sethaxen/331402e156537a933121133fe9965573 . Major thanks to @KEggensperger who helped me navigate SMAC.

I'm planning to deploy this shortly on an expensive model, so if someone has ideas for improvement, I'd love to hear them. Also, if you think this would be a useful example to include in the docs, I'm happy to contribute it.

@alexandertornede
Contributor

Thanks for posting your outcome, @sethaxen. We will have a look at it during our meeting next week!

@dengdifan
Contributor

dengdifan commented Feb 2, 2023

@sethaxen Thanks for the updates! It looks quite promising to me and we would like to integrate this as an example for our model. We would appreciate it if you could provide a PR for us (ideally for the development 2.0 branch); you can also come to us if you need any support with the codebase.
Just a small question: does this approach only work with the EI acquisition function?

@sethaxen
Author

sethaxen commented Feb 2, 2023

> We would appreciate it if you could provide a PR for us (ideally for the development 2.0 branch); you can also come to us if you need any support with the codebase.

Sure, I'd be happy to! This would mean adding a new section to https://github.com/automl/SMAC3/tree/development/examples, right?

> Just a small question: does this approach only work with the EI acquisition function?

I'm new to AutoML, so I can't say for certain. I came across examples where a classifier was incorporated into the acquisition function in more complicated ways (e.g. https://arxiv.org/abs/1004.4027), and the two papers I read that took the approach used here both used EI.

Naively, it seems intuitive that this could work with other acquisition functions, so I could generalize EIConstrained to a ConstraintWeightedAcquisition to which one would pass a standard acquisition function and a classifier.
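A rough, framework-agnostic sketch of what I mean (the attribute names and the scikit-learn-style predict_proba call are assumptions; a real version would subclass SMAC's acquisition-function base class):

```python
import numpy as np


class ConstraintWeightedAcquisition:
    """Hypothetical wrapper: weight any base acquisition by P(x is valid)."""

    def __init__(self, base_acquisition, classifier):
        self.base_acquisition = base_acquisition  # e.g. EI or PI, callable on X
        self.classifier = classifier              # probabilistic validity classifier

    def __call__(self, X: np.ndarray) -> np.ndarray:
        acq = self.base_acquisition(X)                    # shape (n,)
        p_valid = self.classifier.predict_proba(X)[:, 1]  # P(x is valid)
        return acq * p_valid
```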

@dengdifan
Contributor

> Sure, I'd be happy to! This would mean adding a new section to https://github.com/automl/SMAC3/tree/development/examples, right?

Thanks! Exactly, the interface is overall the same, so it should not be too hard to port it to the development branch.

> Naively, it seems intuitive that this could work with other acquisition functions,

I think this might only work for improvement-based acquisition functions (EI and PI). If you use LCB as the acquisition function, its value can become negative, and multiplying a negative value by the predicted validity probability might not work as expected.
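A purely illustrative pair of numbers shows why: with a negative acquisition value, a low validity probability pulls the product toward zero, so the weighting makes an unlikely-to-be-valid point look better rather than worse when maximizing:

```python
# Illustrative values only, not output from any real acquisition function.
acq_a, p_valid_a = -2.0, 0.9  # promising point, likely valid
acq_b, p_valid_b = -2.0, 0.1  # equally promising point, likely invalid

print(acq_a * p_valid_a)  # -1.8
print(acq_b * p_valid_b)  # -0.2  -> ranked higher when maximizing the product
```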
