Local-search based technique to optimize the acquisition function of trees and friends #74
Comments
They also say "we plan to investigate better mechanisms in the future". Might be a good place for us to investigate as well :P |
Thanks for digging this up. This does look very hackish indeed and it would definitely be nice to find a better solution for that. |
Should we implement this first as a baseline? |
Having re-read the SMAC paper, I think we should stick with random sampling for now; their procedure seems quite convoluted. They combine the results of 10 local searches with 10000 random samples, and it is not terribly clear to me that this does better than simple random sampling. Do you understand the last paragraph in section 4.3? |
Hmm.. They claim, however, in the paragraph before the last that the ten local searches always outperform the random samples. So it might be worth a try to just do the local searches independent of the random sampling. I have it in my branch here (https://github.com/MechCoder/scikit-optimize/tree/paramils) and will send a PR once I have verified that it works. |
What I couldn't work out was: if the ten local searches "always" outperform the 10000 random samples, why do they keep drawing the random samples at all? The other thing I find confusing is the statement about interleaving random samples for training purposes and unbiasedness. Looking forward to a PR; then we can benchmark things and compare how much we gain from using a smarter/more complicated method. |
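For what it's worth, a minimal sketch of the selection step being discussed, as I understand it: SMAC proposes the best point found by either the local searches or the random batch, and the random samples also feed the model, which may be where the training/unbiasedness argument comes in. All names here are made up for illustration, and the acquisition function is written as something to minimize:

```python
import numpy as np

def propose_next(acq, local_optima, random_samples):
    """Pick the candidate with the lowest acquisition value from the
    union of local-search results and random samples (hypothetical)."""
    candidates = list(local_optima) + list(random_samples)
    values = [acq(x) for x in candidates]
    return candidates[int(np.argmin(values))]
```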
OK, I now have a working version, but it fails for 2 of the 6 minimizers. I shall verify tomorrow and let you know whether the method performs worse or it is a bug in the code. (I hope it is the latter) |
Great! Could you make a WIP PR? I am curious to see how you do that :) |
done :-) |
We cannot optimize the acquisition function of tree-based models using conventional gradient / second-order methods. SMAC does it in the following way, described on page 13 of http://www.cs.ubc.ca/~hutter/papers/10-TR-SMAC.pdf
Some terminology: given `p` parameters and a parameter configuration, a one-exchange neighbourhood is the set of configurations that differ from it in exactly one parameter.

- For a parameter (`X`) that is continuous, a neighbour is sampled from a Gaussian centered at `X` with std 0.2, keeping all other parameters constant.
- For a parameter (`Y`) that is categorical, a neighbour takes any other category of `Y`, keeping all other parameters constant.

Seems like they do a multi-start local search with 10 points. For each local search:

- Compute the acquisition values of the one-exchange neighbours of the current point `p`.
- If `p` has a lower acquisition value than all of its neighbours, then terminate.
- Else, reassign `p` to the neighbour with minimum acquisition value.

Then return the minimum of all the 10 local searches.
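A minimal Python sketch of this procedure, assuming a simple list-of-tuples encoding of the search space and an acquisition function `acq(x)` to be minimized. None of these names are skopt's or SMAC's actual API; the std-0.2 Gaussian follows the paper's convention (as I read it) of sampling in the parameter's normalised [0, 1] range:

```python
import numpy as np

# space: list of ("continuous", (low, high)) or ("categorical", values)
# pairs -- a made-up encoding for illustration only.

def one_exchange_neighbours(x, space, rng):
    """Configurations differing from x in exactly one parameter."""
    neighbours = []
    for i, (kind, spec) in enumerate(space):
        if kind == "continuous":
            low, high = spec
            # Sample a Gaussian step with std 0.2 in the parameter's
            # normalised [0, 1] range, then map back to (low, high).
            z = (x[i] - low) / (high - low) + rng.normal(0.0, 0.2)
            neighbour = list(x)
            neighbour[i] = low + np.clip(z, 0.0, 1.0) * (high - low)
            neighbours.append(neighbour)
        else:
            # Categorical: every other category is a neighbour.
            for value in spec:
                if value != x[i]:
                    neighbour = list(x)
                    neighbour[i] = value
                    neighbours.append(neighbour)
    return neighbours

def local_search(acq, x, space, rng, max_iter=100):
    """Greedy descent over one-exchange neighbourhoods."""
    best, best_val = list(x), acq(x)
    for _ in range(max_iter):
        neighbours = one_exchange_neighbours(best, space, rng)
        values = [acq(n) for n in neighbours]
        i = int(np.argmin(values))
        if values[i] >= best_val:
            break  # no neighbour improves: (approximate) local minimum
        best, best_val = neighbours[i], values[i]
    return best, best_val

def multi_start_local_search(acq, starts, space, seed=0):
    """Run a local search from each start; return the overall minimum."""
    rng = np.random.default_rng(seed)
    return min((local_search(acq, x, space, rng) for x in starts),
               key=lambda r: r[1])
```

With `starts` being the 10 points SMAC uses, the proposal step would then take the minimum over these local-search results and the batch of random samples, as in the snippet further up the thread.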