In the slider images, have f(x) on the y-axis and remove the gold-content references. They make sense for the examples but not for the summary. There is then also no need to mention Ground Truth for .. Also, the x-axis label should be "x", not "X".
In the next slide, mention that this could be trivial if f(x) were cheap to evaluate, but it is often expensive to evaluate, e.g. the amount of gold at a particular location, or the accuracy for a set of hyper-parameters of a machine learning model.
In the slide you mention "f(x*) in fewest evaluations"; instead write: "Objective: find the maxima f(x*) in few evaluations, as sampling is expensive."
Remove the word "constraints" in the next slide, and just have the 2 enumerated points (the second one without the mention of ground truth).
In the next slide and other slides, remove the GT references from the legend as well.
In "Use a surrogate function ...", remove the comma after "prior".
Next slide: use "let us" instead of "let's", and write GP in full.
Next slide: write "functional observation (an observation from f(x))".
The Big Question: where to sample next to quickly find the maxima.
No comma after "One".
For the "Choose point that maxim.." text, write: "The next chosen point to observe is the one that maximises the probability of improvement over the current maximum."
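To make this probability-of-improvement criterion concrete, here is a minimal sketch of how PI could be computed under a Gaussian surrogate posterior; the xi trade-off parameter and the candidate (x, mu, sigma) values are illustrative assumptions, not taken from the article:

```python
import math

def norm_cdf(z):
    # Standard normal CDF via the error function (stdlib only)
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def probability_of_improvement(mu, sigma, f_best, xi=0.01):
    # PI(x) = P(f(x) > f_best + xi) when the surrogate posterior
    # at x is N(mu, sigma^2); xi trades off a little exploration
    if sigma == 0.0:
        return 0.0
    return norm_cdf((mu - f_best - xi) / sigma)

# Hypothetical candidates as (x, posterior mean, posterior std)
candidates = [(0.2, 1.0, 0.5), (0.7, 1.2, 0.1), (0.9, 0.8, 0.9)]
x_next = max(candidates,
             key=lambda c: probability_of_improvement(c[1], c[2], f_best=1.0))[0]
```

The chosen point is the one whose posterior places the most mass above the incumbent maximum, which is exactly the criterion the slide describes.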
In the main article, when you introduce Gaussian Processes, write "Gaussian Processes (GPs)".
In the active learning procedure, replace "automate" with "simulate".
Old text
Given the fact that we are only interested in knowing the location where the maximum occurs. It might be a good idea to evaluate at locations where our surrogate model's prediction mean is the highest, i.e. to exploit. But unfortunately, our mean is not always accurate, so we need to correct our mean which can be done by reducing variance or exploration. BO looks at both exploitation and exploration, whereas in the case of Active Learning Problem, we only cared about exploration.
New text
Given the fact that we are only interested in knowing the location where the maximum occurs, it might be a good idea to evaluate at locations where our surrogate model's prediction mean is the highest, i.e. to exploit. But unfortunately, our model mean is not always accurate (since we have limited observations), so we need to correct our model, which can be done by reducing variance or exploration. BO looks at both exploitation and exploration, whereas in the case of active learning, we only cared about exploration.
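To make the exploit/explore split in this new text concrete, a tiny sketch of one standard way to combine the two; the UCB form and the kappa weight are illustrative assumptions, not the article's chosen acquisition:

```python
def ucb(mu, sigma, kappa=2.0):
    # mu exploits the surrogate's predicted mean;
    # kappa * sigma rewards uncertain regions (exploration)
    return mu + kappa * sigma

# Same mean, higher uncertainty scores higher (explore) ...
explore_pair = (ucb(1.0, 0.5), ucb(1.0, 0.1))
# ... same uncertainty, higher mean scores higher (exploit)
exploit_pair = (ucb(1.5, 0.3), ucb(1.0, 0.3))
```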
Acquisition Functions
Text should be:
We just discussed that our original optimisation problem (equation) is hard given the expensive nature of evaluating f. The key idea of BO is to transform this original difficult optimisation into a sequence of easier, inexpensive optimisations of an acquisition function (alpha(x)). Each of these easier optimisations involves finding the next point to sample. Thus, we can interpret the acquisition function as commensurate with how desirable evaluating f at x is expected to be for the maximisation problem [CITE: https://www.cse.wustl.edu/~garnett/cse515t/spring_2015/files/lecture_notes/12.pdf]
While we have just discussed that our goal is to transform the original optimisation into a sequence of easier optimisations, where is the "Bayesian" in this optimisation, and how is the acquisition function related? Let us rewind to our surrogate model and build the link between everything we have discussed thus far by noting the steps of BO [CITE: https://www.youtube.com/watch?list=PLZ_xn3EIbxZHoq8A3-2F4_rLyy61vkEpU&v=EnXxO3BAgYk]:
Choose a surrogate model and its prior over the space of objectives f
Given the set of observations (function sampling), use Bayes' rule to obtain the posterior
Use an acquisition function (alpha(x)), which is a function of the posterior to decide where to sample next (x_t = argmax()..)
Add the newly sampled data to the set of observations and go to Step TODO Master issue #2 until convergence or the budget elapses
We now have three core ideas associated with acquisition functions: i) they are a function of the surrogate posterior; ii) they combine exploration and exploitation; and iii) they are inexpensive to evaluate. Let us now look into a few examples of commonly used acquisition functions to understand the concept better.
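The four steps above could be sketched end to end as follows; the toy objective, the RBF length-scale, and the UCB acquisition are assumptions made for illustration, not the article's exact setup:

```python
import numpy as np

def rbf(a, b, ls=0.3):
    # Squared-exponential kernel between 1-D point sets
    return np.exp(-0.5 * ((a[:, None] - b[None, :]) / ls) ** 2)

def gp_posterior(X, y, Xs, noise=1e-4):
    # Step 2: standard zero-mean GP regression posterior at Xs
    K_inv = np.linalg.inv(rbf(X, X) + noise * np.eye(len(X)))
    Ks = rbf(X, Xs)
    mu = Ks.T @ K_inv @ y
    var = np.diag(rbf(Xs, Xs) - Ks.T @ K_inv @ Ks)
    return mu, np.sqrt(np.clip(var, 0.0, None))

def f(x):
    # Toy "expensive" objective standing in for e.g. gold content
    return np.sin(3 * x) + 0.5 * x

X = np.array([0.1, 0.9])            # Step 1: initial observations
y = f(X)
grid = np.linspace(0.0, 1.0, 201)   # candidate locations
for _ in range(10):
    mu, sigma = gp_posterior(X, y, grid)
    alpha = mu + 2.0 * sigma        # Step 3: cheap UCB acquisition
    x_next = grid[np.argmax(alpha)]
    X = np.append(X, x_next)        # Step 4: add sample, repeat
    y = np.append(y, f(x_next))

x_best = X[np.argmax(y)]            # best location found so far
```

Each iteration optimises the cheap alpha over the grid instead of f itself, which is the transformation of one hard optimisation into a sequence of easy ones described above.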
Remove the following text
Let us understand this concept in two cases:
We have two points of similar means (of function values (gold in our case)). We now want to choose one of these to obtain the labels or values. We will choose the one with higher variance. This basically says that given the same exploitability, we choose the one with higher exploration value.
We have two points having the same variance. We would now choose the point with the higher mean. This basically says that given the same explorability, we will choose the one with higher exploitation value.
Remove the text "hero plot" and instead write "below plot".
In "Intuition behind E", change "spread out sigma" to the symbol sigma.
Everywhere except the title, change "Active Learning" to "active learning".
In the SVM example, remove the GIFs for random and GP-UCB. Also, mention the optimal <C, gamma> found via grid search and via EI and PI.
dimentions --> dimensions
"Next" and "Previous" are not very clearly visible in the hero slides; use https://www.jssor.com/demos/image-slider.slider