Parallel Surrogate Models Through Ask-tell Interface #435

dreycenfoiles · 2023-06-11T02:09:06Z

As mentioned in #278, it would be nice to be able to be able to have surrogate models ask for multiple points at once to enable parallelized surrogate modeling. This PR introduces an Ask function that a user can supply a surrogate model and a set number of points. To maximize the expected improvement of a given model in parallel, temporary virtual values will have to be assigned sequentially for each parallel point. I have implemented 6 different methods of getting virtual points. These are

Minimum Constant Liar: virtual points are chosen based on the smallest known value of the provided surrogate
Mean Constant Liar: virtual points are chosen based on the smallest known value of the provided surrogate
Maximum Constant Liar: virtual points are chosen based on the smallest known value of the provided surrogate
Kriging Believer: virtual points are chosen using the value of the current Kriging model
Kriging Believer Upper bound: virtual points are chosen using the $3\sigma$ upper bound of the current Kriging model
Kriging Believer Lower bound: virtual points are chosen using the $3\sigma$ lower bound of the current Kriging model

More information about these virtual point methods can be found in the documentation for the surrogate modeling toolbox and this paper that Scikit Optimize references.

Below is an example of how Ask could be used. The function add_point, works as a "tell", so you can have an Ask-tell interface.

using Surrogates

lb = 0.0
ub = 10.0
f = x -> log(x) * exp(x)
x = sample(5, lb, ub, SobolSample())
y = f.(x)

my_k = Kriging(x, y, lb, ub)

for _ in 1:10
    new_x, eis = Ask(EI(), my_k, SobolSample(), 3, CLmean!)
    add_point!(my_k, new_x, f.(new_x))
end

In its current state, this PR add an Ask that only works for Kriging models with an EI acquisition function. I also have not added added documentation yet. I plan to add docs and I would be willing to get this to work for more surrogate model types of acquisition functions, I would just like to get a confirmation that this looks good so far and that I'm on the right track. If there are any suggestions for improvements or ways to other features to add to this PR, I would be happy to hear them.

Also mentioned in #278 was that some samplers won't work with parallel sampling. Like it was said there, I think it would make sense to add a type restriction to Ask so that only parallelizable samplers work. Is there any way I could add a subtype of the samplers in this repository or would that change have to be made in QuasiMonteCarlo.jl?

ChrisRackauckas · 2023-08-09T01:30:39Z

Hey, this is really cool and it's sad to see it just live as a draft. Is there an intention to complete this?

dreycenfoiles · 2023-08-09T01:46:36Z

Absolutely! I apologize that I haven't finished this; I honestly completely forgot about it. I'll probably have some time to do some more this weekend.

dreycenfoiles · 2023-09-17T03:13:52Z

Sorry for the delay, but I think the PR is finally ready to look over. Please let me know your thoughts if you have the time and I'd be happy to make the necessary changes.

docs/src/parallel.md

src/Optimization.jl

docs/src/parallel.md

src/VirtualStrategy.jl

src/Optimization.jl

ChrisRackauckas · 2023-09-17T21:05:37Z

src/Optimization.jl

+    dtol = 1e-3 * norm(ub - lb)
+    eps = 0.01
+
+    tmp_krig = deepcopy(krig) # Temporary copy of the kriging model to store virtual points


This can be pretty heavy. Did you check it did not cause a significant performance issue?

I did some tests and it seems like in all cases, deepcopy will never be anywhere close to your bottleneck. The bottleneck is always either evaluating your objective function or calculating the expected improvement or merit function.

ChrisRackauckas · 2023-09-17T21:07:15Z

Overall it looks good, but just needs a few bikeshedding tweaks in order to be ready to merge. We need to be careful that things specialize and exporting 2-3 letter words is generally not a good idea for something that's widely used, so we need to just change a few surface level things.

dreycenfoiles · 2023-09-20T03:56:51Z

I think I was able to address your concerns. Is there anything else you'd like me to change?

docs/src/parallel.md

ChrisRackauckas · 2023-09-21T13:37:04Z

docs/src/parallel.md

+my_k = Kriging(x, y, lb, ub)
+
+for _ in 1:10
+    new_x, eis = potential_optimal_points(EI(), lb, ub, my_k, SobolSample(), 3, CLmean!)


Suggested change

new_x, eis = potential_optimal_points(EI(), lb, ub, my_k, SobolSample(), 3, CLmean!)

new_x, eis = potential_optimal_points(EI(), lb, ub, my_k, SobolSample(), 3, MeanConstantLiar())

?

ChrisRackauckas · 2023-09-21T13:50:14Z

I think this looks great! It may need to wait for #441 to fix tests on master and rebase on top of that, but I think this is looking ready to merge if tests pass.

…tch on types

codecov · 2023-09-21T23:47:50Z

Codecov Report

Merging #435 (ea4eef2) into master (1ee0a5c) will decrease coverage by 8.11%.
The diff coverage is 0.00%.

@@            Coverage Diff             @@
##           master     #435      +/-   ##
==========================================
- Coverage   73.68%   65.58%   -8.11%     
==========================================
  Files          22       23       +1     
  Lines        2930     3115     +185     
==========================================
- Hits         2159     2043     -116     
- Misses        771     1072     +301

Files Changed	Coverage Δ
src/Optimization.jl	`63.50% <0.00%> (-17.81%)`	⬇️
src/Surrogates.jl	`21.73% <ø> (-17.40%)`	⬇️
src/VirtualStrategy.jl	`0.00% <0.00%> (ø)`

... and 2 files with indirect coverage changes

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

ChrisRackauckas · 2023-09-22T00:20:12Z

🎉

dreycenfoiles marked this pull request as ready for review September 17, 2023 03:12