
GP bug for noise_learn = false underprediction of uncertainty #143

Merged (3 commits) Jun 1, 2022

Conversation

@odunbar (Collaborator) commented May 28, 2022

Purpose

Fix the bug where noise_learn = false greatly underpredicts uncertainty. In the Lorenz example, the results with noise_learn = false and noise_learn = true should be similar.

Co-authored with @lm2612.

In this PR

  • Initial bugfix to correct the regularization noise vs. white kernel
  • Tightened runtests, thanks to improved prediction
  • New alg_reg_noise optional argument to GP, to set the regularization when noise_learn = true (removing the magic_number)
  • Lorenz example, with the SKLJL() option
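A hypothetical sketch of the new keyword (the constructor name, the keyword names, and gppackage are assumptions for illustration; the package's actual signature may differ):

```julia
# Hypothetical sketch -- not the package's confirmed API.
gppackage = GPJL()   # assumed package selector, as in the Lorenz example

# noise_learn = true: a white-noise kernel is learned from the data, and only
# a small fixed regularization (the new alg_reg_noise argument, replacing the
# previous hard-coded magic_number) is added for numerical stability:
gp = GaussianProcess(gppackage; noise_learn = true, alg_reg_noise = 1e-3)

# noise_learn = false: no white kernel is learned; the observational noise
# itself provides the regularization:
gp = GaussianProcess(gppackage; noise_learn = false)
```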

codecov bot commented May 28, 2022

Codecov Report

Merging #143 (48598e3) into master (ad38cb5) will increase coverage by 0.55%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master     #143      +/-   ##
==========================================
+ Coverage   88.15%   88.71%   +0.55%     
==========================================
  Files           4        4              
  Lines         380      381       +1     
==========================================
+ Hits          335      338       +3     
+ Misses         45       43       -2     
Impacted Files           Coverage           Δ
src/GaussianProcess.jl   93.00% <100.00%>   (+2.09%) ⬆️


@odunbar odunbar requested a review from lm2612 May 28, 2022 00:32
@odunbar (Collaborator, Author) commented May 28, 2022

bors try

bors bot added a commit that referenced this pull request May 28, 2022
bors bot commented May 28, 2022

@lm2612 (Collaborator) commented Jun 1, 2022

Looks good with SKLJL.
When I test with GPJL and noise_learn = false, I'm finding the predicted covariance matrix y_var to now be around 2× Γy. Is it possible that this noise was already included for GPJL?

@odunbar (Collaborator, Author) commented Jun 1, 2022

The predicted covariance (when returned with transform_to_real=true) will be the sum of two contributions:

  1. one from the GP kernel approximation (e.g. the fact that we use an RBF to approximate something), and
  2. one from the observational noise.

So in theory y_var must be >= the observational noise; the amount by which it is bigger is due to the error in the GP approximation.

Does it return something similar with noise_learn = true? That setting fixes contribution 2 to the observational noise and only learns contribution 1. If the two are similar, it would indicate that the GP is just not a great approximation; if they are not similar, then maybe there is an issue (note they are unlikely to be exactly the same).
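A minimal numerical sketch of this decomposition (the values are illustrative, not from the Lorenz example), including why double-counting the noise would produce the observed ~2× Γy:

```julia
kernel_var = 0.05   # contribution 1: GP (e.g. RBF) approximation error
obs_noise  = 1.0    # contribution 2: observational noise (a diagonal entry of Γy)

# Correct prediction: the two contributions add, so y_var >= obs_noise.
y_var = kernel_var + obs_noise

# The bug: if Γy is already folded into the training regularization and the
# prediction adds the noise back in as well, it is counted twice:
y_var_buggy = kernel_var + 2 * obs_noise   # roughly 2x Γy
```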

I did try to do something like this in the unit tests (see the added file).

@odunbar (Collaborator, Author) commented Jun 1, 2022

Ignore what I said: in the GaussianProcesses.jl source code I found the following, where they add the logNoise (i.e. the regularization noise) back in during prediction:

function predict_y(gp::GPE, x::AbstractMatrix; full_cov::Bool=false)
    μ, σ2 = predict_f(gp, x; full_cov=full_cov)
    if full_cov
        npred = size(x, 2)
        return μ, σ2 + ScalMat(npred, exp(2*gp.logNoise))
    else
        return μ, σ2 .+ exp(2*gp.logNoise)
    end
end

Thanks! I will update accordingly.
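Given the predict_y behaviour above, the double counting can be illustrated with hypothetical numbers (none of these values come from the actual example):

```julia
σ2_obs   = 1.0                    # observational noise variance
logNoise = 0.5 * log(σ2_obs)      # GPJL stores the noise as logNoise,
                                  # so exp(2 * logNoise) == σ2_obs

# If Γy was also folded into the training regularization, predict_f already
# reflects it (0.05 stands in for the kernel contribution):
σ2_f = 0.05 + σ2_obs

# predict_y then adds exp(2 * logNoise) on top, double counting the noise:
σ2_y = σ2_f + exp(2 * logNoise)   # about 0.05 + 2 * σ2_obs, i.e. ~2x Γy
```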

@lm2612 (Collaborator) commented Jun 1, 2022

Ok great! I forgot to say that in my previous comment I was referring to the Lorenz example.

@odunbar (Collaborator, Author) commented Jun 1, 2022

OK:

  • I've moved the magic_number to be an input argument for GP.
  • I've tightened up the new unit tests, so predicted means and variances with and without noise_learn now have to be within 5% of each other (it was 50% before, hence why the bug was missed!).
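The tightened check can be sketched as follows (a self-contained illustration with dummy prediction values; the real test compares the two emulators' outputs):

```julia
using Test

# Dummy stand-ins for predictions with noise_learn = true / false:
μ_learn,   σ2_learn   = [1.00, 2.00], [0.50, 0.80]
μ_nolearn, σ2_nolearn = [1.02, 1.97], [0.51, 0.79]

# Means and variances must now agree to within 5% (previously 50%):
@test all(isapprox.(μ_learn, μ_nolearn; rtol = 0.05))
@test all(isapprox.(σ2_learn, σ2_nolearn; rtol = 0.05))
```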

@lm2612 (Collaborator) left a comment

Great, I get virtually the same results for both GPJL and SKLJL now, thanks!

@odunbar (Collaborator, Author) commented Jun 1, 2022

bors r+

bors bot commented Jun 1, 2022

@bors bors bot merged commit a073451 into master Jun 1, 2022
@bors bors bot deleted the orad/bugfix-gp-reg branch June 1, 2022 19:52