Fix predict with confidence intervals #253

nalimilan · 2018-09-11T11:11:42Z

Use keyword arguments, which are passed through by the DataFrameRegressionModel method.

While we're at it, I wonder whether we should change the API a bit. interval=:confint and interval=:predint could be changed to interval=:confidence and interval=:prediction, which are the names used by R (since "int" is redundant with interval).

Cc: @mkborregaard

codecov-io · 2018-09-11T11:27:34Z

Codecov Report

Merging #253 into master will decrease coverage by 0.27%.
The diff coverage is 28.57%.

@@            Coverage Diff            @@
##           master    #253      +/-   ##
=========================================
- Coverage   51.97%   51.7%   -0.28%     
=========================================
  Files           6       6              
  Lines         583     588       +5     
=========================================
+ Hits          303     304       +1     
- Misses        280     284       +4

Impacted Files	Coverage Δ
src/lm.jl	`49.46% <28.57%> (-1.68%)`	⬇️

Last update 4966a05...c2a5bd2.

mkborregaard · 2018-09-11T11:58:47Z

Wonderful. I also agree wholeheartedly with the name change.

mkborregaard · 2018-09-11T11:59:27Z

Does this mean this will work with DataFrames now? There was a PR on DataFrames I never got merged...

nalimilan · 2018-09-11T12:05:36Z

> Does this mean this will work with DataFrames now? There was a PR on DataFrames I never got merged...

Do you have a link? It must have worked at some point, since it's in the examples, but I wonder since when it's been broken.

mkborregaard · 2018-09-11T12:16:49Z

JuliaData/DataFrames.jl#1160

I don't think it's ever worked from DataFrames...

mkborregaard · 2018-09-11T12:17:35Z

I'll rebase and write the test if that's still doable, but I'm guessing the interface has changed a lot?

nalimilan · 2018-09-11T12:29:40Z

Thanks. Actually I now realize I hadn't tested the example based on DataFrames correctly. The test in runtests.jl only covers matrices. So this PR shouldn't be merged without fixing DataFrameRegressionModel first.

The code now lives in StatsModels.jl. If we make interval-related arguments keyword arguments rather than positional, I think we can avoid defining a separate function as in JuliaData/DataFrames.jl#1160. Instead, you can just check whether yp is a vector or a matrix, and slightly adapt the code to suit each possibility.
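A minimal sketch of the approach described above: a wrapper method forwards keyword arguments untouched and then branches on whether `yp` is a vector or a matrix. Everything here is a toy stand-in: `ToyModel`, `ToyWrapper`, `toy_predict`, the `keep` row mask, and the constant half-width are hypothetical names and placeholders for illustration, not the StatsModels or GLM code. Padding dropped rows with `missing` is also an assumption about what "adapting the code" might involve.

```julia
# Toy stand-ins, NOT the StatsModels or GLM types.
struct ToyModel
    coef::Vector{Float64}
end

struct ToyWrapper   # plays the role of DataFrameRegressionModel in this sketch
    model::ToyModel
end

# Inner method: a Vector without intervals, a three-column Matrix with them.
function toy_predict(m::ToyModel, X::Matrix{Float64};
                     interval::Union{Symbol,Nothing}=nothing)
    yhat = X * m.coef
    interval === nothing && return yhat
    halfwidth = 0.5   # placeholder, not a real interval computation
    return hcat(yhat .- halfwidth, yhat, yhat .+ halfwidth)
end

# Wrapper method: forwards keywords as-is, then branches on the shape of `yp`.
# `keep` marks the rows that are actually passed to the inner model (assumption).
function toy_predict(w::ToyWrapper, X::Matrix{Float64},
                     keep::AbstractVector{Bool}; kwargs...)
    yp = toy_predict(w.model, X[keep, :]; kwargs...)
    n = length(keep)
    if yp isa AbstractVector
        out = Vector{Union{Float64,Missing}}(missing, n)
        out[keep] = yp
    else
        out = Matrix{Union{Float64,Missing}}(missing, n, size(yp, 2))
        out[keep, :] = yp
    end
    return out
end

w = ToyWrapper(ToyModel([1.0, 2.0]))
X = [1.0 0.0; 1.0 1.0; 1.0 2.0]
keep = [true, false, true]

point = toy_predict(w, X, keep)                        # padded Vector
bands = toy_predict(w, X, keep; interval=:confidence)  # padded three-column Matrix
```

Because the wrapper takes `kwargs...`, it needs no separate method for the interval case; only the output allocation differs between the two shapes.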

mkborregaard · 2018-09-11T13:51:34Z

Here? https://github.com/JuliaStats/StatsModels.jl/blob/33f7b9cceac7e6624bf9e4802df3c4c8b2f5b83d/src/statsmodel.jl#L100

mkborregaard · 2018-09-11T13:52:46Z

Or here in GLM?

nalimilan · 2018-09-11T13:56:43Z

Yes, in StatsModels. GLM should be ready with this PR.

mkborregaard · 2018-09-23T13:36:21Z

By the way @nalimilan, I just remembered that the reason I implemented it the way I did, and not with a keyword as in this PR, is that the keyword version breaks type stability of predict. With confidence intervals it returns a three-column Matrix; without them it returns a Vector. Making the interval an optional positional argument allows the compiler to figure out the return type from the method signature.

nalimilan · 2018-09-23T15:01:16Z

AFAIK type-stability isn't a problem anymore with keyword arguments:

julia> f(; x=nothing) = x === nothing ? 1 : 1.0
f (generic function with 1 method)

julia> @code_warntype f()
Body::Int64
[...]

julia> @code_warntype f(x=nothing)
Body::Int64
[...]

julia> @code_warntype f(x=1)
Body::Float64
[...]
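The same check can be sketched for a predict-like function whose return shape depends on the keyword. `sketch_predict` and its `halfwidth` argument are hypothetical and only mimic the Vector-versus-Matrix behaviour discussed in this thread. The example assumes a Julia version in which keyword argument types take part in inference, as in the REPL output above. `@inferred` from the `Test` standard library throws if the inferred return type does not match the actual one.

```julia
using Test  # standard library; provides @inferred

# Hypothetical stand-in for predict: a Vector without intervals,
# a three-column Matrix [lower prediction upper] with them.
function sketch_predict(yhat::Vector{Float64};
                        interval::Union{Symbol,Nothing}=nothing,
                        halfwidth::Float64=1.0)
    interval === nothing && return yhat
    interval in (:confidence, :prediction) ||
        throw(ArgumentError("unknown interval: $interval"))
    return hcat(yhat .- halfwidth, yhat, yhat .+ halfwidth)
end

yhat = [1.0, 2.0, 3.0]

# Each call site fixes the type of `interval` (Nothing or Symbol),
# so each call can be inferred to a single concrete return type.
v = @inferred sketch_predict(yhat)
m = @inferred sketch_predict(yhat; interval=:confidence)
```

The branch depends on the type of `interval` (`Nothing` versus `Symbol`), not on the symbol's value, which is why both interval names can share the Matrix return type.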

mkborregaard · 2018-09-23T17:58:05Z

sweet 🎉

nalimilan · 2018-11-10T18:54:51Z

I've pushed a commit to change the names. Is this good to go? IIRC a separate change is needed in StatsModels to support DataFrames, but that's already broken currently.

mkborregaard · 2018-11-11T10:50:28Z

Yes, I'll have a look at StatsModels some time during the week.

mkborregaard · 2018-11-11T18:59:35Z

@nalimilan I can't see that the name has been changed? It still says :confint but I agree that :confidence makes more sense.

Here's the StatsModels PR:
JuliaStats/StatsModels.jl#77

I have a question: we currently return a three-column Matrix as [lower prediction upper]. Would it make more sense, or be more useful, to return a Tuple, e.g. (prediction, (lower, upper)), or maybe a NamedTuple? That would allow things like pred, int = predict(mymodel, newdata, interval = :confidence); plot(pred, ribbon = int) etc.

mkborregaard · 2018-11-11T19:24:50Z

Oh, I see we already discussed the tuple thing once: #171 (comment)
No reason to flog a dead horse :-)

nalimilan · 2018-11-11T20:25:11Z

Whoops, I had just forgotten to push the new commits.

mkborregaard · 2018-11-12T11:56:56Z

I've added prediction intervals in a PR against this branch

nalimilan force-pushed the nl/predict branch from f1a0669 to 49fe437 Compare September 11, 2018 11:19

JuliaStats deleted a comment from codecov-io Sep 11, 2018

nalimilan added 3 commits November 10, 2018 17:49

Fix predict with confidence intervals

2149af2

Use keyword arguments, which are passed through by the DataFrameRegressionModel method.

Add test for predicted values

2e1fef4

Rename :confint to :confidence

c2a5bd2

mkborregaard mentioned this pull request Nov 11, 2018

predict_confidence JuliaStats/StatsModels.jl#77

Merged

nalimilan force-pushed the nl/predict branch from d7cc9c2 to c2a5bd2 Compare November 11, 2018 20:23

nalimilan merged commit d19c2f8 into master Nov 13, 2018

nalimilan deleted the nl/predict branch January 16, 2019 14:45
