integrate changes from StatsModels 0.7 (upcoming release) #664

Merged: 12 commits into main from dfk/statsmodels-007 (Apr 10, 2023)

Conversation

@kleinschmidt (Member) commented on Jan 25, 2023

WIP. The major thing that needs to be addressed is how FunctionTerms are represented (JuliaStats/StatsModels.jl#183).
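For reference, a minimal sketch of the new representation (field names assume the layout proposed in JuliaStats/StatsModels.jl#183, where a FunctionTerm carries the called function in f and the parsed argument terms in args):

using StatsModels

f = @formula(y ~ log(x))
ft = f.rhs    # a FunctionTerm wrapping the call log(x)
ft.f          # the function itself: log
ft.args       # the parsed argument terms, here just [x]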

Don't merge until

closes #672

@kleinschmidt (Member, Author) commented:

I've got most of the easy stuff sorted out now. The remaining issue is how implicit intercept/full-rank promotion is handled in the ranef terms. Our decision to *just* copy/paste the intercept handling stuff from StatsModels is coming back to bite us now: the lhs of a ranef term like (0 + f | g) is still a FunctionTerm{+} when we see it in apply_schema(::RanefTerm), so we can't do the normal "has intercept" checks. The mechanism that promotes it to a TupleTerm is... apply_schema, so we're in a kind of chicken-and-egg situation: we need to handle the implicit intercept behavior before applying the schema to the rest, but we need a TupleTerm before we can handle the intercept behavior without some seriously bad and ugly hacks.
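To make that concrete, a minimal sketch (again assuming the 0.7 FunctionTerm layout, where the parsed arguments live in args):

using StatsModels

f = @formula(y ~ (0 + x | g))
ranef = f.rhs          # FunctionTerm{typeof(|)}
lhs = ranef.args[1]    # still FunctionTerm{typeof(+)}, not a TupleTerm
lhs isa FunctionTerm   # true, so the usual "has intercept" checks can't run yet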

My current best idea is to use some kind of wrapper context like WithIntercept that we can use for dispatch. Then apply_schema with this context will carry it through until it gets to a TupleTerm, at which point it'll do the intercept detection/correction behavior and then continue with the original context. But I need to play around a bit more to see whether that's workable and a good idea. If it is, we can consider upstreaming it to StatsModels.
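Very roughly, what I have in mind (a hypothetical sketch; WithIntercept and this method are illustrative, not actual StatsModels/MixedModels API):

using StatsModels
using StatsModels: Schema, TupleTerm

# wrapper context carried through apply_schema; other apply_schema methods
# would simply forward the wrapper until the lhs becomes a tuple of terms
struct WithIntercept{Ctx} end

function StatsModels.apply_schema(ts::TupleTerm, sch::Schema,
                                  ::Type{WithIntercept{Ctx}}) where {Ctx}
    # now we can do the implicit-intercept detection/correction...
    if !StatsModels.hasintercept(ts) && !StatsModels.omitsintercept(ts)
        ts = (InterceptTerm{true}(), ts...)
    end
    # ...and resume under the original context
    return apply_schema(ts, sch, Ctx)
end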

The other (hackier) option would be to use a "dummy schema" like Schema(Dict(t => t for t in terms(lhs))) to "unprotect" the lhs; then we can proceed with the intercept wrangling as usual.
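Sketched out (hypothetical; lhs stands for the protected left-hand side of a ranef term, and I'm assuming the two-argument apply_schema falls back to the default context):

using StatsModels
using StatsModels: Schema

# identity "dummy schema": map each concrete term in the lhs to itself
dummy = Schema(Dict(t => t for t in terms(lhs)))
lhs = apply_schema(lhs, dummy)  # intended to "unprotect" lhs into a TupleTerm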

codecov bot commented on Mar 14, 2023

Codecov Report

Patch coverage: 100.00% and no project coverage change.

Comparison is base (658aea1) 96.27% compared to head (03ce750) 96.28%.

❗ Current head 03ce750 differs from the pull request's most recent head ced6def. Consider uploading reports for commit ced6def to get more accurate results.

Additional details and impacted files
@@           Coverage Diff           @@
##             main     #664   +/-   ##
=======================================
  Coverage   96.27%   96.28%           
=======================================
  Files          29       29           
  Lines        2740     2747    +7     
=======================================
+ Hits         2638     2645    +7     
  Misses        102      102           
Impacted Files             Coverage Δ
src/randomeffectsterm.jl   96.55% <100.00%> (+0.30%) ⬆️


☔ View full report in Codecov by Sentry.

@kleinschmidt kleinschmidt marked this pull request as ready for review March 15, 2023 14:04
@kleinschmidt (Member, Author) commented:

This is ready for review, with the caveat that I have no idea why the documenter build failed; I haven't looked into it beyond checking the logs.

@palday (Member) commented on Mar 16, 2023

@kleinschmidt This is what's failing (it's a nonsense model, but it was chosen to show off the formula syntax on something quick to fit):

using MixedModels
sleepstudy = MixedModels.dataset(:sleepstudy)
fit(MixedModel, @formula(reaction ~ 1 + days + (1|subj) + zerocorr(days|subj)), sleepstudy,
    contrasts = Dict(:days => DummyCoding()))

@kleinschmidt (Member, Author) commented:

> @kleinschmidt This is what's failing (it's a nonsense model, but it was chosen to show off the formula syntax on something quick to fit):
>
> using MixedModels
> sleepstudy = MixedModels.dataset(:sleepstudy)
> fit(MixedModel, @formula(reaction ~ 1 + days + (1|subj) + zerocorr(days|subj)), sleepstudy,
>     contrasts = Dict(:days => DummyCoding()))

Hmmmmmmm, that makes me think there may be an actual error here: it looks like (Intercept) is showing up twice when it should only appear once...

@kleinschmidt (Member, Author) commented:

AH I think I know what's happened here 😞

the hacky FunctionTerm{typeof(|)}-has-infinite-degree thing is causing the "bare" ranef to get sorted last:

julia> ff = @formula(reaction ~ 1 + days + (1|subj) + zerocorr(days|subj))
FormulaTerm
Response:
  reaction(unknown)
Predictors:
  1
  days(unknown)
  (days,subj)->zerocorr(days | subj)
  (subj)->1 | subj

I think we can work around this by adding a method for degree(::FunctionTerm{typeof(zerocorr)}) buuuut I don't like this precedent...
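Sketched out (hypothetical, and assuming the existing hack gives FunctionTerm{typeof(|)} effectively infinite degree via something like typemax(Int)):

using StatsModels, MixedModels

# mirror the infinite-degree hack for `|` so a zerocorr'd ranef term
# sorts the same way as a bare one
StatsModels.degree(::FunctionTerm{typeof(zerocorr)}) = typemax(Int)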

@palday palday merged commit c68bb4b into main Apr 10, 2023
7 checks passed
@palday palday deleted the dfk/statsmodels-007 branch April 10, 2023 17:30