
add _fit_auto_regression functions #139

Merged
merged 8 commits into from Apr 7, 2022

Conversation

@mathause (Member) commented Mar 31, 2022

  • Tests added
  • Passes isort . && black . && flake8
  • Fully documented, including CHANGELOG.rst

Adds _fit_auto_regression_xr and _fit_auto_regression_np - thin wrappers around statsmodels.tsa.ar_model.AutoReg.

In contrast to the linear regression I have:

  • reworked the internal code by creating a dummy DataArray - I think that simplifies the code and looks pretty neat
  • not written any 'numerical' tests - it only checks the shape etc. of the results

I am not sure we can follow exactly the same pattern as for the LinearRegression class because the coeffs are averaged over the scens. It would be nice to have the same pattern for both but I am not sure which will have to give...

@yquilcaille @znicholls

params_gv["AR_order_sel"] = AR_order_sel
params_gv["AR_std_innovs"] = 0

res = list()
mathause (Member, Author):

Rename to result? res might stand for residuals.

Collaborator:

Yep, although if the function is small enough it will be clear from the context.

Resolved review threads (now outdated) on:

  • mesmer/core/auto_regression.py
  • mesmer/calibrate_mesmer/train_gv.py
  • tests/integration/test_auto_regression.py (8 threads)
@codecov-commenter commented Apr 1, 2022

Codecov Report

Merging #139 (7d286f8) into master (71a9303) will increase coverage by 0.15%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master     #139      +/-   ##
==========================================
+ Coverage   79.43%   79.59%   +0.15%     
==========================================
  Files          29       30       +1     
  Lines        1405     1416      +11     
==========================================
+ Hits         1116     1127      +11     
  Misses        289      289              
Flag Coverage Δ
unittests 79.59% <100.00%> (+0.15%) ⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files Coverage Δ
mesmer/calibrate_mesmer/train_gv.py 84.72% <100.00%> (-0.62%) ⬇️
mesmer/calibrate_mesmer/train_lv.py 81.66% <100.00%> (-0.60%) ⬇️
mesmer/core/auto_regression.py 100.00% <100.00%> (ø)
mesmer/core/utils.py 100.00% <100.00%> (ø)

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

)

# TODO: are the names appropriate?
data_vars = {"intercept": intercept, "coeffs": coeffs, "standard_deviation": std}
mathause (Member, Author):

TODO: attach the order
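One way the order could be attached (a sketch with made-up stand-in values; whether it belongs in `attrs` or as a scalar data variable is an open design choice):

```python
import numpy as np
import xarray as xr

# hypothetical stand-in values for the fit results
lags = 1
intercept = xr.DataArray(0.1)
coeffs = xr.DataArray(np.array([0.5]), dims="lags")
std = xr.DataArray(0.2)

data_vars = {"intercept": intercept, "coeffs": coeffs, "standard_deviation": std}

# attach the AR order so it travels together with the fitted parameters
result = xr.Dataset(data_vars, attrs={"order": lags})
```

Storing it in `attrs` keeps the variables clean, at the cost of attributes being dropped by some xarray operations unless `keep_attrs` is set.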

Collaborator:

I'm always in favour of full names so coefficients rather than coeffs but I don't mind too much

@znicholls (Collaborator) left a comment:

Beautiful

Standard deviation of the residuals.
"""

from statsmodels.tsa.ar_model import AutoReg
Collaborator:

Why here and not wrapped in try except in top level? Is that an xarray pattern?

mathause (Member, Author):

I did this for the linear regression because the sklearn class has the same name as ours (LinearRegression) so I just followed the same pattern.
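The pattern in question, illustrated with a stdlib name so the example is self-contained: importing inside the function keeps the external name out of the module namespace, so it cannot clash with a same-named local class:

```python
class Counter:
    # our own class that happens to share its name with collections.Counter,
    # analogous to mesmer's LinearRegression vs sklearn's LinearRegression
    def count(self, items):
        # local import: collections.Counter never enters the module
        # namespace, so the two names cannot shadow each other
        from collections import Counter

        return Counter(items)
```

A top-level `import` under an alias (e.g. `from sklearn.linear_model import LinearRegression as _SklearnLR`) would avoid the clash too; the local import additionally defers loading the dependency until it is actually used.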

@znicholls (Collaborator) commented:

> I am not sure we can follow exactly the same pattern as for the LinearRegression class because the coeffs are averaged over the scens. It would be nice to have the same pattern for both but I am not sure which will have to give

Hmm this is indeed tricky. Can we split into two steps: calculate coeffs for each scenario separately, then average over as a second? That might help us to have the same internal patterns, even if the interfaces do different stuff

AR_int_tmp = 0
AR_coefs_tmp = np.zeros(AR_order_sel)
AR_std_innovs_tmp = 0
data = gv[scen]
Collaborator:

If we split out a function for this inner loop, we might be able to make the pattern look more like linear regression does

Collaborator:

Although I guess it's almost as small as it can be already, the inner function would have to be called something like _fit_auto_regression_with_mean_over_runs I guess.

(Side note, what does 'run' mean here? Is that ensemble member?)

mathause (Member, Author):

I think this is definitely a good idea. However, I am still not sure what our "outer" data structure should be (I have a prototype of a DataList but I am not entirely convinced).

run is an ensemble member. I just followed Lea's terminology. But I agree we should standardize this stuff...

@mathause (Member, Author) commented Apr 3, 2022

Thanks for the feedback. I am still very unsure how this whole thing should look in the end, but I think these changes make sense regardless... So I would probably merge this more or less as is, try to refactor the next chunk, and hope that helps me to see some patterns...

I have a slight preference towards shorter names (as long as their meaning is clear, which is subjective so maybe I should just use the long ones anyway :-P).

@mathause mathause merged commit 91a2868 into MESMER-group:master Apr 7, 2022
@mathause mathause deleted the add_fit_auto_regression branch April 7, 2022 11:04
3 participants