Fix Probabilistic metrics #232

Merged: 40 commits merged from AS_add_brier into master on Sep 23, 2019

Conversation

@aaronspring (Collaborator) commented Aug 28, 2019

Description

  • added threshold_brier_score and brier_score from xskillscore and correctly implemented crps and crpss: https://github.com/raybellwaves/xskillscore/pull/20
  • added numba to the environment
  • compute functions can now take kwargs (to provide threshold, comparison, ...) and the metrics themselves check kwargs for needed parameters (see the sketch after this list)
  • renamed x_METRICS to DETERMINISTIC_x_METRICS
  • slow tests with crpss not assuming a Gaussian distribution are skipped for time reasons, but they work
  • compute_pm and compute_hindcast take a dim argument to assign the dimension to calculate the metric over (cannot work for ACC): https://github.com/bradyrx/climpred/issues/187
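A minimal sketch of this kwargs pass-through, assuming xarray DataArray inputs; compute_skill and exceedance_brier_score are hypothetical names for illustration, not climpred's actual API:

```python
import xarray as xr

def exceedance_brier_score(forecast, reference, dim='member', **kwargs):
    # Hypothetical probabilistic metric: it looks up the parameter it
    # needs (here `threshold`) in the forwarded kwargs.
    threshold = kwargs.get('threshold')
    if threshold is None:
        raise ValueError('this metric requires a `threshold` keyword')
    prob = (forecast > threshold).mean(dim)  # ensemble exceedance probability
    obs = reference > threshold              # binary observation
    return ((prob - obs) ** 2).mean()

def compute_skill(forecast, reference, metric=exceedance_brier_score,
                  dim='member', **metric_kwargs):
    # The compute function forwards unknown keywords to the metric,
    # so a caller can write compute_skill(fcst, ref, threshold=0.5).
    return metric(forecast, reference, dim=dim, **metric_kwargs)
```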

Small fixes:

Fixes https://github.com/bradyrx/climpred/issues/122 https://github.com/bradyrx/climpred/issues/22 https://github.com/bradyrx/climpred/issues/218 https://github.com/bradyrx/climpred/issues/187 https://github.com/bradyrx/climpred/issues/233

Type of change

Please delete options that are not relevant.

  • Breaking change (renaming)
  • New feature (non-breaking change which adds functionality)
  • This change requires a documentation update

How Has This Been Tested?

  • pytest
  • trust properscoring

Checklist (while developing)

  • I have added docstrings to all new functions.
  • I have commented my code, particularly in hard-to-understand areas.
  • Tests added for pytest, if necessary.
  • I have updated the sphinx documentation, if necessary.

Pre-Merge Checklist (final steps)

  • I have rebased onto master or develop (wherever I am merging) and dealt with any conflicts.
  • I have squashed commits to a reasonable amount, and force-pushed the squashed commits.
  • I have run make html on the documents to make sure example notebooks still compile.

References

Please add any references to manuscripts, textbooks, etc.

@bradyrx (Collaborator) commented Aug 28, 2019

Is there any reason this wasn't already in the package? Just something we missed? Anyways, thanks for adding this @aaronspring , it's an important one. It would be good to have some demos in this PR thread of it in use, since we haven't really tested the probabilistic ones too heavily in my view.

@coveralls commented Aug 28, 2019

Coverage Status

Coverage decreased (-0.1%) to 85.407% when pulling 7d9ecba on AS_add_brier into 8f6d435 on master.

@aaronspring (Collaborator Author) commented:

I left it out because of the additional threshold argument that is needed. doppyo just calculates the Brier score (and its decomposition) from True/False input; that would be another option, i.e. doing the threshold comparison outside the metric function. (Now it's inside the threshold_brier_score function from properscoring, though they also have the plain brier_score function.)

Regarding testing of the Brier score: I took it from properscoring, where it is tested.
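For illustration, a hedged sketch of both options using properscoring directly (treat the exact argument order as an assumption from properscoring's documentation, not as climpred code):

```python
import numpy as np
import properscoring as ps

obs = np.random.rand(100)      # observed values
ens = np.random.rand(100, 10)  # 100 forecasts x 10 ensemble members
threshold = 0.5

# Option A (doppyo-style): do the True/False comparison outside the metric,
# then score the exceedance probabilities with the plain Brier score.
prob = (ens > threshold).mean(axis=-1)
bs_outside = ps.brier_score(obs > threshold, prob)

# Option B (this PR): pass the threshold into the metric and let
# threshold_brier_score handle the comparison internally.
bs_inside = ps.threshold_brier_score(obs, ens, threshold)

print(bs_outside.mean(), bs_inside.mean())  # should agree up to tie handling
```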

@aaronspring (Collaborator Author) commented:

Wait until the open issues at https://github.com/raybellwaves/xskillscore/issues are done.

@aaronspring aaronspring changed the title Add Brier Fix Probabilistic metrics Sep 3, 2019
@bradyrx (Collaborator) commented Sep 4, 2019

Haven't looked at this yet, but make sure to update setup/requirements/environments to require the new xskillscore version that includes your updates from https://github.com/raybellwaves/xskillscore/pull/20.

@aaronspring aaronspring self-assigned this Sep 8, 2019
@aaronspring aaronspring added the bug label Sep 8, 2019
@aaronspring (Collaborator Author) commented:

I get reasonable results (reproducing Kadow et al. 2016) for crpss_es, but not yet for LESS.

Review thread on climpred/bootstrap.py (outdated, resolved)
@bradyrx (Collaborator) left a review comment:

added some more comments

@aaronspring (Collaborator Author) commented Sep 22, 2019

> added some more comments

I always forget to check the Changed files tab. Now all resolved. Please go over them and unresolve any section you think still needs modification.

This PR is now ready. The only metric whose results I don't fully understand is LESS. Should I add a note of caution there? I am quite confident about the rest.

Review threads on climpred/bootstrap.py, climpred/comparisons.py, climpred/metrics.py, and climpred/prediction.py (resolved)
@bradyrx (Collaborator) commented Sep 22, 2019

Okay @aaronspring, I added a few more comments. Mostly about docstrings and stuff just to polish it up. But we should be good to merge this after you finish.

> This PR is now ready. The only metric whose results I don't fully understand is LESS. Should I add a note of caution there? I am quite confident about the rest.

If you don't fully understand the results / aren't confident in them, I think you should remove LESS entirely from the package and docs. Then open an issue on implementing LESS and we can address it in its own small PR. I don't want any metrics in here that we aren't confident in, since people will be using them for their analyses.

Also, can you explain once more the reason why we need or want the dim argument on our compute functions? I still think it adds confusion. Isn't this a duplication of the comparison argument?

I'll merge and update the version number after this is all done. Thanks a lot!!

@bradyrx (Collaborator) left a review comment:

added one small comment to testing.

@aaronspring (Collaborator Author) commented Sep 23, 2019

Reasoning for the dim argument:

The dim argument defines the dimension over which a metric is computed. We can apply a metric over init, member, or [init, member] (the latter in perfect-model mode). The resulting skill is reduced along the dim specified as argument. Applying a metric over dim='member' therefore yields a skill value for each init individually, which shows the initial-condition dependence of skill. Likewise, computing skill over init yields a skill value for each member, although this is rarely useful because member i from one init has nothing in common with member i from another.
The comparison argument is different: it only specifies how forecast and reference are constructed.

The above logic applies to deterministic metrics. Probabilistic metrics must be applied over dim='member' with an m2r, m2m, or m2c comparison.
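A hedged illustration (compute_perfect_model and the tutorial dataset names follow climpred's documentation of this era; exact signatures and defaults are assumptions and may differ):

```python
from climpred.prediction import compute_perfect_model
from climpred.tutorial import load_dataset

ds = load_dataset('MPI-PM-DP-1D')         # perfect-model ensemble
control = load_dataset('MPI-control-1D')  # accompanying control run

# Deterministic metric reduced over member only: one skill value per init,
# showing the initial-condition dependence of skill.
skill_per_init = compute_perfect_model(ds, control, metric='rmse',
                                       comparison='m2m', dim='member')

# Probabilistic metric: must be applied over dim='member'
# with an m2m/m2c (or m2r for hindcasts) comparison.
crps = compute_perfect_model(ds, control, metric='crps',
                             comparison='m2m', dim='member')
```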

@aaronspring (Collaborator Author) commented:

Implemented all. Please use the Squash and merge button.

@bradyrx (Collaborator) commented Sep 23, 2019

Thanks @aaronspring so much for all the work on this!! Will merge after these tests run, then will roll out v1.1

@aaronspring aaronspring merged commit 6e33ff0 into master Sep 23, 2019
@bradyrx bradyrx deleted the AS_add_brier branch September 23, 2019 19:54
Development

Successfully merging this pull request may close these issues.

test_perfect_model_prediction takes a while to run
4 participants