perf: drop use of awkward in yield uncertainty calculation #408

ekauffma · 2023-05-31T07:48:47Z

ak.sum is much more expensive than anticipated, so this changes the matrix operations in yield_stdev to use numpy instead, as suggested by @alexander-held.

Partially addresses #409 and follows the initial improvements done in #315.

codecov · 2023-05-31T12:48:54Z

Codecov Report

Patch coverage: 100.00% and no project coverage change.

Comparison is base (5064e38) 100.00% compared to head (ab4ceb5) 100.00%.

Additional details and impacted files

@@            Coverage Diff            @@
##            master      #408   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files           22        22           
  Lines         2069      2072    +3     
  Branches       334       337    +3     
=========================================
+ Hits          2069      2072    +3

Impacted Files	Coverage Δ
src/cabinetry/model_utils.py	`100.00% <100.00%> (ø)`

agoose77 · 2023-06-30T10:24:08Z

I had some perf suggestions after this popped up in my notifications:

Pre-convert the parameters, and use np.ufunc.at to avoid a temporary

# calculate the model distribution for every parameter varied up and down
# within the respective uncertainties
# ensure float for correct addition
float_parameters = parameters.astype(float)
for i_par in range(model.config.npars):
    # central parameter values, but one parameter varied within uncertainties
    up_pars = float_parameters.copy()
    np.add.at(up_pars, i_par, uncertainty)
    down_pars = float_parameters.copy()
    np.subtract.at(down_pars, i_par, uncertainty)
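A self-contained sketch of this first suggestion, using made-up parameter values. The names `parameters` and `uncertainty` mirror the snippet above, but the array contents and the per-parameter indexing `uncertainty[i_par]` are assumptions for illustration, not the code in yield_stdev.

```python
import numpy as np

# hypothetical best-fit values (integer dtype on purpose) and uncertainties
parameters = np.array([1, 0, 2])
uncertainty = np.array([0.1, 0.5, 0.25])

# convert to float once, outside the loop, instead of once per variation
float_parameters = parameters.astype(float)

up_variations = []
down_variations = []
for i_par in range(len(float_parameters)):
    # central values, with only parameter i_par shifted
    up_pars = float_parameters.copy()
    np.add.at(up_pars, i_par, uncertainty[i_par])  # in-place update of one entry
    down_pars = float_parameters.copy()
    np.subtract.at(down_pars, i_par, uncertainty[i_par])
    up_variations.append(up_pars)
    down_variations.append(down_pars)
```

Converting before the loop matters because adding a float to an integer array in place would not keep the fractional part; whether `np.add.at` is measurably faster than `up_pars[i_par] += ...` for a single element would need to be benchmarked.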

Pre-allocate stacked arrays and use in-place assignment (unsure of shapes here, so it's pseudo-code)

up_comb_next = np.empty(...)
up_comb_next[...] = up_comb
np.sum(up_comb, axis=0, out=up_comb_next[...])
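Since the shapes are left open above, here is one possible concrete reading, assuming `up_comb` holds per-sample yields with shape (n_samples, n_bins) and that the sum over samples goes into one extra row. Both the shapes and the numbers are hypothetical.

```python
import numpy as np

# hypothetical per-sample yields: 2 samples, 3 bins
up_comb = np.array([[1.0, 2.0, 3.0],
                    [4.0, 5.0, 6.0]])
n_samples, n_bins = up_comb.shape

# allocate the final array once, with one extra row for the sum over samples
up_comb_next = np.empty((n_samples + 1, n_bins), dtype=up_comb.dtype)

# copy the per-sample rows in place
up_comb_next[:n_samples] = up_comb

# write the sum over samples directly into the last row, no temporary array
np.sum(up_comb, axis=0, out=up_comb_next[n_samples])
```

The `out=` argument lets `np.sum` write into a view of the pre-allocated array, so no intermediate result has to be stacked afterwards.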

Pre-allocate the up_variations and down_variations arrays, and assign in-place

up_variations = np.empty(..., dtype=...)

for i_par in range(model.config.npars):
    ...
    up_variations[i_par] = up_yields

It might also be possible to do the above without the

up_yields = np.concatenate((up_comb, up_yields_channel_sum), axis=1)

step, i.e. directly assign the parts. I'm not sure.
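A rough sketch of how the pre-allocation and the direct assignment could fit together. Everything here is an assumption made for illustration: the channel layout in `bins_per_channel`, the row count, and the random `fake_yields` array that stands in for the model output per parameter variation. The per-channel sums are written with `np.add.reduceat`, which is one option among several.

```python
import numpy as np

n_pars = 2
bins_per_channel = [2, 3]  # hypothetical: two channels with 2 and 3 bins
n_bins = sum(bins_per_channel)
n_channels = len(bins_per_channel)
n_rows = 3  # hypothetical: e.g. samples plus their sum

# stand-in for the yields obtained for each varied parameter
rng = np.random.default_rng(0)
fake_yields = rng.random((n_pars, n_rows, n_bins))

# start index of every channel along the bin axis: [0, 2]
channel_starts = np.cumsum([0] + bins_per_channel[:-1])

# allocate once: per-bin yields followed by one column per channel sum
up_variations = np.empty((n_pars, n_rows, n_bins + n_channels))

for i_par in range(n_pars):
    up_comb = fake_yields[i_par]
    # per-bin part is assigned directly, no concatenate needed
    up_variations[i_par, :, :n_bins] = up_comb
    # per-channel sums are written straight into the trailing columns
    np.add.reduceat(
        up_comb, channel_starts, axis=1, out=up_variations[i_par, :, n_bins:]
    )
```

This produces the same layout as concatenating the per-bin yields with the channel sums along axis 1, but without building a new array in every iteration.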

alexander-held · 2023-06-30T15:09:44Z

Thanks a lot for the additional suggestions @agoose77! I moved them to #415. They look great, but I would prefer that we address them in a separate PR to keep the changes a bit more atomic.

alexander-held

Looks all good to me, thanks a lot for getting this ready!

* drop use of awkward in yield uncertainty calculations
* reduced memory footprint and performance improvements for yield uncertainty calculations

changed awkward operations to numpy in yield_stdev

b0b0fd1

alexander-held reviewed May 31, 2023

alexander-held changed the title ~~fix: change awkward operations to numpy in yield_stdev~~ perf: change awkward operations to numpy in yield_stdev May 31, 2023

This was referenced May 31, 2023

model_utils.yield_stdev is very slow #315 (Closed)

Large memory use of model_utils.yield_stdev #409 (Open)

ekauffma added 2 commits May 31, 2023 12:14

run black on model_utils

1e1d5a0

added import spacing and reduces line lengths for flake8

b94094e

agoose77 reviewed May 31, 2023

ekauffma added 3 commits May 31, 2023 15:11

get rid of unnecessary tolist

8a58030

Revert "get rid of unnecessary tolist"

c6bef1d

This reverts commit 8a58030.

change to stack and get rid of unnecessary tolist in yield sum

4fb9614

alexander-held reviewed Jun 30, 2023

alexander-held reviewed Jun 30, 2023

alexander-held reviewed Jun 30, 2023

alexander-held reviewed Jun 30, 2023

*_yields_sum -> *_yields_channel_sum

48f1ac0

alexander-held reviewed Jun 30, 2023

Update src/cabinetry/model_utils.py

35937e4

Co-authored-by: Alexander Held <45009355+alexander-held@users.noreply.github.com>

alexander-held reviewed Jun 30, 2023

ekauffma and others added 3 commits June 30, 2023 11:55

Update src/cabinetry/model_utils.py

fd70393

Co-authored-by: Alexander Held <45009355+alexander-held@users.noreply.github.com>

Update src/cabinetry/model_utils.py

402ab2d

Co-authored-by: Alexander Held <45009355+alexander-held@users.noreply.github.com>

Update src/cabinetry/model_utils.py

0919f24

Co-authored-by: Alexander Held <45009355+alexander-held@users.noreply.github.com>

ekauffma and others added 2 commits June 30, 2023 12:59

rearranged lines which were too long

89f80b6

Update src/cabinetry/model_utils.py

4677461

Co-authored-by: Alexander Held <45009355+alexander-held@users.noreply.github.com>

alexander-held mentioned this pull request Jun 30, 2023

Additional performance improvements for model_utils.yield_stdev #415 (Open)

Merge branch 'master' into awk-to-numpy

9c7c11f

alexander-held reviewed Jul 4, 2023

alexander-held reviewed Jul 4, 2023

alexander-held reviewed Jul 4, 2023

ekauffma and others added 3 commits July 4, 2023 16:31

n_bins change and flipped order of bin vs channel calculations

4b7866a

Update src/cabinetry/model_utils.py

e541e7c

Co-authored-by: Alexander Held <45009355+alexander-held@users.noreply.github.com>

added alex's comment about per-channel calculation

3c7f317

alexander-held reviewed Jul 4, 2023

alexander-held reviewed Jul 4, 2023

ekauffma and others added 2 commits July 5, 2023 08:24

Update src/cabinetry/model_utils.py

2332ce5

Co-authored-by: Alexander Held <45009355+alexander-held@users.noreply.github.com>

Update src/cabinetry/model_utils.py

ab4ceb5

Co-authored-by: Alexander Held <45009355+alexander-held@users.noreply.github.com>

alexander-held approved these changes Jul 5, 2023

alexander-held changed the title ~~perf: change awkward operations to numpy in yield_stdev~~ perf: drop use of awkward in yield uncertainty calculation Jul 5, 2023

alexander-held merged commit 7b9b6af into scikit-hep:master Jul 5, 2023
7 checks passed

alexander-held mentioned this pull request Jul 5, 2023

build: drop explicit awkward dependency #419 (Merged)
