power spectrum module optimization and parallelization #102

lgarrison · 2023-07-12T17:37:34Z

Some optimization, parallelization, and Numba-fication of various parts of the power spectrum module. Benchmark scripts used to produce the timings in the ZCV paper are included.

…the (k,mu) counts

… calc_pk_from_deltak. Refactor handling of poles argument as numba workaround.

lgarrison · 2023-07-12T22:05:07Z

@boryanah Here are the changes we talked about. It's about half optimization and half refactoring. I tried to rename some things that were confusing to me, but let me know if I misunderstood anything. I also tried to cut down on the number of arguments and return values in some places. For example, I changed calc_power() to return an Astropy Table; let me know if you think that makes sense.

boryanah · 2023-07-14T01:47:09Z

That looks great to me! The optimizations make sense (I am sorry I didn't implement the normalization one myself). The astropy table for the power spectrum with the meta data is great, and I think it's a good solution for outputting a single object that contains some useful information about the simulation. I think it makes sense why you got rid of some of the del X; gc.collect() (they probably weren't doing anything as the variables were passed externally rather than locally defined). I also like the variable and function name changes, and I think they make more sense.

lgarrison · 2023-07-14T14:08:43Z

Yeah, that's exactly right about the gc.collect(). The other reason was that garbage collection was making the timings really noisy; there might be one or two that could go back in, but for the most part I think they weren't doing anything. And if we really wanted to save memory, there are other ways: using an in-place FFT (pyfftw supports this, not sure if scipy.fft does), and adding on-the-fly offsets for the interlacing calculation (right now it makes a whole copy of the input data).

lgarrison added 2 commits July 12, 2023 12:35

power: add benchmark scripts

cffade3

power: optimization, parallelization, numba-fication, formatting

402db70

lgarrison force-pushed the pk_bench branch from 43cdd99 to 402db70 Compare July 12, 2023 17:39

lgarrison marked this pull request as draft July 12, 2023 17:39

lgarrison added 5 commits July 12, 2023 13:40

power: doc

ef843d2

Merge branch 'main' into pk_bench

1567c59

power: don't count modes for poles directly; it can be computed from …

5b4e91c

…the (k,mu) counts

power: refactor calc_power to return astropy Table. Rename calc_pk ->…

e9fece0

… calc_pk_from_deltak. Refactor handling of poles argument as numba workaround.

power: update tutorials and scripts for refactored code

900e4ff

lgarrison marked this pull request as ready for review July 12, 2023 22:05

lgarrison requested a review from boryanah July 12, 2023 22:07

changelog

66aa855

lgarrison merged commit b0ae7b7 into main Jul 14, 2023
8 checks passed

lgarrison deleted the pk_bench branch July 14, 2023 14:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

power spectrum module optimization and parallelization #102

power spectrum module optimization and parallelization #102

lgarrison commented Jul 12, 2023

lgarrison commented Jul 12, 2023

boryanah commented Jul 14, 2023

lgarrison commented Jul 14, 2023

power spectrum module optimization and parallelization #102

power spectrum module optimization and parallelization #102

Conversation

lgarrison commented Jul 12, 2023

lgarrison commented Jul 12, 2023

boryanah commented Jul 14, 2023

lgarrison commented Jul 14, 2023