Reduce memory usage #24

wants to merge 6 commits into base: master
Conversation


@mkbosmans mkbosmans commented Nov 29, 2022

This adds a performance test script and some memory optimizations for ipfn_np.

On my test system, 10 iterations on a 12,000x12,000 matrix go from 36 s down to 21 s, and peak memory usage drops from 6.2 GB to 1.8 GB.

It is split into multiple commits so that each individual change is easy to review.
The changes are on top of the vortechbv/ipfn:cleanup branch, so it probably makes sense to merge #23 first.

mkbosmans and others added 6 commits November 28, 2022 21:06
User-visible strings should be quoted with ", while ' is for internal strings.
The max_itr handling is now more explicit, and the convergence criteria are
checked in only one place instead of both in the while condition and after the loop.

The loop now actually stops after doing max_itr iterations, where previously
it would run iterations numbered 0 through max_itr, i.e. one iteration too many.
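A minimal sketch of the loop structure described above; `update`, `max_itr`, and `conv_rate` are illustrative stand-ins, not the real ipfn internals:

```python
def update(m):
    """Toy stand-in for one IPF adjustment pass: halve the value."""
    new = m / 2
    return new, abs(new)

def fit(m, max_itr=100, conv_rate=1e-6):
    # range(max_itr) runs iterations 0 .. max_itr-1, i.e. exactly max_itr
    # passes, instead of 0 .. max_itr (one pass too many).
    for _ in range(max_itr):
        m, diff = update(m)
        if diff < conv_rate:  # convergence checked in one place only
            break
    return m

print(fit(1.0, max_itr=3, conv_rate=0.0))  # three halvings: 0.125
```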
This constructs a large matrix that converges slowly:
12,000x12,000 single-precision numbers, 576 MB in total.
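For scale, the footprint follows directly from the dtype; a sketch of the size arithmetic (only the 12,000x12,000 float32 shape comes from the commit message, the rest is illustrative):

```python
import numpy as np

# 12000 x 12000 float32 entries at 4 bytes each:
n = 12000
size_bytes = n * n * np.dtype(np.float32).itemsize
print(size_bytes)  # 576000000 bytes, i.e. 576 MB

# The same shape in float64 would need twice that:
print(n * n * np.dtype(np.float64).itemsize)  # 1152000000
```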
The self.original and self.aggregates are never modified; they are kept
identical to the original user input.  The `m` and `aggregates` variables are
copies of the user input and may be modified during the iterations.
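A hedged sketch of that split (the attribute names follow the commit message; the class body is illustrative, not the real implementation):

```python
import numpy as np

class IPFNSketch:
    def __init__(self, original, aggregates):
        self.original = original      # kept identical to the user input
        self.aggregates = aggregates  # never modified

    def iteration(self):
        # Working copies are what the fitting loop is allowed to mutate.
        m = np.array(self.original, dtype=float, copy=True)
        aggregates = [np.array(a, dtype=float, copy=True)
                      for a in self.aggregates]
        m *= 2.0  # stand-in for the real adjustment passes
        return m

orig = np.ones((2, 2))
ipf = IPFNSketch(orig, [np.ones(2)])
result = ipf.iteration()
print(orig[0, 0], result[0, 0])  # 1.0 2.0 -- the original is untouched
```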

The `iteration` method is responsible for converting the user input to a
numpy array of floats.  The `ipfn_np` function only checks whether that
condition holds but does no conversion.  This allows `ipfn_np` to be
used directly by users who want the absolute minimum memory usage, without
any extra copy of the input array.
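A sketch of that division of labour, using the function names from the commit message but with illustrative bodies:

```python
import numpy as np

def ipfn_np(m):
    # Low-level routine: verify the dtype, never convert or copy.
    if not np.issubdtype(m.dtype, np.floating):
        raise ValueError("ipfn_np expects a floating-point array")
    return m

def iteration(original):
    # Wrapper: responsible for the one-time conversion (which may copy).
    return ipfn_np(np.asarray(original, dtype=float))

a = np.ones(3, dtype=np.float32)
assert ipfn_np(a) is a  # no extra copy when the caller already holds floats
```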

Instead of always converting to float64, the input arrays are now only
checked to be of floating-point type.  This means that a single-precision
float32 array is kept as-is, allowing for a smaller memory footprint if the
problem tolerates the lower precision.
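A sketch of the effect on memory (using `np.issubdtype` for the check, which is an assumption about the mechanism):

```python
import numpy as np

a32 = np.ones(8, dtype=np.float32)

# Before: an unconditional conversion doubled the footprint of float32 input.
forced = a32.astype(np.float64)

# After: any floating dtype passes through unchanged.
kept = a32 if np.issubdtype(a32.dtype, np.floating) else a32.astype(float)

print(kept.nbytes, forced.nbytes)  # 32 64
```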
…matrices

This saves some memory in the case of more than two steps.
Since the previous commit we no longer need an integer loop index `inc`,
so simply using zip over `aggregates` and `dimensions` is clearer.
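A before/after sketch of that loop (the variable names come from the commit message; the values are placeholders):

```python
aggregates = ["xip", "xpj"]   # placeholder marginal names
dimensions = [[0], [1]]       # axes each marginal constrains

# Before (illustrative): an integer index kept the two lists in step.
pairs_old = []
for inc in range(len(aggregates)):
    pairs_old.append((aggregates[inc], dimensions[inc]))

# After: zip pairs them directly, with no index bookkeeping.
pairs_new = list(zip(aggregates, dimensions))

assert pairs_old == pairs_new
```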