
Optimize compute_kda_ma for memory and speed #857

Merged · 23 commits · Jan 10, 2024
Conversation

@adelavega (Member) commented Jan 5, 2024

  • Implement a "summary_array" return type for MKDAKernel, which convolves kernels with coordinates in a dense 3D volume, summing counts in place and saving substantial memory and compute.
  • Use numba to speed up sphere kernel convolution.
  • Set types to int to reduce memory usage (@jdkent).
  • Minor improvements throughout.

For large-scale MKDAChi2 (i.e. using Neurosynth dataset), memory footprint is reduced ~18-20x (25GB to 1.2GB), and computation is sped up ~3.2x.
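A minimal sketch of the in-place summary-array idea described above: instead of materializing one modeled-activation volume per study and stacking them, sphere kernels are summed directly into a single dense int count volume. All names here (`sum_spheres_inplace`, `sphere_offsets`, the stamp-array trick for deduplicating foci within a study) are illustrative, not NiMARE's actual API, and the sketch assumes foci are grouped by study.

```python
import numpy as np

try:
    from numba import njit
except ImportError:  # fall back to pure Python if numba is unavailable
    def njit(func):
        return func


@njit
def sum_spheres_inplace(counts, stamp, ijks, study_ids, offsets):
    """Accumulate per-study binary sphere kernels into one dense count volume.

    counts:    int32 3D volume, updated in place (at most +1 per study per voxel).
    stamp:     int32 3D volume initialized to -1; records the last study that
               touched each voxel, so duplicate foci within a study count once.
               Assumes foci are grouped (contiguous) by study.
    ijks:      (n_foci, 3) int array of peak voxel indices.
    study_ids: (n_foci,) int array mapping each focus to its study.
    offsets:   (n_offsets, 3) int array of voxel offsets within the sphere.
    """
    nx, ny, nz = counts.shape
    for c in range(ijks.shape[0]):
        s = study_ids[c]
        for o in range(offsets.shape[0]):
            i = ijks[c, 0] + offsets[o, 0]
            j = ijks[c, 1] + offsets[o, 1]
            k = ijks[c, 2] + offsets[o, 2]
            if (i >= 0 and i < nx and j >= 0 and j < ny
                    and k >= 0 and k < nz and stamp[i, j, k] != s):
                stamp[i, j, k] = s
                counts[i, j, k] += 1


def sphere_offsets(radius):
    """All integer voxel offsets within a Euclidean radius (in voxels)."""
    r = int(np.ceil(radius))
    grid = np.mgrid[-r:r + 1, -r:r + 1, -r:r + 1].reshape(3, -1).T
    return grid[(grid ** 2).sum(axis=1) <= radius ** 2].astype(np.int64)
```

Because the accumulator is a single int32 volume rather than an (n_studies, n_voxels) float array, peak memory no longer scales with the number of studies, which is the source of the ~18-20x reduction reported above.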


codecov bot commented Jan 5, 2024

Codecov Report

Attention: 16 lines in your changes are missing coverage. Please review.

Comparison: base (efae75e) 88.48% vs. head (a89c16d) 88.22%.
Report is 1 commit behind head on main.

❗ Current head a89c16d differs from pull request most recent head 6236f6c. Consider uploading reports for the commit 6236f6c to get more accurate results

Files                   Patch %   Lines
nimare/meta/utils.py    67.44%    14 Missing ⚠️
nimare/meta/kernel.py   87.50%     2 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #857      +/-   ##
==========================================
- Coverage   88.48%   88.22%   -0.26%     
==========================================
  Files          48       48              
  Lines        6337     6342       +5     
==========================================
- Hits         5607     5595      -12     
- Misses        730      747      +17     


@adelavega (Member, Author)

@jdkent these improvements make a small difference to run speed, but it's a pretty hard thing to optimize.

In the end, it's just a slow and fairly expensive process.

@adelavega adelavega changed the title WIP: Optimize compute_kda_ma Optimize compute_kda_ma for memory and speed Jan 5, 2024
adelavega and others added 7 commits January 5, 2024 17:18
* Resolve merge

* Add sum across studies

* Remove @Profile

* Only enable sum across studies for MKDA Chi Squared

* Run black

* Return dense for MKDAChiSquared

* Update nimare/meta/utils.py

Co-authored-by: James Kent <jamesdkent21@gmail.com>

* Run black

* Update nimare/meta/utils.py

Co-authored-by: James Kent <jamesdkent21@gmail.com>

* Format suggestion

* change how number of studies and active voxels are found

* add explicit dtype when creating image

* make the comment clearer

* add the kernel argument to the dictionary

* bump minimum required versions

* alternative way to sum across studies in a general way

* fix arguments and style

* pin minimum seaborn version

---------

Co-authored-by: Alejandro de la Vega <alejandro@florezita.lan>
Co-authored-by: James Kent <jamesdkent21@gmail.com>
@adelavega (Member, Author)

Even more speed gained by indexing the ijks array within numba.
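A hypothetical illustration of that comment: rather than slicing `ijks` per study in Python and calling the jitted kernel once per study, pass the whole array plus per-study start indices and do the indexing inside the compiled function, so the loop over studies never crosses back into the interpreter. The function name and the start-index layout are illustrative, not the PR's actual code.

```python
import numpy as np

try:
    from numba import njit
except ImportError:  # fall back to pure Python if numba is unavailable
    def njit(func):
        return func


@njit
def per_study_sums(values, starts):
    """Sum int64 values per study, indexing inside compiled code.

    starts has length n_studies + 1; study s owns values[starts[s]:starts[s+1]].
    """
    n = starts.shape[0] - 1
    out = np.zeros(n, dtype=np.int64)
    for s in range(n):
        # Index directly into the shared array; no per-study Python slicing.
        for c in range(starts[s], starts[s + 1]):
            out[s] += values[c]
    return out
```

The same pattern applies to the kernel-convolution loop: one compiled call over all studies avoids per-call dispatch overhead and intermediate array copies.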

@jdkent (Member) left a comment


LGTM! It's added and tested. @JulioAPeraza will be adding results for additional confirmation that the results look the same using the updated method.

@adelavega (Member, Author)

Awesome. Let's merge once everything passes, and cut a release soon.

@jdkent jdkent merged commit 1fa0603 into main Jan 10, 2024
17 checks passed
@JulioAPeraza (Collaborator)

@adelavega, the new changes are fantastic! In the past, I could not train the decoder with more than 16 cores.

I used 40 cores this time, and it took 11 minutes to train an LDA-based decoder with 200 topics, and it only required 1.7 GB of memory.

@jdkent, I compared the decoders' results, and the output maps' values are exactly the same.

I also tested the new changes with different dataset sizes based on Neurosynth. See below:

[Figure: MKDAChi2 comparison across dataset sizes]

@adelavega (Member, Author)

Julio, thanks for running this comparison. This is fantastic :)
This should make it much easier for people to replicate the results of your new paper.
