Permutation cluster test with TFCE: improvement of speed and memory usage in 2D #12609

nfourcau · 2024-05-14T12:32:08Z

Reference issue

No ref, only a post in the forum

What does this implement/fix?

In mne.stats.cluster_level.py :
The use of TFCE with large 2D data implied a huge amount of memory because of the creation of as many boolean arrays of the same size of the data as the number of data points (with TFCE, each point is considered as a cluster - consisting of a single point - and need an array to describe it). For this case, I replace the large boolean array by a single index, and removed the clusters output when it was not necessary.

Additional information

It is my first ever PR, all comments are welcomed

welcome · 2024-05-14T12:32:10Z

Hello! 👋 Thanks for opening your first pull request here! ❤️ We will try to get back to you soon. 🚴

larsoner · 2024-05-15T14:41:35Z

mne/stats/cluster_level.py

+    clusters_out : bool
+        If True, clusters are returned, otherwise None is returned instead


Do we need this param at all? Could we instead just effectively have clusters_out = not tfce?

Good observation. Indeed the clusters_out point is to not make the long list of arrays at each call of _find_cluster during permutation computation.
Since 'clusters' with TFCE is simply a list of one cluster per point, we could just decide to make this construction (for TFCE only) outside of the _find_cluster function. This would still need to depend on the dimension of input (1d or 2d) and the type of output wanted 'indices' or 'mask'.
I do not know what is the simplest (or more compliant with MNE practices) between the current additional parameter and moving the TFCE specific lines (lines 494-505) in _permutation_cluster_test (somewhere around lines1021 to 1033). But yes I would agree with the moving.
Any advice ?

+1 for just doing the construction outside of the _find_cluster function wherever it's needed. As long as the public API output stays the same (and hopefully this is checked in a test already, if not then please add it!) then we should be okay to refactor however is cleanest

Done, I moved the "clusters" construction for TFCE out of _find_clusters.

Regarding the API, there is already a test test_output_equiv to which I added the threshold parameter to be tested.
However, I also noticed the False value for adjacency was not tested and indeed there are problems here:

adjacency=False runs only for 1D inputs (which I think is not expected)

for 1D input, the output is always "indices" whatever is out_type

I guess it is rarely used... Should this be corrected in the same or another PR?

I guess it is rarely used... Should this be corrected in the same or another PR?

Let's do it in another PR, I created #12613 so we don't forget

larsoner · 2024-05-15T18:26:18Z

And merging main into your branch should fix CIs so I'll do it now

nfourcau · 2024-05-16T10:24:11Z

And merging main into your branch should fix CIs so I'll do it now

OK but with my new commits I need to choose a strategy (novice with this kind of stuff in git) :
git config pull.rebase false # merge (the default strategy)
git config pull.rebase true # rebase
git config pull.ff only # fast-forward only

What is the correct one? => I chose to merge (since it was was done to integrate the last commits of main in the current branch)

…to tfce-optimization

doc/changes/devel/12609.bugfix.rst

larsoner · 2024-05-16T13:32:29Z

Pushed a tiny commit and merged main into your branch, marking for merge-when-green. Thanks in advance @nfourcau !

drammock · 2024-05-16T16:05:15Z

CI failure is due to edfio adding a post-checkout hook:
https://dev.azure.com/mne-tools/mne-python/_build/results?buildId=30068&view=logs&j=b9064c46-2375-5b70-72c1-f55d0d61c63a&t=e34ff71f-29f1-5601-0139-1a3a772fec70&l=496

@cbrnr do you know what that hook does / whether we should allow it to run in our CIs?

cbrnr · 2024-05-16T16:23:57Z

No, @hofaflo?

hofaflo · 2024-05-16T17:18:02Z

Hmm, strange! As far as I am aware, edfio did not add any hooks – didn't even know that this was possible on a non-local repo level. Opening .git/hooks/post-checkout locally shows this (which makes sense, as we're using LFS to store test files, but that has always been the case):

#!/bin/sh
command -v git-lfs >/dev/null 2>&1 || { echo >&2 "\nThis repository is configured for Git LFS but 'git-lfs' was not found on your path. If you no longer wish to use Git LFS, remove this hook by deleting the 'post-commit' file in the hooks directory (set by 'core.hookspath'; usually '.git/hooks').\n"; exit 2; }
git lfs post-commit "$@"

So no idea why this error is coming up now, sorry! 🤔

Edit: Found the relevant issue: git-lfs/git-lfs#5749

larsoner · 2024-05-16T19:29:15Z

Weird, working on a workaround in #12615

welcome · 2024-05-17T14:24:37Z

🎉 Congrats on merging your first pull request! 🥳 Looking forward to seeing more from you in the future! 💪

ENH: improve tfce speed execution

e93f115

nfourcau changed the title ~~cluster permutation test with TFCE: improvement of speed and memory usage in 2D~~ WIP: cluster permutation test with TFCE: improvement of speed and memory usage in 2D May 14, 2024

nfourcau changed the title ~~WIP: cluster permutation test with TFCE: improvement of speed and memory usage in 2D~~ Cluster permutation test with TFCE: improvement of speed and memory usage in 2D May 14, 2024

nfourcau changed the title ~~Cluster permutation test with TFCE: improvement of speed and memory usage in 2D~~ Permutation cluster test with TFCE: improvement of speed and memory usage in 2D May 14, 2024

nfourcau marked this pull request as ready for review May 14, 2024 13:43

nfourcau requested review from larsoner, drammock, agramfort and dengemann as code owners May 14, 2024 13:43

Update changelog and name file

e39d5d9

larsoner reviewed May 15, 2024

View reviewed changes

Merge branch 'main' into tfce-optimization

a8b7e25

nfourcau added 3 commits May 16, 2024 12:26

FIX move tfce clusters construction out of _find_clusters with good API

ceaa3dc

Add TFCE parameters in cluster_level API tests

862f7ce

Merge branch 'tfce-optimization' of github.com:nfourcau/mne-python in…

fa7c0bd

…to tfce-optimization

larsoner mentioned this pull request May 16, 2024

BUG: Wrong output for adjacency=False #12613

Open

larsoner reviewed May 16, 2024

View reviewed changes

doc/changes/devel/12609.bugfix.rst Outdated Show resolved Hide resolved

larsoner added 2 commits May 16, 2024 09:31

Update doc/changes/devel/12609.bugfix.rst

dd906f7

Merge branch 'main' into tfce-optimization

dad599a

larsoner enabled auto-merge (squash) May 16, 2024 13:32

Merge branch 'main' into tfce-optimization

41ee73b

larsoner added this to the 1.8 milestone May 17, 2024

larsoner merged commit cf0e12d into mne-tools:main May 17, 2024
30 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Permutation cluster test with TFCE: improvement of speed and memory usage in 2D #12609

Permutation cluster test with TFCE: improvement of speed and memory usage in 2D #12609

nfourcau commented May 14, 2024

welcome bot commented May 14, 2024

larsoner May 15, 2024

nfourcau May 15, 2024

larsoner May 15, 2024

nfourcau May 16, 2024

larsoner May 16, 2024

larsoner commented May 15, 2024

nfourcau commented May 16, 2024 •

edited

larsoner commented May 16, 2024

drammock commented May 16, 2024

cbrnr commented May 16, 2024

hofaflo commented May 16, 2024 •

edited

larsoner commented May 16, 2024

welcome bot commented May 17, 2024

		clusters_out : bool
		If True, clusters are returned, otherwise None is returned instead

Permutation cluster test with TFCE: improvement of speed and memory usage in 2D #12609

Permutation cluster test with TFCE: improvement of speed and memory usage in 2D #12609

Conversation

nfourcau commented May 14, 2024

Reference issue

What does this implement/fix?

Additional information

welcome bot commented May 14, 2024

larsoner May 15, 2024

Choose a reason for hiding this comment

nfourcau May 15, 2024

Choose a reason for hiding this comment

larsoner May 15, 2024

Choose a reason for hiding this comment

nfourcau May 16, 2024

Choose a reason for hiding this comment

larsoner May 16, 2024

Choose a reason for hiding this comment

larsoner commented May 15, 2024

nfourcau commented May 16, 2024 • edited

larsoner commented May 16, 2024

drammock commented May 16, 2024

cbrnr commented May 16, 2024

hofaflo commented May 16, 2024 • edited

larsoner commented May 16, 2024

welcome bot commented May 17, 2024

nfourcau commented May 16, 2024 •

edited

hofaflo commented May 16, 2024 •

edited