Permutation tests for CCA #124

LegrandNico · 2021-12-08T15:30:53Z

I would like to implement (if not already available elsewhere) a Python version of the permutation tests for CCA described in

Winkler AM, Renaud O, Smith SM, Nichols TE. Permutation Inference for Canonical Correlation Analysis. NeuroImage. 2020; 117065 (see article here)

The paper comes with a repository (Matlab) that could be ported to Python without requiring additional dependencies.

jameschapman19 · 2021-12-08T15:57:47Z

Funny you mention this as it's something vaguely on my radar!

I actually made a start porting the quickperms from the PALM toolbox (with the permission of Winkler/Smith) here: https://github.com/jameschapman19/scikit-perm/blob/main/skperm/permutation_tests/cca_permutation_test.py

That would be a nice bonus alongside a ported version of permCCA (i.e. one where the user can supply their own permutations based on exchangeability blocks).
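To make the exchangeability-block idea concrete, here is a minimal sketch of generating one permutation that only shuffles samples within their own block. It covers the simplest case only; the function name `permute_within_blocks` and the toy block labels are made up for illustration and are not taken from PALM, PermCCA or scikit-perm.

```python
import numpy as np


def permute_within_blocks(blocks, rng):
    """Return a row index where samples are shuffled only inside their block.

    `blocks` holds one label per sample; samples sharing a label are
    treated as exchangeable with each other (hypothetical helper).
    """
    blocks = np.asarray(blocks)
    idx = np.arange(len(blocks))
    for label in np.unique(blocks):
        members = np.flatnonzero(blocks == label)
        # Shuffle the positions belonging to this block among themselves.
        idx[members] = rng.permutation(members)
    return idx


rng = np.random.default_rng(0)
blocks = np.array([0, 0, 0, 1, 1, 2, 2, 2])  # toy labels, e.g. one per family or site
perm = permute_within_blocks(blocks, rng)
```

The resulting index could then be applied to the rows of one view (for example `Y[perm]`) in place of a free `rng.permutation(n)`.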

htwangtw · 2021-12-08T16:28:39Z

I have code for the CCA part mentioned by @LegrandNico: the full implementation in Python, using cca_zoo.
It's not packaged up, but if people want to work from here, this is the snippet:

# permutation testing from
# https://github.com/andersonwinkler/PermCCA/blob/6098d35da79618588b8763c5b4a519438703dba4/permcca.m#L131-L164
# import numpy as np
# import pandas as pd
# from cca_zoo.models import PMD
# n_permutation = 2
# rng = np.random.RandomState(42)
# lW, cnt  = np.zeros(latent_dims), np.zeros(latent_dims)
# for i in range(n_permutation):
#     print(f"Permutation {1 + i} / {n_permutation} ")
#     if i == 0:
#         X_perm = z_transitions
#         Y_perm = z_mriq
#     else:
#         x_idx = rng.permutation(710)
#         y_idx = rng.permutation(710)
#         X_perm = z_transitions[x_idx]
#         Y_perm = z_mriq[y_idx]
#     for k in range(latent_dims):
#         print(f"Mode {1 + k} of {latent_dims}")
#         perm_model = PMD(c=trained_c,
#                             latent_dims=(latent_dims - k),
#                             max_iter=100)
#         perm_model.fit(X_perm[:, k:], Y_perm[:, k:])
#         r_perm = perm_model.train_correlations[0][1]
#         print(r_perm)
#         lWtmp = -1 * np.cumsum(np.log(1 - r_perm ** 2)[::-1])[::-1]
#         print(lWtmp)
#         lW[k] = lWtmp[0]
#     if i == 0:
#         lw1 = lW.copy()  # copy: lW is overwritten in place on later permutations
#     cnt = cnt + (lW >= lw1)

# punc  = cnt / n_permutation
# pfwer = pd.DataFrame(punc).cummax().values
# print(punc)
# print(pfwer)

This is an incredibly lazy attempt.
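For anyone who wants to run the loop without cca_zoo, here is a self-contained NumPy sketch of the same idea. It makes several simplifying assumptions: classical (unregularised) CCA via QR and SVD stands in for PMD, there are no nuisance variables, only the rows of one set of canonical variates are permuted, and the null-space completion done in permcca.m is skipped. The helper names `cca`, `wilks_stat` and `perm_cca` are invented for this example.

```python
import numpy as np


def cca(X, Y):
    """Classical CCA via QR + SVD; returns weights A, B and correlations r."""
    X = X - X.mean(axis=0)
    Y = Y - Y.mean(axis=0)
    Qx, Rx = np.linalg.qr(X)
    Qy, Ry = np.linalg.qr(Y)
    L, d, Mt = np.linalg.svd(Qx.T @ Qy, full_matrices=False)
    A = np.linalg.solve(Rx, L)
    B = np.linalg.solve(Ry, Mt.T)
    r = np.clip(d, 0.0, 1.0 - 1e-12)  # keep log(1 - r**2) finite
    return A, B, r


def wilks_stat(r):
    """Entry k is -sum_{j >= k} log(1 - r_j**2), as in the lWtmp line above."""
    return -np.cumsum(np.log(1 - r ** 2)[::-1])[::-1]


def perm_cca(X, Y, n_perm=200, seed=0):
    """Stepwise permutation p-values per canonical mode (simplified sketch)."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    A, B, r = cca(X, Y)
    U = (X - X.mean(axis=0)) @ A  # canonical variates of X
    V = (Y - Y.mean(axis=0)) @ B  # canonical variates of Y
    K = len(r)
    cnt = np.zeros(K)
    for p in range(n_perm):
        # The first "permutation" is the identity, giving the observed statistic.
        idx = np.arange(n) if p == 0 else rng.permutation(n)
        lW = np.zeros(K)
        for k in range(K):
            # Drop the first k modes, then refit on the remaining variates.
            _, _, r_perm = cca(U[idx, k:], V[:, k:])
            lW[k] = wilks_stat(r_perm)[0]
        if p == 0:
            lW_obs = lW.copy()
        cnt += lW >= lW_obs
    punc = cnt / n_perm                  # uncorrected p-values
    pfwer = np.maximum.accumulate(punc)  # cumulative max, like cummax above
    return punc, pfwer


# Toy data: one shared latent signal, everything else is noise.
rng = np.random.default_rng(1)
z = rng.normal(size=(200, 1))
X = np.hstack([z + 0.1 * rng.normal(size=(200, 1)), rng.normal(size=(200, 2))])
Y = np.hstack([z + 0.1 * rng.normal(size=(200, 1)), rng.normal(size=(200, 1))])
punc, pfwer = perm_cca(X, Y, n_perm=50, seed=0)
```

Because the unpermuted data counts as one of the permutations, the smallest p-value this can return is `1 / n_perm`.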

LegrandNico · 2021-12-09T11:20:32Z

Thank you @htwangtw, I think that will be really helpful.

I don't know how you want to integrate the permutation functionality with the rest of the package. I can try to put something together, but I will probably start with an example tutorial notebook to check that everything runs.

jameschapman19 · 2021-12-13T13:00:42Z

I would guess you could adapt @htwangtw's code to look something like https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.permutation_test_score.html

It could be rudimentary initially, with the goal of getting close to the sklearn API, which can run permutation tests in parallel.

JohannesWiesner · 2022-02-10T16:44:03Z

Just for completeness: There are two other packages that might also help?

The pyls package has implemented permutation tests:
https://pyls.readthedocs.io/en/latest/index.html

And there's also the resample package:
https://github.com/resample-project/resample

jameschapman19 · 2022-06-06T22:17:55Z

I've put a version of this in cca_zoo.model_selection._validation with an API that hooks into scikit-learn permutation_test_score.

It's not quite the same as what Winkler does for multiple latent dimensions (though it should be similar), but it works for 1 latent dimension. We should be able to build on this.

jameschapman19 · 2022-06-06T22:22:20Z

I'll add a proper example but it should work like:

from cca_zoo.model_selection import permutation_test_score
from cca_zoo.models import rCCA
import numpy as np

X = np.random.rand(100, 10)
Y = np.random.rand(100, 8)
model = rCCA(c=[0.1, 0.3])

permutation_test_score(model, [X, Y], n_permutations=10)

which returns the score (average correlation across dimensions), the permutation scores, and the p-value, like scikit-learn.
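As a rough illustration of how such a returned triple fits together, the sketch below computes a p-value from an observed score and a set of permutation scores using the formula scikit-learn documents for its own `permutation_test_score`, `(C + 1) / (n_permutations + 1)`. That cca_zoo uses exactly this formula is an assumption here; the helper name `permutation_pvalue` and all numbers are made up.

```python
import numpy as np


def permutation_pvalue(score, permutation_scores):
    """(C + 1) / (n_permutations + 1), where C counts permutation scores >= score."""
    permutation_scores = np.asarray(permutation_scores)
    count = np.sum(permutation_scores >= score)
    return (count + 1.0) / (len(permutation_scores) + 1.0)


# Made-up numbers: two canonical correlations averaged into a single score.
score = np.mean([0.62, 0.35])
permutation_scores = np.array(
    [0.21, 0.18, 0.50, 0.25, 0.30, 0.19, 0.27, 0.22, 0.33, 0.24]
)
pvalue = permutation_pvalue(score, permutation_scores)
```

With only 10 permutations the smallest attainable p-value under this formula is 1/11, so a real analysis would need far more permutations than the toy call above.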

WantongLi123 · 2023-09-01T21:52:39Z

I got a lot of insight from the discussion here, but one thing still confuses me:
If I permute a single variable across all views (number of variables: 1), or 1/5 of all variables in all views (number of variables: around 10), the resulting p-values are very different.

It's understandable that permuting more variables introduces more randomness, so the canonical correlation coefficient is lower relative to the reference experiment.
But do you know how many variables it is common to permute? And should the permutation be applied to a single view or to multiple views?

Many thanks in advance!

Cheers,
Wantong

JohannesWiesner mentioned this issue Jan 18, 2022

Implement reordering function to assess feature significance? #130

Open

LegrandNico mentioned this issue Feb 25, 2022

Add first version of permutation interence method in the model_selection module. #136

Open

jameschapman19 added the enhancement, help wanted and good first issue labels May 8, 2022

jameschapman19 mentioned this issue Jun 22, 2022

The p-value of "cca_zoo.models.SCCA_PMD.pairwise_correlations" #146

Closed

jameschapman19 closed this as completed Jul 21, 2022
