Corrected a bug on the rejection_sampling_2D algorithm and updated th… #250

mhurte · 2023-06-16T12:12:01Z

Hi everyone,

I would like to share a fix for a bug that I found as I was testing some sampling features of this library.

As I tried to compute the acceptance rate for the rejection sampling algorithm in 2D that is implemented in the sampling.py file, I noticed that something was odd : The following figure shows the acceptance rate as a function of sigma before the fix, compaired to the theoretical one for this method.

Since we know what the theoretical acceptance rate is for this algorithm, it was easy to notice that something was wrong when plotting the figure above. The acceptance rate is quickly converging to 1 when it should be converging to zero.

After searching through the code what the matter could be, I eventually found out that the roles of the mu_a and mu_b as defined in the functions _rejection_sampling_2D_gfunction_plus, _rejection_sampling_2D_gfunction_minus and _rejection_sampling_2D were in fact inverted.

After fixing this, the program now behaves properly, the following figure shows the acceptance rate as a function of sigma after the fix.

Moreover, I added a boolean parameter to _rejection_sampling_2D, return_rate (defaulted to False) that allows to get the acceptance rate for the generation along with the sample if necessary.
I also used the _pdf_r function that was already implemented in the file in the functions _rejection_sampling_2D_gfunction_minus and _rejection_sampling_2D_gfunction_plus to make the code more readable.

The documentation has been updated accordingly.

plcrodrigues · 2023-06-16T13:51:05Z

pyriemann/datasets/sampling.py

@@ -129,24 +122,31 @@ def _rejection_sampling_2D(n_samples, sigma, random_state=None):
        Dispersion of the Riemannian Gaussian distribution.
    random_state : int, RandomState instance or None, default=None
        Pass an int for reproducible output across multiple function calls.
+    return_rate : boolean


I think that calling it return_acceptance_rate would be better, since one could think that it is the rejection_rate or any other bizarre rate.

Indeed, I've changed it.

plcrodrigues

Thanks @mhurte! This is very helpful.

qbarthelemy

Thx @mhurte for this contribution!
Can you complete whatsnew file, and complete tests with the new parameter?

pyriemann/datasets/sampling.py

mhurte · 2023-06-19T12:59:58Z

Hi,
I am having a couple of issues with my pullrequest :
For some reason I am unable to run the pytest locally before pushing my changes as it leads to the following error

ImportError while loading conftest 'C:\Users\Eldor\Desktop\Commit\pyRiemann\tests\conftest.py'.
tests\conftest.py:5: in <module>
    from pyriemann.datasets import make_matrices, make_masks
E   ImportError: cannot import name 'make_matrices' from 'pyriemann.datasets' (C:\Users\Eldor\AppData\Local\Programs\Python\Python311\Lib\site-packages\pyriemann\datasets\__init__.py)

@plcrodrigues seems to be having the same issue as me

Also, during the lasts tests that automatically ran on github I noticed that the test that is not working is the following :

=================================== FAILURES ===================================
____________________________ test_tlrotate[euclid] _____________________________

rndstate = RandomState(MT19937) at 0x7F0C69F626B0, metric = 'euclid'

    @pytest.mark.parametrize("metric", ["euclid", "riemann"])
    def test_tlrotate(rndstate, metric):
        """Test pipeline for rotating the datasets"""
        # check if the distance between the classes of each domain is reduced
        X, y_enc = make_classification_transfer(
            n_matrices=25, class_sep=5, class_disp=1.0, random_state=rndstate)
        rct = TLCenter(target_domain='target_domain')
        X_rct = rct.fit_transform(X, y_enc)
        rot = TLRotate(target_domain='target_domain', metric=metric)
        X_rot = rot.fit_transform(X_rct, y_enc)
        _, y, domain = decode_domains(X_rot, y_enc)
        for label in np.unique(y):
            d = 'source_domain'
            M_rct_label_source = mean_riemann(
                X_rct[domain == d][y[domain == d] == label])
            M_rot_label_source = mean_riemann(
                X_rot[domain == d][y[domain == d] == label])
            d = 'target_domain'
            M_rct_label_target = mean_riemann(
                X_rct[domain == d][y[domain == d] == label])
            M_rot_label_target = mean_riemann(
                X_rot[domain == d][y[domain == d] == label])
            d_rct = distance_riemann(M_rct_label_source, M_rct_label_target)
            d_rot = distance_riemann(M_rot_label_source, M_rot_label_target)
>           assert d_rot <= d_rct
E           assert 0.12430437458015[68](https://github.com/mhurte/pyRiemann/actions/runs/5312005943/jobs/9615952189#step:7:69)5 <= 0.124246099[81](https://github.com/mhurte/pyRiemann/actions/runs/5312005943/jobs/9615952189#step:7:82)3[95](https://github.com/mhurte/pyRiemann/actions/runs/5312005943/jobs/9615952189#step:7:96)297

tests/test_transfer.py:121: AssertionError

And it does not seem to be related to the files that I have modified so far.
Any idea where the problem could come from ?
Thank you in advance.

plcrodrigues · 2023-06-19T13:28:53Z

In fact, it seems that to run pytest locally one has to do:

python -m pytest tests/

Source: https://stackoverflow.com/a/34140498/1898001

…e documentation Update pyriemann/datasets/sampling.py Co-authored-by: Quentin Barthélemy <q.barthelemy@gmail.com>

…or unit test

plcrodrigues · 2023-06-19T14:05:34Z

The changes that you made in _rejection_sampling_2D had a direct impact on the sampling functions that generate a test dataset for the unit test test_tlrotate.

I think that the hyper-parameters that we had chosen for creating the dataset were not really good, so I changed them so to have an example where the distance between the class means after rotation was indeed smaller than after just re-center.

doc/whatsnew.rst

pyriemann/datasets/sampling.py

qbarthelemy · 2023-06-19T19:46:31Z

Thx @mhurte !

qbarthelemy requested a review from plcrodrigues June 16, 2023 12:13

plcrodrigues reviewed Jun 16, 2023

View reviewed changes

qbarthelemy requested changes Jun 16, 2023

View reviewed changes

pyriemann/datasets/sampling.py Outdated Show resolved Hide resolved

pyriemann/datasets/sampling.py Outdated Show resolved Hide resolved

pyriemann/datasets/sampling.py Outdated Show resolved Hide resolved

pyriemann/datasets/sampling.py Outdated Show resolved Hide resolved

mhurte force-pushed the master branch from 09b384f to d32f2fb Compare June 19, 2023 12:30

mhurte force-pushed the master branch from a8f7d0d to 6e35f4c Compare June 19, 2023 13:09

mhurte and others added 2 commits June 19, 2023 15:45

Corrected a bug on the rejection_sampling_2D algorithm and updated th…

e77dd7a

…e documentation Update pyriemann/datasets/sampling.py Co-authored-by: Quentin Barthélemy <q.barthelemy@gmail.com>

correct last pr number in whatsnew

2dd41ff

mhurte force-pushed the master branch from a3cf573 to 2dd41ff Compare June 19, 2023 13:48

mhurte and others added 5 commits June 19, 2023 15:54

Solving whatsnew conflict

0b2d2a5

Merge branch 'pyRiemann:master' into master

d4756fe

Solving whatsnew conflict 2

761d930

Merge branch 'master' of https://github.com/mhurte/pyRiemann

3ab34df

tweaking test_tlrotate parameters to have a reasonably good example f…

d407336

…or unit test

typo correction

db36b67

qbarthelemy approved these changes Jun 19, 2023

View reviewed changes

doc/whatsnew.rst Outdated Show resolved Hide resolved

pyriemann/datasets/sampling.py Outdated Show resolved Hide resolved

last corrections

657547f

qbarthelemy merged commit e496485 into pyRiemann:master Jun 19, 2023
10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Corrected a bug on the rejection_sampling_2D algorithm and updated th… #250

Corrected a bug on the rejection_sampling_2D algorithm and updated th… #250

mhurte commented Jun 16, 2023 •

edited

plcrodrigues Jun 16, 2023

mhurte Jun 16, 2023

plcrodrigues left a comment

qbarthelemy left a comment

mhurte commented Jun 19, 2023 •

edited

plcrodrigues commented Jun 19, 2023

plcrodrigues commented Jun 19, 2023

qbarthelemy commented Jun 19, 2023

Corrected a bug on the rejection_sampling_2D algorithm and updated th… #250

Corrected a bug on the rejection_sampling_2D algorithm and updated th… #250

Conversation

mhurte commented Jun 16, 2023 • edited

plcrodrigues Jun 16, 2023

Choose a reason for hiding this comment

mhurte Jun 16, 2023

Choose a reason for hiding this comment

plcrodrigues left a comment

Choose a reason for hiding this comment

qbarthelemy left a comment

Choose a reason for hiding this comment

mhurte commented Jun 19, 2023 • edited

plcrodrigues commented Jun 19, 2023

plcrodrigues commented Jun 19, 2023

qbarthelemy commented Jun 19, 2023

mhurte commented Jun 16, 2023 •

edited

mhurte commented Jun 19, 2023 •

edited