[MRG] Random sampler #295

tgnassou · 2021-07-06T15:09:47Z

Add a new sampler inspired by the USleep article. We define a fixed number of sequences. For each sequence, a window is chosen from a random recording and a random class. The sequence is then created around this window, with the window placed at a random position within it. Each class has the same probability of being chosen, so this method represents the classes equitably despite their different proportions.
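For readers skimming the thread, the procedure described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the code in this PR: the function name `sample_balanced_sequences`, its parameters, and the choice to draw the class among those present in the chosen recording are all hypothetical.

```python
import numpy as np


def sample_balanced_sequences(labels_per_recording, n_windows, n_sequences, seed=0):
    """Return (recording index, start index) pairs for class-balanced sequences.

    labels_per_recording: list of 1D integer arrays, one label per window.
    Every recording is assumed to hold at least `n_windows` windows.
    """
    rng = np.random.default_rng(seed)
    sequences = []
    for _ in range(n_sequences):
        # 1. Pick a recording uniformly at random.
        rec = int(rng.integers(len(labels_per_recording)))
        labels = np.asarray(labels_per_recording[rec])
        # 2. Pick a class uniformly among those present in this recording
        #    (assumption made for this sketch).
        cls = rng.choice(np.unique(labels))
        # 3. Pick one window of that class as the anchor.
        anchor = int(rng.choice(np.flatnonzero(labels == cls)))
        # 4. Pick where the anchor sits inside the sequence.
        pos = int(rng.integers(n_windows))
        # 5. Clip the start so the sequence stays inside the recording.
        start = int(np.clip(anchor - pos, 0, len(labels) - n_windows))
        sequences.append((rec, start))
    return sequences
```

Clipping the start index keeps the anchor window inside the sequence even when it lies near the beginning or end of a recording.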

@agramfort @hubertjb @l-omar-chehab

agramfort

I am not sure where the balanced sampling is done.

Have you tried this on a real problem / dataset already?

If not, I would suggest you do it first to see how well it works. Thanks @tgnassou

docs/whats_new.rst

docs/api.rst

braindecode/samplers/base.py

tgnassou · 2021-07-07T07:12:15Z

I tried my function on a dataset. It returns the sampling I want, but I'm not sure it is what we want.

I choose a random subject, then a random class (every class has the same probability of being chosen), and I create the sequence around a window of that class. Because all the classes have the same probability, I thought it was balanced.
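One way to check where the balancing happens is to count labels empirically. The sketch below is an illustration only: `count_labels` and the toy recording are hypothetical, not part of the PR. It counts the label of the anchor window separately from the labels of all windows covered by each sequence, since only the anchor is drawn with equal class probability.

```python
import random
from collections import Counter


def count_labels(labels, n_windows, n_draws, seed=0):
    """Count anchor labels and all labels covered by the sampled sequences."""
    rng = random.Random(seed)
    classes = sorted(set(labels))
    by_class = {c: [i for i, y in enumerate(labels) if y == c] for c in classes}
    anchors, covered = Counter(), Counter()
    for _ in range(n_draws):
        cls = rng.choice(classes)            # uniform over classes
        anchor = rng.choice(by_class[cls])   # uniform within the class
        pos = rng.randrange(n_windows)       # anchor position in the sequence
        # Keep the sequence inside the recording.
        start = min(max(anchor - pos, 0), len(labels) - n_windows)
        anchors[cls] += 1
        covered.update(labels[start:start + n_windows])
    return anchors, covered


labels = [0] * 90 + [1] * 10  # hypothetical imbalanced recording
anchors, covered = count_labels(labels, n_windows=5, n_draws=2000)
print("anchor labels: ", dict(anchors))
print("covered labels:", dict(covered))
```

Comparing the two counters shows whether the balance of the anchors carries over to the whole sequences for a given dataset and sequence length.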

agramfort · 2021-07-07T13:34:20Z

I mean, does it enable you to get better performance on real data? Does it lead to a running time increase when learning? Basically, is it useful according to you, and if so, is it a free lunch in running time?

…

braindecode/samplers/base.py

codecov · 2021-07-08T10:25:28Z

Codecov Report

Merging #295 (9a0b77c) into master (5b826c4) will increase coverage by 0.20%.
The diff coverage is 97.72%.

@@            Coverage Diff             @@
##           master     #295      +/-   ##
==========================================
+ Coverage   80.38%   80.59%   +0.20%     
==========================================
  Files          49       49              
  Lines        3085     3123      +38     
==========================================
+ Hits         2480     2517      +37     
- Misses        605      606       +1

robintibor · 2021-07-08T10:26:13Z

braindecode/samplers/base.py

@@ -81,6 +87,10 @@ def __iter__(self):
    def n_recordings(self):
        return self.info.shape[0]

+    @property


Might it actually be safer to supply n_classes explicitly? Otherwise you may run into problems:

- if not all classes are present in some subset (imagine a small evaluation subset in a debugging experiment)
- if targets are stored somewhere else, as in "Discrete and synchronized targets support" #261, this may not work.

The first point is fixed in the latest commit. I'm not sure about the second point though - the sampler requires categorical targets in the BaseConcatDataset's metadata attribute. What use case exactly did you have in mind @robintibor ?
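The first concern can be reproduced with a toy example. This is a sketch only: `infer_n_classes` is a hypothetical helper, not a function from braindecode, and the label lists are made up. It shows how a class count inferred from a small subset can be too small for the labels it contains.

```python
def infer_n_classes(targets):
    """Hypothetical helper: number of distinct labels seen in `targets`."""
    return len(set(targets))


full_targets = [0, 1, 2, 3, 4, 2, 1, 0]  # all five classes present
debug_subset = [0, 2, 2, 0]              # small subset, classes 1, 3, 4 missing

n_inferred = infer_n_classes(debug_subset)  # only two distinct labels seen
per_class_counts = [0] * n_inferred
try:
    for y in debug_subset:
        per_class_counts[y] += 1  # label 2 does not fit in a list of length 2
    failed = False
except IndexError:
    failed = True

# Passing the number of classes explicitly avoids the mismatch.
n_classes = 5
explicit_counts = [0] * n_classes
for y in debug_subset:
    explicit_counts[y] += 1
```

With the explicit value, classes absent from the subset simply keep a count of zero.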

hubertjb · 2021-07-24T04:36:38Z

I added tests and uniformized the implementation so it follows the other samplers' logic more closely. Next step is to test #282 with this sampler and make sure we get better results than with naive sampling of sequences.

robintibor · 2021-08-09T13:35:40Z

Maybe rebase this on #319, then the tests should be fine. Is there more to do before this is ready to merge, @hubertjb?

hubertjb · 2021-08-09T17:56:55Z

Thanks @robintibor ! The validation of the USleep architecture might lead to some minor changes, but otherwise I think this is ready to be merged.

robintibor · 2021-08-09T22:58:39Z

Great, merged!

agramfort reviewed Jul 7, 2021

robintibor reviewed Jul 8, 2021

robintibor reviewed Jul 8, 2021

robintibor reviewed Jul 8, 2021

hubertjb force-pushed the random-sampler branch from 7f212f9 to 5d10e92 Compare July 24, 2021 04:33

hubertjb marked this pull request as ready for review July 24, 2021 04:33

tgnassou and others added 14 commits August 9, 2021 13:50

update whats new

87e87a6

add RandomSampler

24f3e78

update init

f8a6d2e

update api

dbbc7ae

Update docs/whats_new.rst

27dbf3f

Co-authored-by: Alexandre Gramfort <alexandre.gramfort@m4x.org>

Update braindecode/samplers/base.py

b0562c6

Co-authored-by: Alexandre Gramfort <alexandre.gramfort@m4x.org>

Update braindecode/samplers/base.py

ad98da8

Co-authored-by: Alexandre Gramfort <alexandre.gramfort@m4x.org>

Update docs/api.rst

37424ac

Co-authored-by: Alexandre Gramfort <alexandre.gramfort@m4x.org>

Update braindecode/samplers/base.py

4168608

Co-authored-by: robintibor <robintibor@gmail.com>

ENH: improve comprehension of position windows choice

986afb8

fix: fix docstring and adapt the code for more general recording (not…

10ccbb7

… only 5 classes)

fix flake8

094a9fc

- handling classes on a recording basis

88bef41

- modifying _init_info to check for required keys in metadata when creating info dataframe - reviewing computation of start_ind - improving docstrings - adding tests

adding support for metadata with only one category

9a0b77c

hubertjb force-pushed the random-sampler branch from 69734db to 9a0b77c Compare August 9, 2021 17:55

hubertjb changed the title ~~[WIP] Random sampler~~ [MRG] Random sampler Aug 9, 2021

robintibor merged commit 5266556 into braindecode:master Aug 9, 2021
