Integrating BrainDecode in MOABB benchmarking pipelines #46

sylvchev · 2020-01-17T14:06:45Z

Using the scikit-learn API, BrainDecode could be used as a regular scikit-learn pipeline and benchmarked against other classical BCI pipelines.

robintibor · 2021-06-15T08:42:07Z

What is the status of this? Is it still planned? @sylvchev

sylvchev · 2021-06-15T11:20:03Z

It is still planned.
It is possible to make a transformer that uses create_windows_from_events to transform the raw epochs served by MOABB into windows for the classifier. I'll try to make a minimal example in the next few weeks.

Div12345 · 2021-10-27T10:08:34Z

@agramfort continuing the discussion from the PR:
This is everything that has been done regarding the MOABB_Benchmark so far - Link - Due to RAM limitations, I haven't been able to run some of them yet.
I think these are the points to consider before doing the benchmarking -

Which datasets to benchmark on - Either the MOABB datasets, with the transforming structure @sylvchev proposed above, or the datasets that Braindecode itself supports (Sleep Physionet, TUH, TUHAbnormal).
If using the Braindecode datasets with MOABB to add to the benchmark, some more things may have to be defined, such as the paradigm, since these are referred to in the MOABB evaluation calls.
The hyperparameters, optimizers, and number of epochs to benchmark.
Which metrics to report in the end. We could potentially use WANDB for logging and reporting the results if we also need the learning curves, etc. to be logged. It also logs the time taken in data loading, etc., if that is of any use.

agramfort · 2021-10-27T19:55:44Z

I cannot invest time to adapt your code to our computer cluster. We don't work with notebooks in the team to run batch trainings. I would need 2 scripts:
- 1 that downloads and puts the files in one place
- 1 that runs the code
The Readme should explain to me what versions of the packages to use.

Div12345 · 2021-10-28T17:04:21Z

Sure, I should be able to do that. To add the braindecode models should I make it such that they test on the existing datasets of moabb? And should I add a run for benchmarking the sleep models on the physionet sleep dataset?

Div12345 · 2021-10-30T14:08:41Z

I was attempting to run a Braindecode model in MOABB's evaluation and ran into an error while fitting the pipeline/classifier.
Link to the cell with the error - Link

Error -

TypeError                                 Traceback (most recent call last)

/usr/local/lib/python3.7/dist-packages/sklearn/base.py in clone(estimator, safe)
     80     for name, param in new_object_params.items():
     81         new_object_params[name] = clone(param, safe=False)
---> 82     new_object = klass(**new_object_params)
     83     params_set = new_object.get_params(deep=False)
     84 

TypeError: __init__() got an unexpected keyword argument 'on_train'

Is it something in the way I'm doing it?
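For readers hitting the same traceback: the failing line is `klass(**new_object_params)`, so the error means the parameters collected from the estimator included a key that its `__init__` does not accept. The sketch below is only an illustration of that mechanism in plain Python. `simple_clone`, `GoodEstimator` and `LeakyEstimator` are hypothetical names, and this is not the real `sklearn.base.clone` nor the braindecode code that triggered the error.

```python
# Simplified stand-in for the step shown in the traceback:
# collect the parameters, then rebuild the object from them.
def simple_clone(estimator):
    params = estimator.get_params()
    return type(estimator)(**params)


class GoodEstimator:
    """get_params() only reports arguments that __init__ accepts."""

    def __init__(self, lr=0.01):
        self.lr = lr

    def get_params(self):
        return {"lr": self.lr}


class LeakyEstimator(GoodEstimator):
    """Hypothetical estimator whose get_params() reports an extra key."""

    def get_params(self):
        params = super().get_params()
        params["on_train"] = None  # not an __init__ argument
        return params


# Cloning works when parameters and __init__ agree.
copy = simple_clone(GoodEstimator(lr=0.1))
print(copy.lr)

# Cloning fails with a TypeError when they do not.
try:
    simple_clone(LeakyEstimator())
    clone_error = None
except TypeError as exc:
    clone_error = str(exc)
print(clone_error)
```

If this is the cause, the fix belongs in the estimator (its reported parameters must match its constructor), not in the calling pipeline.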

robintibor · 2021-10-30T17:03:17Z

@gemeinl we solved this right? or is this a different problem? @Div12345 do you use current master?

Div12345 · 2021-10-30T20:36:08Z

Ah yes, I was using the PyPI version; using the master version solved that. I was able to create the dataset and pass it to the classifier, but I'm getting a weird error in on_epoch_end, where braindecode seems to check the skorch version. After I got this error with the PyPI version of skorch, I tried the dev version and got the same error with both.
Link to error cell on Colab Notebook - Link
The check -

/usr/local/lib/python3.7/dist-packages/braindecode/training/scoring.py in on_epoch_end(self, net, dataset_train, dataset_valid, **kwargs)
    356                 batch_X, batch_y = unpack_data(batch)
    357                 # TODO: remove after skorch 0.10 release
--> 358                 if not check_version('skorch', min_version='0.10.1'):
    359                     yp = net.evaluation_step(batch_X, training=False)
    360                 # X, y unpacking has been pushed downstream in skorch 0.10

and the error is raised during the version comparison -


/usr/lib/python3.7/distutils/version.py in _cmp(self, other)
    335         if self.version == other.version:
    336             return 0
--> 337         if self.version < other.version:
    338             return -1
    339         if self.version > other.version:

TypeError: '<' not supported between instances of 'str' and 'int'

Don't mind the printed stuff on the Colab notebook cell output above this error, that was just something for debugging and is not relevant to the error.
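The `TypeError` above can be reproduced without skorch or braindecode. The sketch below imitates, in simplified form, how a loose version parser splits a string into components and converts only the numeric ones to `int`. `loose_components` is a hypothetical helper written for this illustration, not the `distutils` implementation. A non-numeric version string then yields string components that cannot be ordered against the integers of the required version.

```python
import re


def loose_components(vstring):
    """Split a version string into components, turning digit runs into ints.

    Simplified imitation of loose version parsing, for illustration only.
    """
    parts = [p for p in re.split(r"(\d+|[a-z]+|\.)", vstring) if p and p != "."]
    return [int(p) if p.isdigit() else p for p in parts]


installed = loose_components("n/a")     # only string components
required = loose_components("0.10.1")   # only integer components
print(installed, required)

# Comparing the two lists compares 'n' with 0, which Python 3 refuses.
try:
    installed < required
    comparison_error = None
except TypeError as exc:
    comparison_error = str(exc)
print(comparison_error)
```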

robintibor · 2021-10-30T21:43:06Z

So it seems skorch returns version 'n/a' for some reason the way it was installed in your linked colab:

Seems this happens here:
https://github.com/skorch-dev/skorch/blob/2d06cd70896d01c948eceec90dff84f2c9990a6a/skorch/__init__.py#L46-L49

It is unclear to me exactly why it happens, but we could fix it on our side by changing:

braindecode/braindecode/training/scoring.py

Line 358 in dba19a7

if not check_version('skorch', min_version='0.10.1'):

to

                if (skorch.__version__ != 'n/a') and (not check_version('skorch', min_version='0.10.1')):

(basically assuming n/a means we are up to date with skorch, maybe good to put a comment there)
Do you want to test this change on your side and make a PR if it works @Div12345 ?

Div12345 · 2021-11-02T05:46:29Z

@robintibor I checked again today with the 0.11.0 PyPI version of skorch that was pushed 2 days back and the error doesn't occur anymore.

gemeinl · 2021-11-02T10:33:50Z

> @gemeinl we solved this right? or is this a different problem? @Div12345 do you use current master?

I was not aware that this was the same problem as in #347. So yes, we are now compatible with scikit-learn.

robintibor · 2021-11-02T13:09:19Z

well @Div12345 yes I think this can only happen with the master version of skorch, but maybe we still want to be compatible in any case?

Div12345 · 2021-11-04T09:48:07Z

@robintibor I'll make the check for the other versions, but is it safe to assume that 'n/a' implies that skorch is up to date? Couldn't it just mean that the version lookup failed somewhere and the version is unknown? In that case, we could at least emit a warning that the version is uncertain, so that if some error occurs people can understand the probable cause more easily.
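A minimal sketch of the warning idea, assuming a three-way result: `True` or `False` when the installed version can be parsed, and `None` plus a warning when it cannot (for example `'n/a'`). `parse_release` and `has_min_version` are hypothetical helpers for illustration. They are not part of braindecode or skorch, and the parsing only looks at the leading numeric part of the version string.

```python
import re
import warnings


def parse_release(vstring):
    """Return a tuple of ints for strings like '0.10.1', or None if unparsable."""
    match = re.match(r"^(\d+(?:\.\d+)*)", vstring)
    if match is None:
        return None
    return tuple(int(part) for part in match.group(1).split("."))


def has_min_version(installed, minimum):
    """Hypothetical helper: True/False if `installed` parses, None otherwise.

    `minimum` is assumed to be a well-formed dotted version such as '0.10.1'.
    """
    current = parse_release(installed)
    if current is None:
        warnings.warn(
            f"Could not determine the installed version from {installed!r}; "
            f"assuming it is at least {minimum}.",
            RuntimeWarning,
        )
        return None
    return current >= parse_release(minimum)


print(has_min_version("0.11.0", "0.10.1"))
print(has_min_version("0.9.0", "0.10.1"))
```

A caller would take the legacy code path only when the result `is False`, so an unknown version keeps the behaviour proposed above (treated as up to date) while still telling the user that the check was inconclusive.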

bruAristimunha · 2022-09-05T04:42:57Z

Hello guys!

I am trying to close open issues in braindecode, and I spent some time on this issue. I think I have found the solution to this issue.

Based on the @Div12345 code, I made a minimal viable code that integrates Moabb and Braindecode. It still needs to be well optimized! The code is below.

import os.path as osp

import matplotlib.pyplot as plt
import mne
import seaborn as sns
import torch
from braindecode import EEGClassifier
from braindecode.datasets import create_from_X_y
from braindecode.models import ShallowFBCSPNet
from braindecode.util import set_random_seeds
from moabb.datasets import BNCI2014001
from moabb.evaluations import WithinSessionEvaluation
from moabb.paradigms import LeftRightImagery
from moabb.utils import set_download_dir
from numpy import unique
from sklearn.base import BaseEstimator, ClassifierMixin, TransformerMixin
from sklearn.pipeline import Pipeline
from skorch.callbacks import LRScheduler

set_download_dir(osp.join(osp.expanduser("~"), "mne_data"))

cuda = (
    torch.cuda.is_available()
)  # check if GPU is available, if True chooses to use it
device = "cuda" if cuda else "cpu"
if cuda:
    torch.backends.cudnn.benchmark = True
seed = 20200220  # random seed to make results reproducible
# Set random seed to be able to reproduce results
set_random_seeds(seed=seed, cuda=cuda)

n_classes = 2

# hard-coded for now
n_chans = 22
input_window_samples = 1001

model = ShallowFBCSPNet(
    n_chans,
    n_classes,
    input_window_samples=input_window_samples,
    final_conv_length="auto",
)

# Send model to GPU
if cuda:
    model.cuda()

# These values we found good for shallow network:
lr = 0.0625 * 0.01
weight_decay = 0

batch_size = 64
n_epochs = 4

clf = EEGClassifier(
    model,
    criterion=torch.nn.NLLLoss,
    optimizer=torch.optim.AdamW,
    train_split=None,  # using valid_set for validation
    optimizer__lr=lr,
    optimizer__weight_decay=weight_decay,
    batch_size=batch_size,
    callbacks=[
        "accuracy",
        ("lr_scheduler", LRScheduler("CosineAnnealingLR", T_max=n_epochs - 1)),
    ],
    device=device,
)


class Transformer(BaseEstimator, TransformerMixin):
    def __init__(self, kw_args=None):
        self.kw_args = kw_args

    def fit(self, X, y=None):
        self.y = y
        return self

    def transform(self, X, y=None):
        dataset = create_from_X_y(
            X.get_data(),
            y=self.y,
            window_size_samples=X.get_data().shape[2],
            window_stride_samples=X.get_data().shape[2],
            drop_last_window=False,
            sfreq=X.info["sfreq"],
        )

        return dataset

    def __sklearn_is_fitted__(self):
        """Return True since Transformer is stateless."""
        return True


class ClassifierModel(BaseEstimator, ClassifierMixin):
    def __init__(self, clf, kw_args=None):
        self.clf = clf
        self.classes_ = None
        self.kw_args = kw_args

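    def fit(self, X, y=None):
        # Forward extra fit arguments (e.g. epochs) to the wrapped classifier;
        # fall back to no extra arguments when kw_args is None
        self.clf.fit(X, y=y, **(self.kw_args or {}))
        self.classes_ = unique(y)

        # scikit-learn convention: fit returns the estimator itself
        return self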

    def predict(self, X):
        return self.clf.predict(X)

    def predict_proba(self, X):
        return self.clf.predict_proba(X)


create_dataset = Transformer()
fit_params = {"epochs": 10}

brain_clf = ClassifierModel(clf, fit_params)

# from functools import partial
# clf.fit = partial(clf.fit, epochs=10)

pipe = Pipeline([("Braindecode_dataset", create_dataset), ("Net", brain_clf)])
print(pipe)
pipes = {}
pipes["ShallowFBCSPNet"] = pipe

mne.set_log_level(False)

# Define Evaluation
paradigm = LeftRightImagery()
# Because this is being auto-generated we only use 2 subjects
dataset = BNCI2014001()
dataset.subject_list = dataset.subject_list[:2]
datasets = [dataset]
overwrite = True  # set to True if we want to overwrite cached results
evaluation = WithinSessionEvaluation(
    paradigm=paradigm,
    datasets=datasets,
    suffix="braindecode_example",
    overwrite=overwrite,
    return_epochs=True,
)

results = evaluation.process(pipes)

print(results.head())

##############################################################################
# Plot Results
# ----------------
#
# Here we plot the results. The first plot is a pointplot with the average
# performance of each pipeline across sessions and subjects.
# The second plot is a paired scatter plot. Each point represents the score
# of a single session. An algorithm outperforms another if most of the
# points are in its quadrant.

fig, axes = plt.subplots(1, 2, figsize=[8, 4], sharey=True)

sns.stripplot(
    data=results,
    y="score",
    x="pipeline",
    ax=axes[0],
    jitter=True,
    alpha=0.5,
    zorder=1,
    palette="Set1",
)
sns.pointplot(
    data=results, y="score", x="pipeline", ax=axes[0], zorder=1, palette="Set1"
)

axes[0].set_ylabel("ROC AUC")
axes[0].set_ylim(0.5, 1)

# paired plot
paired = results.pivot_table(
    values="score", columns="pipeline", index=["subject", "session"]
)
paired = paired.reset_index()

axes[1].plot([0, 1], [0, 1], ls="--", c="k")
axes[1].set_xlim(0.5, 1)

plt.show()

How do you want to proceed, @sylvchev?

sylvchev · 2022-09-05T07:51:44Z

Thanks for this nice example. I'm working with @pangolinMagique on this integration. He just found a way to integrate braindecode using only epochs from MOABB, as you have. One central question is: does MOABB need to provide access to the raw signal, or are epochs enough to get good performance?

agramfort · 2022-09-05T08:12:47Z

feel free to reopen if you feel we should discuss more.

bruAristimunha · 2022-09-05T08:50:14Z

Hello @sylvchev and @agramfort,

I feel this is a parameter search.

Currently, I am thinking about how to coordinate the team to code this integration (@agramfort, @bruAristimunha, @sylvchev and @pangolinMagique). We can conduct a small study, maybe reproducing Robin's results with moabb and braindecode.

I was wondering, shall we continue this topic via e-mail or braindecode chat instead of here? I think that coordinating it escapes the context of the issue.

agramfort · 2022-09-05T08:57:57Z

I prefer if conversations are in the open but we can maybe zoom at some point to simplify things

bruAristimunha · 2022-09-05T09:01:31Z

No problem, we'll solve it here then.

sylvchev · 2022-09-05T10:52:20Z

Yes, good idea, it will be more efficient

sylvchev · 2023-02-06T21:22:30Z

We now have the possibility to expose raw data to train braindecode models in MOABB (see NeuroTechX/moabb#302).
We are working to integrate some deep learning pipelines in the benchmark; we will push them here: NeuroTechX/moabb#326

agramfort · 2023-02-09T08:57:32Z

@bruAristimunha I think that @sylvchev is much more competent to review here.

I trust your judgement. Feel free to merge if happy.

@sylvchev you have the green buttons on both repos :)

sylvchev · 2023-02-09T09:18:34Z

Thanks @agramfort

bruAristimunha · 2023-08-18T07:54:21Z

Closed with: http://moabb.neurotechx.com/docs/auto_examples/plot_benchmark_braindecode.html#sphx-glr-auto-examples-plot-benchmark-braindecode-py
http://moabb.neurotechx.com/docs/auto_examples/load_model.html#sphx-glr-auto-examples-load-model-py

sylvchev added the enhancement New feature or request label Jan 17, 2020

sylvchev self-assigned this Jan 17, 2020

Div12345 mentioned this issue Sep 3, 2021

Creating a Global Benchmarking Pipeline and Results Page NeuroTechX/moabb#190

Open

Div12345 mentioned this issue Oct 26, 2021

Adding Sleep Models #341

Merged

Div12345 mentioned this issue Nov 2, 2021

Ability for custom scorer in Evaluations NeuroTechX/moabb#250

Closed

bruAristimunha added this to the 0.7 milestone Sep 23, 2022

bruAristimunha mentioned this issue Feb 9, 2023

Adding Braindecode pipeline NeuroTechX/moabb#328

Merged

bruAristimunha closed this as completed Aug 18, 2023
