DemonsP300 has only targets in the first third of data #216

jsosulski · 2021-07-09T06:21:29Z

See this plot for one subject; all subjects look the same:

The x-axis is the epoch index and the y-axis is the label of that epoch.

sylvchev · 2021-07-10T07:52:04Z

Thanks!
@v-goncharenko do you know if there is a data loading problem?

v-goncharenko · 2021-07-15T08:32:23Z

The rest of the ground-truth labels are in a .csv file. Look at the code; they are read there.

Also, it's a good idea not to read the raw unstructured data directly, but to use the final class (which is an abstraction between the internal format and the common one).

jsosulski · 2021-07-15T08:44:37Z

So is this an issue with the dataset implementation?

See MWE using current moabb version on pypi:

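from matplotlib import pyplot as plt
from moabb.datasets import DemonsP300
from moabb.paradigms import P300

paradigm = P300()

dset = DemonsP300()
subject = 0

# Load epochs, labels and metadata through the standard paradigm interface
X, label, meta = paradigm.get_data(dset, [subject])

# Plot the label of each epoch against its epoch index
plt.plot(label)
plt.show()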

This produces the plot in the first post, and it uses the default moabb way of loading data.
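The same observation can be put into numbers instead of a plot. The following is a minimal sketch, not part of moabb: `target_fraction_per_chunk` is a hypothetical helper, and the label string "Target" is an assumption about how the P300 paradigm names the target class. It splits the label sequence into equal chunks and reports the fraction of targets in each. The demo uses synthetic labels, not DemonsP300 data.

```python
import numpy as np


def target_fraction_per_chunk(labels, target="Target", n_chunks=3):
    """Return the fraction of target epochs in each consecutive chunk.

    `target` is an assumed label name; adjust it to the labels actually returned.
    """
    is_target = np.asarray(labels) == target
    # array_split tolerates lengths that are not divisible by n_chunks
    chunks = np.array_split(is_target, n_chunks)
    return [float(c.mean()) if c.size else float("nan") for c in chunks]


# Synthetic labels mimicking the reported pattern: targets only in the first third
demo = np.array(["Target", "NonTarget"] * 3 + ["NonTarget"] * 12)
fractions = target_fraction_per_chunk(demo)
print(fractions)
```

With real data, the `label` array returned by `paradigm.get_data` could be passed in place of `demo`. A fraction of zero in the later chunks would match the plot.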

jsosulski · 2021-12-01T08:58:05Z

I have noticed that MOABB is referenced in more and more literature (yay!), although most authors only use it for dataset acquisition. That is still a win, I guess, until we have a centralized system for running classification. However, should we start tagging datasets that we have vetted as working correctly? New MOABB users could then use them as intended, in a fire&forget way.

See e.g. this issue or the fixed #96. A new user who just wants to check their classifier's performance on X, y data probably does not want to dig deep into the underlying datasets and check that everything does what it should. Currently, the documentation gives no hint that there are issues with this dataset.

I could offer to clean up the sanity check script (#184), commit it to moabb, and run it locally for all available P300 datasets, as I am most experienced with ERP data.

sylvchev · 2021-12-03T08:57:11Z

Good for the citations ;) I tried to add the papers I found referencing MOABB to this wiki page; it could be useful soon. Feel free to add some papers if you have the time.

I agree that the datasets should be verified, and this issue has been open for quite some time. I'm trying to improve the documentation by adding more information on the datasets. As a ground truth, I maintain a wiki page with metadata about the datasets that is useful for ML. As you suggest, we could at a minimum include references to the issues that are open for each dataset.

The best would be to ensure that all datasets are OK before adding them, and your sanity check script could really help. Running it could be one of the required steps before adding a dataset. If you could run it on the P300 datasets, that would be nice. Could someone help with checking the existing MOABB datasets in MI and SSVEP? @Div12345 @ErikBjare @v-goncharenko (I could help)

sylvchev · 2022-03-02T10:47:06Z

This issue is stalling, and users could use DemonsP300 without knowing that there is a problem. We could add a warning when the dataset is loaded that references this issue, and we could update the documentation as well.
If the problem with this dataset cannot be fixed, we may have to deprecate it.
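One way such a load-time warning could look is sketched below, using only the standard library `warnings` module. This is an illustration and not moabb code: the `KNOWN_ISSUES` mapping and the `warn_if_known_issue` helper are hypothetical names, and where such a call would be hooked into dataset loading is left open.

```python
import warnings

# Hypothetical registry mapping dataset names to a short note on an open issue
KNOWN_ISSUES = {
    "DemonsP300": "targets appear only in the first third of the epochs, see issue #216",
}


def warn_if_known_issue(dataset_name):
    """Emit a UserWarning if the dataset has a registered open issue.

    Returns True if a warning was emitted, False otherwise.
    """
    note = KNOWN_ISSUES.get(dataset_name)
    if note is None:
        return False
    # stacklevel=2 points the warning at the caller rather than at this helper
    warnings.warn(f"{dataset_name}: {note}", UserWarning, stacklevel=2)
    return True
```

Keeping the notes in a single mapping would also allow the documentation to list open issues per dataset from the same source.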

jsosulski assigned v-goncharenko Jul 9, 2021

sylvchev added the bug label Jul 31, 2021

sylvchev mentioned this issue Jan 21, 2022

Creating a Global Benchmarking Pipeline and Results Page #190

Open

Div12345 added this to Datasets in Benchmarking paper Jan 21, 2022

jsosulski mentioned this issue Feb 2, 2022

Visualize all ERP datasets #261

Merged

sylvchev moved this from Datasets to Maybe Bugs in Benchmarking paper Nov 21, 2022
