Add new dataset class using `zarr` file storage. #478

jpaillard · 2023-09-03T08:34:08Z

Pull Request Description

This PR introduces a new dataset implementation aimed at improving dataset loading speed. Additionally, it includes a BCI decoding example in a Colab notebook for reference: Link to Colab Example.

Notes

Considered Alternatives: During the development process, we explored potential alternatives like numpy.memmap and lmdb. However, we encountered limitations with numpy.memmap, particularly in terms of lacking compression options. LMDB, while efficient, isn't well-suited for lazy loading, useful for querying windows from recordings.
HDF5 vs. Zarr Debate: There's an ongoing debate between HDF5 and Zarr. Both have their respective advantages, but Zarr appears to have more active development. We might also allow users to choose between both options.

braindecode/datasets/base.py

sylvchev · 2023-09-08T09:36:25Z

braindecode/datasets/base.py

+            self,
+            train_indices=None,
+            test_indices=None,
+    ):


Could you add docstring?

environment.yml

Co-authored-by: Sylvain Chevallier <sylvain.chevallier@universite-paris-saclay.fr>

codecov · 2023-09-16T08:39:24Z

Codecov Report

Merging #478 (0571806) into master (64b8e38) will decrease coverage by 1.10%.
The diff coverage is 25.28%.

@@            Coverage Diff             @@
##           master     #478      +/-   ##
==========================================
- Coverage   84.55%   83.46%   -1.10%     
==========================================
  Files          63       63              
  Lines        4676     4760      +84     
==========================================
+ Hits         3954     3973      +19     
- Misses        722      787      +65

robintibor · 2024-05-27T14:59:18Z

As we found current solution to be performant enough, closing this for now.

jpaillard added 3 commits September 3, 2023 10:18

Add new dataset class using zarr file storage.

d134a44

requirements and code format

3abb1b4

requirements for pytest

51d1f23

bruAristimunha mentioned this pull request Sep 4, 2023

Creating a BD-compatible dataset without loading X fully into memory #445

Open

sylvchev reviewed Sep 8, 2023

View reviewed changes

jpaillard and others added 14 commits September 16, 2023 09:20

Update braindecode/datasets/base.py

1ab21a8

Co-authored-by: Sylvain Chevallier <sylvain.chevallier@universite-paris-saclay.fr>

Update braindecode/datasets/base.py

0a67b91

Co-authored-by: Sylvain Chevallier <sylvain.chevallier@universite-paris-saclay.fr>

Update braindecode/datasets/base.py

e6d9078

Co-authored-by: Sylvain Chevallier <sylvain.chevallier@universite-paris-saclay.fr>

Update braindecode/datasets/base.py

a663b70

Co-authored-by: Sylvain Chevallier <sylvain.chevallier@universite-paris-saclay.fr>

Update braindecode/datasets/base.py

49585ee

Co-authored-by: Sylvain Chevallier <sylvain.chevallier@universite-paris-saclay.fr>

Update braindecode/datasets/base.py

6f7abee

Co-authored-by: Sylvain Chevallier <sylvain.chevallier@universite-paris-saclay.fr>

Update braindecode/datasets/base.py

10ef0c9

Co-authored-by: Sylvain Chevallier <sylvain.chevallier@universite-paris-saclay.fr>

revert changes

0c9f2c8

revert changes 1

dcf615a

Add new dataset class using zarr file storage.

fa9eed7

Merge remote-tracking branch 'origin/detached'

89dc424

docstring and revert

0d92bb8

update whats new

a4d4906

whitespace

0571806

robintibor closed this May 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add new dataset class using `zarr` file storage. #478

Add new dataset class using `zarr` file storage. #478

jpaillard commented Sep 3, 2023

sylvchev Sep 8, 2023

jpaillard Sep 16, 2023

codecov bot commented Sep 16, 2023 •

edited

Loading

robintibor commented May 27, 2024

Add new dataset class using zarr file storage. #478

Add new dataset class using zarr file storage. #478

Conversation

jpaillard commented Sep 3, 2023

Pull Request Description

Notes

sylvchev Sep 8, 2023

Choose a reason for hiding this comment

jpaillard Sep 16, 2023

Choose a reason for hiding this comment

codecov bot commented Sep 16, 2023 • edited Loading

Codecov Report

robintibor commented May 27, 2024

Add new dataset class using `zarr` file storage. #478

Add new dataset class using `zarr` file storage. #478

codecov bot commented Sep 16, 2023 •

edited

Loading