
[Backend Configuration Va] Basic user documentation #802

Merged: 131 commits into main from backend_config_docs on May 6, 2024

Conversation

CodyCBakerPhD (Member):

fix #797

@CodyCBakerPhD CodyCBakerPhD self-assigned this Apr 2, 2024

And likewise for ``AVAILABLE_ZARR_COMPRESSION_METHODS``.

We can confirm these values are saved by re-printing that particular dataset configuration...
Contributor:

Suggested change:
- We can confirm these values are saved by re-printing that particular dataset configuration...
+ We can confirm these values are saved by re-printing that particular dataset configuration:

It's not clear from this that these configurations made it back into the larger object. It's also not clear how you would use this configuration to modify an in-memory NWBFile object. It would also be great to write this file and then show that the written file followed the chunking and compression parameters we chose. This could be done using HDFView or some other app or API.
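(For concreteness, a minimal sketch of the flow being asked for here, assuming the ``get_default_backend_configuration`` and ``configure_backend`` helpers from ``neuroconv.tools.nwb_helpers`` and an already-built in-memory ``nwbfile``; the dataset location key is hypothetical:)

# Sketch only: customize one dataset's configuration, apply it to the
# in-memory NWBFile, and write the file to disk.
from neuroconv.tools.nwb_helpers import configure_backend, get_default_backend_configuration
from pynwb import NWBHDF5IO

backend_configuration = get_default_backend_configuration(nwbfile=nwbfile, backend="hdf5")

# Hypothetical dataset location key; use one from your own configuration printout.
dataset_configuration = backend_configuration.dataset_configurations["acquisition/TimeSeries/data"]
dataset_configuration.chunk_shape = (1_000, 8)
dataset_configuration.compression_method = "gzip"

# Apply the configuration back onto the in-memory NWBFile, then write it.
configure_backend(nwbfile=nwbfile, backend_configuration=backend_configuration)
with NWBHDF5IO("example.nwb", mode="w") as io:
    io.write(nwbfile)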

Member Author:

> It's not clear from this that these configurations made it back into the larger object.

This was obscured by the odd injection of all the notes; with those moved, I put the two printouts side by side and added some spacing.

[screenshot: the two dataset configuration printouts shown side by side]

> It would also be great to write this file and then show that the written file followed the chunking and compression parameters we chose.

The point the demo makes here is just that the values get saved in the backend_configuration object itself. I can add something to prove that the file is actually written that way (for chunks and compression, anyway; buffering can't be proven that way).

Though I will probably just demonstrate it with h5py, since mentioning/showcasing HDFView could lead to annoyance: if a user wanted to try it, they'd need to sign up for an HDF Group account. Neurosift doesn't display this information either, though I'm sure you could just request that. We could also use the PyNWB HTML repr after hdmf-dev/hdmf#1100 is merged, but this section is currently for Python script / IPython usage, not specific to a Jupyter notebook.
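(A minimal h5py check along those lines; the file path and dataset location are placeholders:)

import h5py

# Inspect the written file to confirm the chunking and compression settings.
with h5py.File("example.nwb", mode="r") as file:
    dataset = file["acquisition/TimeSeries/data"]
    print(dataset.chunks)            # e.g., (1000, 8); None means contiguous layout
    print(dataset.compression)       # e.g., 'gzip'; None means no compression
    print(dataset.compression_opts)  # e.g., the gzip level, when applicable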

Member Author:

Added a section to the FAQ on how to check that customizations (or the defaults, for that matter) actually configure the item correctly in the resulting file.

Contributor:

OK great, could you please add a section on writing the data and confirming (somehow) that the data was written with the proper settings?

Member Author:

I did; it's the last section of the FAQ. It uses the file written in the preceding section.

Contributor:

OK, that section of the FAQ looks great, but I still feel like it would be nice to have another snippet here showing how to apply the configs to an in-memory NWBFile object and write it to disk.

Member Author:

I don't understand what you mean here - what are you imagining and how is it different from the preceding section?


**How do I disable chunking and compression completely?**

To completely disable chunking (i.e., 'contiguous' layout), set both ``chunk_shape=None`` and ``compression_method=None``.
Contributor (@bendichter, May 6, 2024):

Suggested change:
- To completely disable chunking (i.e., 'contiguous' layout), set both ``chunk_shape=None`` and ``compression_method=None``.
+ To completely disable chunking for HDF5 backends (i.e., 'contiguous' layout), set both ``chunk_shape=None`` and ``compression_method=None``. Zarr requires all datasets to be chunked.
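(As a sketch of the suggested text in practice, reusing the ``backend_configuration`` object and the hypothetical dataset location key from the earlier snippet:)

# HDF5 backend only: setting both fields to None yields a contiguous,
# uncompressed dataset layout.
dataset_configuration = backend_configuration.dataset_configurations["acquisition/TimeSeries/data"]
dataset_configuration.chunk_shape = None         # contiguous layout
dataset_configuration.compression_method = None  # no compression filter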

Contributor:

Maybe not that important, and this may be what you meant, but you can pass chunks=None to zarr and get all the data in one file:

import numpy as np
import zarr

# Create a Zarr group backed by a directory store
zarr_folder = zarr.open_group('test_zarr.zarr', mode='w')

# Create an array with chunks=None and add it to the Zarr group
array_data = np.random.rand(20, 20, 20)
zarr_array = zarr_folder.create_dataset('array', data=array_data, chunks=None)
zarr_array.info
Name               : /array
Type               : zarr.core.Array
Data type          : float64
Shape              : (20, 20, 20)
Chunk shape        : (20, 20, 20)
Order              : C
Read-only          : False
Compressor         : Blosc(cname='lz4', clevel=5, shuffle=SHUFFLE, blocksize=0)
Store type         : zarr.storage.DirectoryStore
No. bytes          : 64000 (62.5K)
No. bytes stored   : 56455 (55.1K)
Storage ratio      : 1.1
Chunks initialized : 1/1

Member Author:

I did not check myself (trusted @bendichter on that one).

Our config literally just passes the values on to the downstream calls, so I assume this would work here, in which case we can remove this line as a minor point. (Even though chunks=None is the option, and even though the array reports 1/1 chunks, that's all 'equivalent' to contiguous layout in a sense.)

@bendichter (Contributor):

I haven't had a chance to run through the code. If it all runs as advertised, this looks good to me.

@CodyCBakerPhD (Member Author):

> If it all runs as advertised, this looks good to me.

I ran through it all again, fixed a couple of minor things now that the upstream ecephys change is through, and confirmed everything works as advertised.

We should keep in the back of our minds how to ensure this going forward, however. doctest doesn't really work well for this scenario, since we broke the various steps up for the sake of readability (and doctest needs the variables to be in the same code block). Breaking steps up like that is a style that works well when using a notebook as the doc base (as the PyNWB tutorials do), but we've not done that yet in NeuroConv.

@CodyCBakerPhD CodyCBakerPhD merged commit 247ba16 into main May 6, 2024
25 checks passed
@CodyCBakerPhD CodyCBakerPhD deleted the backend_config_docs branch May 6, 2024 18:23
Linked issue: [Documentation]: add docs for advanced backend support