Computations requiring irregularly-chunked Zarr stores #312

Open
TomNicholas opened this issue Sep 20, 2023 · 2 comments
Labels
enhancement (New feature or request), upstream (Involves changes to an upstream library), zarr

Comments

@TomNicholas
Collaborator

All intermediate results in Cubed are written out to persistent storage via Zarr, but currently Zarr can't represent every possible chunked array, because the Zarr spec does not yet support irregular chunks.
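To make "regular" concrete, here is a minimal sketch of a check on the chunk sizes along one axis, written as a dask-style tuple of lengths. `is_regular_chunking` is a hypothetical helper for illustration only, not part of Zarr or Cubed; it assumes that regular means every chunk has the same length except possibly a shorter final chunk.

```python
def is_regular_chunking(chunks):
    """Return True if the chunk sizes along one axis fit a regular grid.

    `chunks` is a tuple of chunk lengths along a single axis.
    Assumption: "regular" means all chunks share one length, except that
    the final chunk may be shorter.
    """
    if len(chunks) <= 1:
        return True
    first, *rest = chunks
    *middle, last = rest
    return all(c == first for c in middle) and 0 < last <= first


print(is_regular_chunking((4, 4, 2)))        # only the last chunk is shorter
print(is_regular_chunking((1, 4, 4, 2, 1)))  # the first chunk is shorter too
```

Any chunking for which a check like this fails is the kind that cannot currently be written to a single Zarr array without rechunking first.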

This comes up in Cubed when a computation changes the chunking of an array from regularly chunked to irregularly chunked. An important example of such a computation is groupby, see #223.
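As a rough illustration of how a groupby could produce irregular chunks, the sketch below assumes a year of daily data stored in regular 30-day chunks, and assumes each monthly group ends up in its own chunk. Both are assumptions made for this example, not necessarily how #223 approaches it, and `regular_chunks` is a hypothetical helper.

```python
import calendar


def regular_chunks(n, size):
    """Chunk sizes for an axis of length n split into chunks of `size`."""
    full, remainder = divmod(n, size)
    return (size,) * full + ((remainder,) if remainder else ())


year = 2023  # arbitrary example year

# Number of days in each of the 12 months
month_lengths = tuple(calendar.monthrange(year, m)[1] for m in range(1, 13))
n_days = sum(month_lengths)

input_chunks = regular_chunks(n_days, 30)  # regular: all 30 except the last
output_chunks = month_lengths              # one chunk per monthly group

print(input_chunks)
print(output_chunks)
```

The output chunk sizes follow the group sizes, so they vary from chunk to chunk and no single regular chunk length describes them.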

I'm not sure if there are any other examples of array operations that might change regular chunks into irregular ones? np.pad comes to mind?

@TomNicholas added the enhancement, zarr, and upstream labels on Sep 20, 2023
@TomNicholas
Collaborator Author

See discussion on ZEP003

@tomwhite
Member

You could implement pad in Cubed using concatenate, which already exists but has to copy the Zarr arrays to make them regularly-chunked. But if Zarr supported irregular chunking then it could be more efficient.
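The trade-off can be sketched in pure Python, with lists standing in for stored chunks. This is not Cubed's `concatenate`; `pad_by_concatenation` and `rechunk_regular` are hypothetical helpers that only illustrate why padding by concatenation gives irregular chunks, and why regularising them means copying the data.

```python
def pad_by_concatenation(chunks, before, after, fill=0):
    """Pad a 1-D chunked array by adding new edge chunks.

    `chunks` is a list of lists standing in for stored chunks.
    Existing chunks are reused as-is, so none of their data is copied.
    """
    padded = []
    if before:
        padded.append([fill] * before)
    padded.extend(chunks)
    if after:
        padded.append([fill] * after)
    return padded


def rechunk_regular(chunks, size):
    """Copy all data into new chunks of length `size` (last may be shorter)."""
    flat = [x for chunk in chunks for x in chunk]
    return [flat[i:i + size] for i in range(0, len(flat), size)]


original = [[0, 1, 2, 3], [4, 5, 6, 7], [8, 9]]  # chunk sizes (4, 4, 2)

padded = pad_by_concatenation(original, before=1, after=1)
print([len(c) for c in padded])   # chunk sizes after padding

regular = rechunk_regular(padded, size=4)
print([len(c) for c in regular])  # chunk sizes after the copying step
```

In the padded result the original chunks are the very same objects, but the chunk sizes no longer fit a regular grid. Getting back to a regular grid touches every element, which is the copy that irregular chunk support would avoid.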
