-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Generating references without kerchunk #78
Comments
From @sharkinsspatial:
|
I took a quick, first look at Virtualizarr last night (amazing 🎊 , thank you for pushing this forward). I can re-purpose/simplify the existing kerchunk HDF backend to directly support generating A few questions for creating a PR for this.
|
That would be awesome, thanks @sharkinsspatial ❤️ . I think whilst we're doing this we should try hard here to improve test coverage and understanding of behaviour in nasty cases. I'm thinking about #38 in particular.
This is totally equivalent to the
I think what you're suggesting is the least instrusive way. The main thing is to keep code that actually depends on the |
VirtualiZarr + zarr chunk manifests re-implement so much of kerchunk that the only part left is kerchunk's backends - the part that actually generates the byte ranges from a given legacy file. It's interesting to imagine whether we could make virtualizarr work without using kerchunk or fsspec at all.
#61 (comment) discusses how the rust
object-store
crate might allow us to read actual bytes from zarr v3 stores with chunk manifests over S3, without using fsspec.The other place we use fsspec (+ kerchunk) is to generate the references in the first place. But can we imagine alternative implementations for generating that byte range information?
Arguments for doing this without using kerchunk + fsspec are essentially:
The text was updated successfully, but these errors were encountered: