Next iteration on xarray support for intake #2

mmccarty · 2018-04-23T14:35:02Z

Notes on our conversation about the future of this PR/repo (cc @jbcrail , but note that @mmccarty will presumably return to work on this when he has time). I believe the following is a reasonable course of action.

a new container type should be acceptable to Intake, "xarray". The builtin functionality in the source class will be overridden.
We must consider carefully what this means for an xarray opened on an Intake server - presumably communication will be the same as an ndarray, which doesn't yet exist (right?). Note that being able to load netCDF/HDF from a remote location would be a huge boon, and there are servers around doing only that job, because it is so useful - can we make it happen? We would need to create each variable as a dask-array, where any chunk calls the server with its multi-dimensional index, and create a local xarray that stores these dask-arrays in the same arrangement and with the same metadata as remotely.
The natural representation of an open xarray is the open xarray object itself, and that is what discover() should return. Also, the arrays should be chunked from the start, so to_dask() is a no-op on that, and read() should call whatever xarray function it is to materialise the data into memory.
This repo should be renamed intake-xarray, and include three separate plugins: netCDF which opens one or more files (these are separate functions in xarray); rasterIO and zarr. The latter is the only one that can actually directly open files remotely, and we should take care to parse s3:, hdfs:, and gcs: and create the mappers that zarr needs (I'll help with that). It would be nice is an unstructured zarr array returns an xarray data-array as opposed to a dataset, although maybe that should be a separate plugin. Note again, that we have no array readers at all, not even numpy (never mind scientific formats)

mmccarty · 2018-04-23T14:46:51Z

Rename repo - Done!

martindurant · 2018-04-23T14:47:20Z

:)

mmccarty · 2018-05-08T13:29:17Z

@martindurant is this issue resolved with the last 2 PRs #7 and #6

martindurant · 2018-05-08T15:27:57Z

Point 2) is outstanding, but maybe that should be a general issue on Intake main for all ndarrays. I have an idea of how to handle the server, but it's not quite simple.

martindurant · 2018-06-20T21:49:25Z

#9 is pointed at this, but will need changes in Intake too.

martindurant · 2018-11-08T14:50:55Z

Everything here was done

mmccarty mentioned this issue Apr 23, 2018

Addressed feedback. #1

Merged

martindurant closed this as completed Nov 8, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Next iteration on xarray support for intake #2

Next iteration on xarray support for intake #2

mmccarty commented Apr 23, 2018

mmccarty commented Apr 23, 2018

martindurant commented Apr 23, 2018

mmccarty commented May 8, 2018

martindurant commented May 8, 2018

martindurant commented Jun 20, 2018

martindurant commented Nov 8, 2018

Next iteration on xarray support for intake #2

Next iteration on xarray support for intake #2

Comments

mmccarty commented Apr 23, 2018

mmccarty commented Apr 23, 2018

martindurant commented Apr 23, 2018

mmccarty commented May 8, 2018

martindurant commented May 8, 2018

martindurant commented Jun 20, 2018

martindurant commented Nov 8, 2018