You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
xarray 11.3 currently won't read HDF5 file-like objects
importxarrayasxrimportgcsfsfs=gcsfs.GCSFileSystem()
images=fs.ls('pangeo-data/grfn-v2/137/')
fileObj=fs.open('pangeo-data/grfn-v2/137/S1-GUNW-A-R-137-tops-20181129_20181123-020010-43220N_41518N-PP-e2c7-v2_0_0.nc')
# but, can we open this w/ xarray anyway? Yes! with modifications to xarray and h5netcdfda=xr.open_dataset(fileObj, group='/science/grids/data', engine='h5netcdf')
da
---------------------------------------------------------------------------
ValueError Traceback (most recent call last)
<ipython-input-3-22e0010de1f2> in <module>()
1# but, can we open this w/ xarray anyway? Yes! with modifications to xarray and h5netcdf
----> 2 da = xr.open_dataset(fileObj, group='/science/grids/data', engine='h5netcdf')
3 da
/srv/conda/lib/python3.6/site-packages/xarray/backends/api.py in open_dataset(filename_or_obj, group, decode_cf, mask_and_scale, decode_times, autoclose, concat_characters, decode_coords, engine, chunks, lock, cache, drop_variables, backend_kwargs)
347else:
348if engine isnotNoneand engine !='scipy':
--> 349 raise ValueError('can only read file-like objects with '
350"default engine or engine='scipy'")
351# assume filename_or_obj is a file-like objectValueError: can only read file-like objects with default engine or engine='scipy'
Problem description
It is now possible to do this with h5py >2.9.0. see h5py/h5py#1105. This would be a useful feature because there is a lot of NASA data out there in HDF5. This functionality could open up reading without first writing to disk (to translate to Zarr or other formats possibly). There seem to be many issues related to this: fsspec/s3fs#144 #2535
I'm guessing adding this functionality doesn't fix many of the performance issues related to HDF5 and Dask dask/dask#2488 dask/distributed#2319
xarray 11.3 currently won't read HDF5 file-like objects
Problem description
It is now possible to do this with h5py >2.9.0. see h5py/h5py#1105. This would be a useful feature because there is a lot of NASA data out there in HDF5. This functionality could open up reading without first writing to disk (to translate to Zarr or other formats possibly). There seem to be many issues related to this:
fsspec/s3fs#144
#2535
I'm guessing adding this functionality doesn't fix many of the performance issues related to HDF5 and Dask
dask/dask#2488
dask/distributed#2319
Expected Output
Output of
xr.show_versions()
xarray: 0.11.3
pandas: 0.24.1
numpy: 1.16.1
scipy: 1.2.0
netCDF4: 1.4.2
pydap: None
h5netcdf: 0.6.2
h5py: 2.9.0
Nio: None
zarr: 2.2.0
cftime: 1.0.3.4
PseudonetCDF: None
rasterio: 1.0.18
cfgrib: None
iris: None
bottleneck: None
cyordereddict: None
dask: 1.1.0
distributed: 1.25.2
matplotlib: 3.0.2
cartopy: 0.17.0
seaborn: 0.9.0
setuptools: 40.7.1
pip: 19.0.2
conda: 4.6.3
pytest: None
IPython: 7.1.1
sphinx: None
The text was updated successfully, but these errors were encountered: