How to reduce the output size with to_netcdf? #865

simon3122 · 2016-06-02T07:45:28Z

Hello,

The ds.to_netcdf method accepts several netCDF write modes and gives control over a number of options (missing values, format...).

Our current options are as follows:

ds.to_netcdf(filename,
             mode='w',
             format='NETCDF4',
             engine='netcdf4',
             encoding={vkey2: {'dtype': 'float32', '_FillValue': 0}})

Whereas the input size is 6 GiB / variable, the output size reaches 23 GiB / variable.

What is the best strategy for reducing the netCDF output size with these options, other than changing the format (in my case the data are float32)?
In particular, is there any benefit in modifying the deprecated "missing_value" attribute?

Thank you in advance,

fmaussion · 2016-06-03T08:05:13Z

NetCDF and xarray support lossy compression (extremely efficient and fast, at the cost of some numerical precision) or gzip compression (no precision loss, but slower I/O, especially when reading chunks of data). You can have a look at the documentation about NetCDF I/O here.

marcosrdac · 2022-08-18T16:01:58Z

How do I get lossy compression? I could not find it in the documentation :(

dcherian · 2022-08-18T17:04:09Z

Please read the documentation

marcosrdac · 2022-08-19T00:04:21Z

Thanks, I thought there were some methods to choose from or something like that. For future readers, scale_factor seems to be used to control compression loss.
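As a rough illustration of the scale_factor approach mentioned above, here is a minimal sketch that packs a float variable into 16-bit integers. The helper names `pack_encoding` and `write_packed` are hypothetical (they are not part of xarray); the encoding keys `dtype`, `scale_factor`, `add_offset` and `_FillValue` are the ones xarray passes through to the netCDF writer. The formula assumes the value range of the variable is known in advance and that the most negative integer is reserved for missing data.

```python
def pack_encoding(vmin, vmax, n_bits=16):
    """Build an encoding dict that packs floats in [vmin, vmax] into signed integers.

    Hypothetical helper, not an xarray function. n_bits should be 8, 16 or 32
    so that the resulting dtype name is a valid integer type.
    """
    if vmax <= vmin:
        raise ValueError("vmax must be greater than vmin")
    # Reserve the most negative integer as the fill value for missing data.
    fill_value = -(2 ** (n_bits - 1))
    # The remaining 2**n_bits - 1 integers span 2**n_bits - 2 intervals.
    scale_factor = (vmax - vmin) / (2 ** n_bits - 2)
    add_offset = (vmax + vmin) / 2
    return {
        "dtype": f"int{n_bits}",
        "scale_factor": scale_factor,
        "add_offset": add_offset,
        "_FillValue": fill_value,
    }


def write_packed(ds, path, name):
    """Write the xarray Dataset `ds` to `path`, packing variable `name` as int16."""
    vmin = float(ds[name].min())
    vmax = float(ds[name].max())
    ds.to_netcdf(path, encoding={name: pack_encoding(vmin, vmax)})
```

With this scheme the rounding error on each value is at most half of scale_factor, so a wider value range means a coarser result. Going from float32 to int16 halves the storage per value before any further compression is applied.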

dcherian · 2022-08-19T14:07:00Z

You can also use zlib and complevel

marcosrdac · 2022-08-24T14:42:47Z

> You can also use zlib and complevel

Just to make it clear: those options configure the lossless compression of the netCDF4 library, not lossy compression.
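For the lossless route, a minimal sketch of passing zlib and complevel through the encoding argument could look like the following. The helper names `zlib_encoding` and `write_compressed` are hypothetical; `zlib` and `complevel` are the encoding keys named in this thread, and the level of 4 is only an assumed starting point, not a recommendation from the xarray developers.

```python
def zlib_encoding(var_names, complevel=4):
    """Build a per-variable encoding dict enabling lossless zlib compression.

    Hypothetical helper, not an xarray function.
    """
    if not 0 <= complevel <= 9:
        raise ValueError("complevel must be between 0 and 9")
    return {name: {"zlib": True, "complevel": complevel} for name in var_names}


def write_compressed(ds, path, complevel=4):
    """Write the xarray Dataset `ds` to `path` with every data variable compressed."""
    encoding = zlib_encoding(list(ds.data_vars), complevel=complevel)
    # Compression of this kind requires the NETCDF4 file format.
    ds.to_netcdf(path, format="NETCDF4", engine="netcdf4", encoding=encoding)
```

Higher levels trade write time for smaller files, and the values read back are identical to those written. These keys can also be merged into the same per-variable dict as scale_factor and add_offset if both packing and compression are wanted.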

dcherian closed this as completed Jan 23, 2019

dcherian reopened this Aug 19, 2022

dcherian closed this as completed Aug 19, 2022

github-actions bot added the "needs triage" label Aug 19, 2022

andersy005 removed the "needs triage" label Aug 19, 2022

shartgring mentioned this issue May 30, 2024

Round data in staticmaps.nc Deltares/hydromt_wflow#231

Open
