Join GitHub today
GitHub is home to over 28 million developers working together to host and review code, manage projects, and build software together.Sign up
Representing missing values in string arrays on disk #1647
This came up as part of my clean-up of serializing unicode strings in #1648.
There are two ways to represent strings in netCDF files.
Currently, by default (if no
For character arrays, we could use the normal
For variable length strings, it currently isn't possible to set a fill-value. So there's no good way to indicate missing values, though this may change if the future depending on the resolution of the netCDF-python issue.
It would obviously be nice to always automatically round-trip missing values, both for strings and bytes. I see two possible ways to do this:
The default option is to adopt neither of these, and keep the current behavior where missing values are written as empty strings and not decoded at all.
Any opinions? I am leaning towards option (2).
referenced this issue
Oct 23, 2017
It occurs to me that yet another option is to avoid using