Skip to content

[Spec] Clarify whether floating point lower and upper bounds can contain NaNs#4404

Merged
rdblue merged 3 commits intoapache:masterfrom
emkornfield:clarify_stats_nans
Mar 27, 2022
Merged

[Spec] Clarify whether floating point lower and upper bounds can contain NaNs#4404
rdblue merged 3 commits intoapache:masterfrom
emkornfield:clarify_stats_nans

Conversation

@emkornfield
Copy link
Copy Markdown
Contributor

This is somewhat pedantic. Based on current wording:
Each value must be less than or equal to all non-null, non-NaN values in the column for the file [2]

Based on defined sorting statistics
Sorting floating-point numbers should produce the following behavior: -NaN < -Infinity < -value < -0 < 0 < value < Infinity < NaN -NaN could be a lower bound and NaN could be an upper bound.

This PR tries to clarify whether statistics can or can't used for lower and upper bounds (I think the first sentence quoted might have been intended to rule out NaN but it isn't clear, so I'm happy to reverse this language to state that +/- NaN are valid values for bounds if that is the case).

@rdblue
Copy link
Copy Markdown
Contributor

rdblue commented Mar 27, 2022

You're correct. NaN values are not allowed in lower or upper bounds. Thanks!

@rdblue rdblue merged commit 4d2ff3f into apache:master Mar 27, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants