Skip to content

ARROW-5562: [C++][Parquet] Write negative zero or small epsilons as positive zero when computing Parquet statistics#5375

Closed
wesm wants to merge 2 commits intoapache:masterfrom
wesm:ARROW-5562
Closed

ARROW-5562: [C++][Parquet] Write negative zero or small epsilons as positive zero when computing Parquet statistics#5375
wesm wants to merge 2 commits intoapache:masterfrom
wesm:ARROW-5562

Conversation

@wesm
Copy link
Member

@wesm wesm commented Sep 12, 2019

The OP reported a bug where we could have the "max" statistic equal to -0.0 while the min is +0.0, which causes an error in some processing frameworks like Hive.

I also added logic to zero out small epsilons. If this offends anyone, can you suggest an alternative approach to fixing this?

@wesm wesm requested review from majetideepak and pitrou September 12, 2019 22:03
Copy link
Member

@pitrou pitrou left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM on the principle.

@bkietz
Copy link
Member

bkietz commented Sep 16, 2019

travis failure is https://issues.apache.org/jira/browse/ARROW-6509
merging

@bkietz bkietz closed this in 879cf3c Sep 16, 2019
@wesm
Copy link
Member Author

wesm commented Sep 16, 2019

@majetideepak if you have a chance to look at this since it affects Parquet statistics please let us know if you have concerns

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants