forked from apache/arrow
Commit
ARROW-11607: [C++][Parquet] Update values_capacity_ when resetting.
I'm not sure why values_capacity_ is kept separately from the buffer, but there is a check that does not reserve capacity again if values_capacity_ is already the needed size. When ReleaseValues is called, we allocate a brand new empty buffer, so a values_capacity_ left over from the previous buffer no longer matches it and the reservation is skipped. I'm not really sure why this hasn't caused users more issues (maybe increasingly large row groups or some other phenomenon).

This bug also highlights that our C++ tests have very limited coverage of batched reads. To fix this I added a batched read to every round trip test to confirm it yields the same values.

Closes apache#9498 from emkornfield/ARROW-11607

Lead-authored-by: Micah Kornfield <emkornfield@gmail.com>
Co-authored-by: emkornfield <micahk@google.com>
Signed-off-by: Antoine Pitrou <antoine@python.org>
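The failure mode described in the commit message can be illustrated with a minimal sketch. This is not the actual Parquet record reader: `ToyRecordReader`, `ReadTwoBatches`, `Demo`, and the `update_capacity_on_release` flag are hypothetical names invented for the illustration, and a `std::vector` stands in for the real buffer. Only the names `values_capacity_` and `ReleaseValues` come from the commit message. The sketch assumes the bug is a cached capacity that survives a buffer swap, and the fix is to reset it when the buffer is released.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <memory>
#include <vector>

// Hypothetical stand-in for a record reader (not the real Parquet class).
class ToyRecordReader {
 public:
  // update_capacity_on_release is an illustrative switch between the
  // stale-capacity behaviour (false) and the corrected behaviour (true).
  explicit ToyRecordReader(bool update_capacity_on_release)
      : update_capacity_on_release_(update_capacity_on_release),
        values_(std::make_shared<std::vector<int32_t>>()) {}

  // Make room for `extra` more values. Reallocation is skipped whenever the
  // cached capacity already looks large enough.
  void Reserve(std::size_t extra) {
    const std::size_t needed = values_written_ + extra;
    if (values_capacity_ >= needed) return;  // trusts the cached capacity
    values_->resize(needed);
    values_capacity_ = needed;
  }

  // Store one value; returns false instead of overrunning the real buffer.
  bool Append(int32_t v) {
    if (values_written_ >= values_->size()) return false;
    (*values_)[values_written_++] = v;
    return true;
  }

  // Hand the filled buffer to the caller and start over with an empty one.
  std::shared_ptr<std::vector<int32_t>> ReleaseValues() {
    auto out = values_;
    out->resize(values_written_);
    values_ = std::make_shared<std::vector<int32_t>>();
    values_written_ = 0;
    // The corrected variant resets the cached capacity with the buffer.
    if (update_capacity_on_release_) values_capacity_ = 0;
    return out;
  }

 private:
  bool update_capacity_on_release_;
  std::shared_ptr<std::vector<int32_t>> values_;
  std::size_t values_written_ = 0;
  std::size_t values_capacity_ = 0;
};

// Read two equally sized batches; returns true only if both succeed.
inline bool ReadTwoBatches(ToyRecordReader& reader, std::size_t batch_size) {
  for (int batch = 0; batch < 2; ++batch) {
    reader.Reserve(batch_size);
    for (std::size_t i = 0; i < batch_size; ++i) {
      if (!reader.Append(static_cast<int32_t>(i))) return false;
    }
    auto values = reader.ReleaseValues();
    if (values->size() != batch_size) return false;
  }
  return true;
}

// Usage: the stale-capacity variant fails on the second batch.
inline void Demo() {
  ToyRecordReader stale(/*update_capacity_on_release=*/false);
  assert(!ReadTwoBatches(stale, 4));
  ToyRecordReader fixed(/*update_capacity_on_release=*/true);
  assert(ReadTwoBatches(fixed, 4));
}
```

In this sketch a single read never exposes the problem, since the cached capacity is only stale after the first release. That is consistent with the commit message's point that a batched read in each round trip test is what catches it.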
Showing 3 changed files with 86 additions and 5 deletions.