Skip to content

Total allocated memory keeps growing even when reading a parquet file in a streaming manner #7675

Closed Answered by tustvold
twitu asked this question in Q&A
Discussion options

You must be logged in to vote

The unit of IO is the page if the offset index is enabled, otherwise falling back to reading entire column chunk. What did you use to write the file? This behaviour would make sense if only a few very large row groups, and no offset index

Replies: 3 comments 7 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
1 reply
@twitu
Comment options

Comment options

You must be logged in to vote
6 replies
@twitu
Comment options

@tustvold
Comment options

Answer selected by twitu
@twitu
Comment options

@alamb
Comment options

@twitu
Comment options

@alamb
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants