parquet::ColumnReader::HasNextInternal might call ReadNewPage to check the record boundary.
bool HasNextInternal() {
  // Either there is no data page available yet, or the data page has been
  // exhausted
  if (num_buffered_values_ == 0 || num_decoded_values_ == num_buffered_values_) {
    if (!ReadNewPage() || num_buffered_values_ == 0) {
      return false;
    }
  }
  return true;
}
And ReadNewPage will call:
// Advance to the next data page
bool ReadNewPage() {
  // Loop until we find the next data page.
  while (true) {
    current_page_ = pager_->NextPage();
    if (!current_page_) {
      // EOS
      return false;
    }
When a data_page_filter is set and the file uses the V1 data page format, it seems that NextPage might filter out the data page that is needed to find the record boundary?
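To make the concern concrete, here is a minimal toy model, not Arrow code. ToyPage, ToyPager and CountValuesInFirstRecord are hypothetical names, and the only assumption is that a V1 record of a repeated field may continue onto the next data page. If the pager silently drops that page, the boundary scan attaches values from a later record to the current one.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <utility>
#include <vector>

// Toy stand-in for a V1 data page: only the repetition levels matter here.
// A repetition level of 0 marks the first value of a new record.
struct ToyPage {
  std::vector<int> rep_levels;
};

// Toy pager (hypothetical, not the Arrow PageReader): returns the next page
// that the filter does not reject, or nullptr at end of stream.
class ToyPager {
 public:
  ToyPager(std::vector<ToyPage> pages, std::function<bool(std::size_t)> skip)
      : pages_(std::move(pages)), skip_(std::move(skip)) {}

  const ToyPage* NextPage() {
    while (next_ < pages_.size()) {
      std::size_t index = next_++;
      if (skip_ && skip_(index)) continue;  // page is silently dropped
      return &pages_[index];
    }
    return nullptr;
  }

 private:
  std::vector<ToyPage> pages_;
  std::function<bool(std::size_t)> skip_;
  std::size_t next_ = 0;
};

// Counts the values of the first record by scanning until the next
// repetition level of 0, pulling new pages from the pager as needed.
std::size_t CountValuesInFirstRecord(ToyPager& pager) {
  std::size_t count = 0;
  bool started = false;
  while (const ToyPage* page = pager.NextPage()) {
    for (int level : page->rep_levels) {
      if (level == 0) {
        if (started) return count;  // next record begins: boundary found
        started = true;
      }
      if (started) ++count;
    }
  }
  return count;
}
```

With pages {0,1,1}, {1,1,0,1} and {1,0}, the first record has 5 values when nothing is filtered. If the filter drops the second page, the scan reports 4: the three values from the first page plus one value that actually belongs to a later record.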
Component(s)
C++, Parquet
The data page filter should not be set for V1 repeated fields. This is because
we do not have num_rows in DataPageStats for V1 repeated fields, so the
stats that the callback is called with are not helpful in that case anyway.
We should add a comment in the code. Note that the set_data_page_filter API
is marked experimental.
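As a rough illustration of that rule, a filter callback could refuse to skip any page whose stats carry no row count. This is only a sketch under assumptions: ToyPageStats and CanSkipPage are hypothetical names, and modelling the missing num_rows as an empty std::optional is an assumption rather than a statement about the real DataPageStats layout.

```cpp
#include <cassert>
#include <cstdint>
#include <optional>

// Hypothetical mirror of the stats handed to a page filter callback.
// Assumption: num_rows is empty for V1 pages of repeated fields.
struct ToyPageStats {
  std::int64_t num_values;
  std::optional<std::int64_t> num_rows;
};

// Allows a skip only when the page reports how many rows it holds;
// otherwise the page is always kept, whatever the filter wanted.
bool CanSkipPage(const ToyPageStats& stats, bool filter_wants_skip) {
  return filter_wants_skip && stats.num_rows.has_value();
}
```

A guard like this keeps pages without a row count in the stream, so a record that continues across pages is never cut short by the filter.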