New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix bugs in New Parquet Reader #4000
Conversation
@@ -80,7 +68,9 @@ public ParquetColumnChunkPageReader readColumn(ColumnDescriptor columnDescriptor | |||
{ | |||
checkArgument(currentBlockMetadata.getRowCount() > 0, "Row group having 0 rows"); | |||
|
|||
ColumnChunkMetaData metadata = columnMetadata.get(columnDescriptor); | |||
ColumnChunkMetaData metadata = getColumnChunkMetaData(columnDescriptor); | |||
checkArgument(metadata != null, "Could not find column metadata in the parquet file"); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why not put this in getColumnChunkMetaData
and then the method would never return null
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should this be some kind of Presto corruption exception (like we have for ORC)?
1f9f915
to
5b4f318
Compare
thank you @dain , get comments addressed |
@dain I updated this PR with more fixes in the new Parquet Reader, your comments and suggestions are appreciated |
f0f650f
to
779b0bb
Compare
also merge the testcase PR here |
I'm repeating this comment from one of the commits.... For all of the Parquet code, can you take a look at the calls like |
Additionally, look at all uses of the new |
This PR seems to have |
Can you scan through all the commits and fix any place where the method arguments are aligned, instead of using two indents (8 spaces)? |
For the new |
My comments are mostly style/formatting. Let me know when this is updated and I'll get it in. |
6dcd646
to
622fe5a
Compare
Thank u so much @dain |
f08a1dc
to
ed843c0
Compare
ed843c0
to
eddb174
Compare
I will land this after the next release goes out (they are working on it now). |
Merged, thanks! |
Fix 3 things in New Parquet Reader: