Populate partition column statistics for PartitionedFile #19284
let schema_adapter = self.create_schema_adapter();
let (schema_mapper, _) = schema_adapter.map_schema(self.file_schema.as_ref())?;
This was mapping from file schema -> table schema by filling in unknown stats for partition columns. It did not do any per-file adaptation; it assumed all files already had the same schema.
e1d1c91 to 26f07ab
mbutrovich
left a comment
That's a nice net-negative line count! Thanks @adriangb!
3ad1f1f to e15dbe0
Thanks @mbutrovich! I'll leave this open for another day or so for public comment.

@xudong963, in particular I'm interested in your input since you authored #15865.
Co-authored-by: Martin Grigorov <martin-g@users.noreply.github.com>
c457754 to 2b569a9
Supersedes #15865
Part of #16800
The idea here was to remove usage of `SchemaAdapter` and at the same time actually populate the partition column statistics.
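
To illustrate the difference, here is a minimal, self-contained sketch. It is not the code in this PR and does not use DataFusion's real types: `Stat`, `ColumnStats`, `pad_with_unknown`, `partition_column_stats`, and `append_partition_stats` are all hypothetical names made up for this example, and partition values are simplified to strings. It assumes that every row in a file shares the same non-null partition value, so min and max for that column can be taken directly from the value.

```rust
/// Hypothetical stand-in for a statistic that may or may not be known.
#[derive(Debug, Clone, PartialEq)]
enum Stat<T> {
    Exact(T),
    Unknown,
}

/// Hypothetical, simplified per-column statistics.
#[derive(Debug, Clone, PartialEq)]
struct ColumnStats {
    min: Stat<String>,
    max: Stat<String>,
    null_count: Stat<usize>,
}

impl ColumnStats {
    fn unknown() -> Self {
        ColumnStats {
            min: Stat::Unknown,
            max: Stat::Unknown,
            null_count: Stat::Unknown,
        }
    }
}

/// Roughly the previous behavior described in the review comment:
/// file stats are extended to the table schema by padding each
/// partition column with unknown statistics.
fn pad_with_unknown(mut file_stats: Vec<ColumnStats>, num_partition_cols: usize) -> Vec<ColumnStats> {
    file_stats.extend(std::iter::repeat_with(ColumnStats::unknown).take(num_partition_cols));
    file_stats
}

/// Assumption: all rows in a file share one non-null partition value,
/// so min == max == that value and there are no nulls.
fn partition_column_stats(value: &str) -> ColumnStats {
    ColumnStats {
        min: Stat::Exact(value.to_string()),
        max: Stat::Exact(value.to_string()),
        null_count: Stat::Exact(0),
    }
}

/// Append exact statistics for each partition column after the file columns.
fn append_partition_stats(mut file_stats: Vec<ColumnStats>, partition_values: &[&str]) -> Vec<ColumnStats> {
    file_stats.extend(partition_values.iter().map(|&v| partition_column_stats(v)));
    file_stats
}
```

In this sketch, a file with one data column under a partition directory such as `date=2024-01-01` would end up with two entries: the file column's own stats, followed by exact min/max for `date`, instead of an unknown placeholder.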