-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Truncate stats from Parquet files #113
Comments
@aokolnychyi, #78 reminded me about this item as well. |
@rdblue is it an issue good for n00b? if yes, I am interested to take this one :) |
@feng-tao, I think this could be a good first issue. Let us know if you need any help or context. |
@rdblue , I am still reading the code base. It would be great if you could guide me a little bit on the context or the related code path. Thanks a lot :) |
You might want to explore passing a truncate length option to |
thanks @rdblue , will take a look |
Lower and upper bound values from Parquet files are not currently truncated, which takes more space than necessary in manifests. Truncating strings and binary values will probably improve performance for large tables.
The text was updated successfully, but these errors were encountered: