You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
…y optimization (#35989)
### Rationale for this change
### What changes are included in this PR?
1. This patch does some tiny optimizations on Parquet C++ Statistics. It does:
```
For min-max, using std::string. Because assume the case like that:
EncodedStatistics c1;
// do some operations
EncodedStatistics c2 = c1;
c2.set_max("dasdasdassd");
After c2 set, c1 would be set too. So I use std::string here.
```
2. Force clear ndv count during merging, and set `has_distinct_count_ = false`, and add some comments
3. Add some specification in Statistics API
### Are these changes tested?
Yes
### Are there any user-facing changes?
No
* Closes: #34351
Lead-authored-by: mwish <maplewish117@gmail.com>
Co-authored-by: mwish <1506118561@qq.com>
Co-authored-by: Antoine Pitrou <pitrou@free.fr>
Co-authored-by: Gang Wu <ustcwg@gmail.com>
Signed-off-by: Antoine Pitrou <antoine@python.org>
Describe the bug, including details regarding any error messages, version, and platform.
In
src/parquet/statistics.cc
:The original code ignore setting
has
flag.Component(s)
C++, Parquet
The text was updated successfully, but these errors were encountered: