More convenient usage of big integers and ORDER BY WITH FILL. #46152

Merged

alexey-milovidov merged 12 commits into master from with-fill-bigint

Feb 10, 2023

Member

alexey-milovidov commented Feb 8, 2023

Changelog category (leave one):

Improvement

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):

More convenient usage of big integers and ORDER BY WITH FILL. Allow using plain integers for the start and end points in WITH FILL when ordering by big (128-bit and 256-bit) integers. Fix the wrong result for big integers with negative start or end points. This closes #16733.

See also the assertion about unexpected type of exception in debug build https://s3.amazonaws.com/clickhouse-test-reports/46149/a69f9c05a981a6c95af496b47413e540c87e2c09/fuzzer_astfuzzerdebug/report.html
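To make the changelog entry concrete, here is a minimal C++ sketch of filling a 128-bit key from plain 64-bit bounds. It is not the ClickHouse implementation: `fillRange` and the `Int128` alias are hypothetical names, `__int128` is a GCC/Clang extension standing in for the big integer column type, and the half-open range is an assumption about how FROM ... TO behaves. The point is only that the bounds are sign-extended to the wide type before any arithmetic, so a negative start stays negative.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// GCC/Clang extension; stands in for a 128-bit column type in this sketch.
__extension__ typedef __int128 Int128;

// Hypothetical helper: returns start, start + step, ... while the value is
// strictly less than end. The plain 64-bit bounds are widened with sign
// extension first, so negative bounds keep their value in the wide type.
inline std::vector<Int128> fillRange(std::int64_t start, std::int64_t end, std::int64_t step)
{
    assert(step > 0);  // a non-positive step would never reach the end bound

    std::vector<Int128> result;
    const Int128 wide_end = static_cast<Int128>(end);
    for (Int128 value = static_cast<Int128>(start); value < wide_end; value += step)
        result.push_back(value);
    return result;
}
```

Doing the loop arithmetic in the wide type also means that `value += step` cannot overflow for any pair of 64-bit bounds.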

alexey-milovidov added 2 commits

February 8, 2023 07:56


          Allow accurate comparison of Big Int with other integers

d66beb9


          Add a test

2678def

robot-ch-test-poll4 added the pr-improvement label

alexey-milovidov added 4 commits

February 8, 2023 08:18


          Remove bits of trash

0791b85


          Fix strange trash

a9ec73a


          Add a test

966f5b0


          Fix style

4f2a58d

Member Author

alexey-milovidov commented Feb 8, 2023

I also noticed that the code around type conversion is stupid.

yakov-olkhovskiy self-assigned this


          Merge remote-tracking branch 'origin/master' into with-fill-bigint

292a5ab

Member

yakov-olkhovskiy commented Feb 9, 2023

@alexey-milovidov it seems there is an exception in the debug build:
https://github.com/ClickHouse/ClickHouse/actions/runs/4129815223/jobs/7136650496#step:6:960
here:
https://github.com/ClickHouse/ClickHouse/blob/with-fill-bigint/src/Core/Field.h#L827-L834

Member Author

alexey-milovidov commented Feb 9, 2023

@yakov-olkhovskiy I want WITH FILL to use getLeastSuperType to determine the resulting type.
Currently, the logic is strange.

Member Author

alexey-milovidov commented Feb 9, 2023

But I don't understand whether we can change the types after WITH FILL.

Member Author

alexey-milovidov commented Feb 9, 2023

Looks like no.

Member Author

alexey-milovidov commented Feb 9, 2023

And there are some complications with STEP.

alexey-milovidov added 4 commits

February 9, 2023 08:09


          Make it better

a40ef2b


          Add a test for #30421

327be4d


          Fix something

cb84717


          Implement #16733

3a75ede

Member

yakov-olkhovskiy commented Feb 9, 2023

> Currently, the logic is strange.

I think it's done this way to simplify step function generation

Member Author

alexey-milovidov commented Feb 9, 2023

Ready for review.

alexey-milovidov commented


src/Common/FieldVisitorConvertToNumber.h

@@ -53,7 +52,6 @@ class FieldVisitorConvertToNumber : public StaticVisitor<T>
                   T operator() (const UInt64 & x) const { return T(x); }
                   T operator() (const Int64 & x) const { return T(x); }
-                  T operator() (const Int128 & x) const { return T(x); }

Member Author

alexey-milovidov Feb 9, 2023

These were leftovers from the time when we used UInt128 for UUID, a long time ago. I've cleaned them up. Some bugs may be fixed automatically by this change...

alexey-milovidov commented


src/Common/FieldVisitorsAccurateComparison.h

                           if constexpr (std::is_same_v<T, U>)
                               return l == r;
-                          if constexpr (std::is_arithmetic_v<T> && std::is_arithmetic_v<U>)

Member Author

alexey-milovidov Feb 9, 2023

This enables comparison for big integers.
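For context, `std::is_arithmetic_v` is true only for built-in arithmetic types, so a condition based on it skips any integer implemented as a class type. The sketch below shows what an "accurate" equality between integers of different signedness can look like for built-in types. `accurateEquals` is a hypothetical name used for illustration and is not the function from FieldVisitorsAccurateComparison.h.

```cpp
#include <cassert>
#include <cstdint>
#include <type_traits>

// Hypothetical helper: value-preserving equality between two built-in integer
// types. A plain `l == r` with mixed signedness converts the signed operand
// to unsigned, which makes -1 compare equal to the maximum unsigned value.
template <typename T, typename U>
constexpr bool accurateEquals(T l, U r)
{
    static_assert(std::is_integral_v<T> && std::is_integral_v<U>);

    if constexpr (std::is_signed_v<T> == std::is_signed_v<U>)
        return l == r;  // same signedness: the usual conversions keep the value
    else if constexpr (std::is_signed_v<T>)
        return l >= 0 && static_cast<std::make_unsigned_t<T>>(l) == r;
    else
        return r >= 0 && l == static_cast<std::make_unsigned_t<U>>(r);
}

// A negative value never equals an unsigned one, whatever its bit pattern.
static_assert(!accurateEquals(std::int64_t{-1}, std::uint64_t{18446744073709551615ULL}));
```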

alexey-milovidov commented


src/Interpreters/FillingRow.cpp

@@ -49,7 +49,7 @@ bool FillingRow::next(const FillingRow & to_row)
                   size_t pos = 0;
                   /// Find position we need to increment for generating next row.
-                  for (; pos < size(); ++pos)

Member Author

alexey-milovidov Feb 9, 2023

This is a minor change, but the compiler typically cannot hoist the size() call out of the loop, because the aliasing rules in C++ are not strict enough (in comparison to Rust).
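A minimal sketch of the pattern described here, with the size read once into a local before the loop. `Row` and `firstDifference` are hypothetical stand-ins and not the real `FillingRow` code. Whether a given compiler would hoist the call on its own depends on what it can prove about aliasing, so the sketch only shows the manual form.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a row of fill values; not the real FillingRow.
struct Row
{
    std::vector<long> values;
    std::size_t size() const { return values.size(); }
};

// Returns the first position at which the two rows differ,
// or the row size if they are equal.
inline std::size_t firstDifference(const Row & lhs, const Row & rhs)
{
    assert(lhs.size() == rhs.size());  // both rows share one fill description

    std::size_t pos = 0;
    // The bound is a local constant, so nothing in the loop body can alias it.
    for (const std::size_t row_size = lhs.size(); pos < row_size; ++pos)
        if (lhs.values[pos] != rhs.values[pos])
            break;
    return pos;
}
```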

alexey-milovidov commented


src/Processors/Transforms/FillingTransform.cpp

                   /// Columns which are not from sorting key may not be constant anymore.
                   for (auto & column : header)
                       if (column.column && isColumnConst(*column.column) && !sort_keys.contains(column.name))
-                          column.column = column.type->createColumn();
+                          column.column = column.column->convertToFullColumnIfConst();

Member Author

alexey-milovidov Feb 9, 2023

This is a more natural way to do the same.

alexey-milovidov commented


src/Processors/Transforms/FillingTransform.cpp

                   {
                       WhichDataType which_from(descr.fill_from_type);
                       if ((which_from.isDateOrDate32() || which_from.isDateTime() || which_from.isDateTime64()) &&
-                          !descr.fill_from_type->equals(*type))
+                          !descr.fill_from_type->equals(*removeNullable(type)))

Member Author

alexey-milovidov Feb 9, 2023

Enables WITH FILL when the type is Nullable, see the test.

alexey-milovidov commented


src/Processors/Transforms/FillingTransform.cpp

                               return false;
                   }
-                  /// TODO Wrong results for big integers.
-                  if (isInteger(type) || which.isDate() || which.isDate32() || which.isDateTime())
+                  if (which.isInt128() || which.isUInt128())

Member Author

alexey-milovidov Feb 9, 2023

Implemented TODO.
The code remains tricky, maybe it's worth simplifying later.

alexey-milovidov commented


src/Processors/Transforms/FillingTransform.cpp

-                  descr.fill_from = convertFieldToType(descr.fill_from, *to_type);
-                  descr.fill_to = convertFieldToType(descr.fill_to, *to_type);
-                  descr.fill_step = convertFieldToType(descr.fill_step, *to_type);
+                  if (!descr.fill_from.isNull())

Member Author

alexey-milovidov Feb 9, 2023

This is much better: we explicitly check whether the field is convertible instead of risking confusion with NULL, which has a different meaning in the WITH FILL logic.
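The distinction can be sketched as follows, under loudly stated assumptions: an empty `std::optional` plays the role of a NULL bound ("not specified"), and `convertBound` is a hypothetical function, not `convertFieldToType`. A conversion failure is reported through an exception, so it cannot be mistaken for an unspecified bound.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>
#include <optional>
#include <stdexcept>

// In this sketch an empty optional means "bound not specified",
// analogous to a NULL fill_from or fill_to.
using Bound = std::optional<std::int64_t>;

// Hypothetical conversion to a narrower key type. An unspecified bound is
// passed through unchanged; a value that does not fit raises an error
// instead of silently turning into "not specified".
inline std::optional<std::int32_t> convertBound(const Bound & bound)
{
    if (!bound)
        return std::nullopt;  // not specified: nothing to convert

    if (*bound < std::numeric_limits<std::int32_t>::min()
        || *bound > std::numeric_limits<std::int32_t>::max())
        throw std::out_of_range("fill bound does not fit the column type");

    return static_cast<std::int32_t>(*bound);
}
```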

alexey-milovidov commented


src/Processors/Transforms/FillingTransform.cpp

@@ -184,18 +196,20 @@ FillingTransform::FillingTransform(
                       fill_column_positions.push_back(block_position);
                       auto & descr = filling_row.getFillDescription(i);
-                      const auto & type = header_.getByPosition(block_position).type;
+                      const Block & output_header = getOutputPort().getHeader();

Member Author

alexey-milovidov Feb 9, 2023

Taking the type from the output header is not strictly needed, as its data types are currently identical to those of the input header, but I was not sure about that and added this code. It is more natural, so the change is worth keeping.

alexey-milovidov commented


src/Processors/Transforms/FillingTransform.cpp

-                      const auto & type = header_.getByPosition(block_position).type;
+                      const Block & output_header = getOutputPort().getHeader();
+                      const DataTypePtr & type = removeNullable(output_header.getByPosition(block_position).type);

Member Author

alexey-milovidov Feb 9, 2023

Support for Nullable.

alexey-milovidov commented


src/Processors/Transforms/FillingTransform.cpp


-                  if (type->isValueRepresentedByUnsignedInteger() &&
+                  if (isUnsignedInteger(type) &&

Member Author

alexey-milovidov Feb 9, 2023

Support for big integers.

alexey-milovidov commented


src/Processors/Transforms/FillingTransform.cpp

@@ -213,7 +227,7 @@ FillingTransform::FillingTransform(
                                   input_positions.emplace_back(idx, p->second);
                       if (!is_fill_column[idx] && !(interpolate_description && interpolate_description->result_columns_set.contains(column.name)))
-                              other_column_positions.push_back(idx);
+                          other_column_positions.push_back(idx);

Member Author

alexey-milovidov Feb 9, 2023

Style.

alexey-milovidov commented


src/Processors/Transforms/FillingTransform.cpp

@@ -335,8 +349,8 @@ void FillingTransform::transform(Chunk & chunk)
                       interpolate();
                       while (filling_row.next(next_row))
                       {
-                              insertFromFillingRow(res_fill_columns, res_interpolate_columns, res_other_columns, filling_row, interpolate_block);
-                              interpolate();
+                          insertFromFillingRow(res_fill_columns, res_interpolate_columns, res_other_columns, filling_row, interpolate_block);

Member Author

alexey-milovidov Feb 9, 2023

Style.


          Add a test

8c9be17

yakov-olkhovskiy approved these changes


alexey-milovidov merged commit d5d87bc into master

alexey-milovidov deleted the with-fill-bigint branch

February 10, 2023 17:05
