[FEA] Unify `cudf::structs::detail::flatten_nested_columns` and `cudf::experimental::decompose_structs` to improve performance for structs comparison #13032

ttnghia · 2023-03-29T17:06:37Z

For comparing structs column, both the legacy row comparators and the new experimental row comparators rely on struct flattening procedures. Each of them have their own flattening mechanism: cudf::structs::detail::flatten_nested_columns and cudf::experimental::decompose_structs. The difference between them are:

cudf::structs::detail::flatten_nested_columns replaces the input structs column with an optional column generated by materializing the input null mask.
cudf::experimental::decompose_structs doesn't materialize any new column. Instead, it replaces the input structs column with a modified version of it, which only has either zero or one child at the innermost level.

Although these APIs produce different output, these APIs do very similar job:

Both extract the input structs column into a table of children columns, which are much simpler than the input structs column to be compared on device code.
Both replace the input by a new column, and this new column is mainly used for checking nulls.

The issue of each from these approaches are:

cudf::structs::detail::flatten_nested_columns needs to materialize null mask of the input column into a real column.
cudf::experimental::decompose_structs still has a nested structs column in the output. Although that column only has zero or one child at the innermost level, it still causes performance degradation if its nested level is very high.

As such, we can unify the two approaches, taking the pros of both while eliminating the cons. The new flattening API should:

Avoid materializing new columns, and
Avoid output columns having more than one nested level.

This seems to be very straightforward with modifying the existing cudf::experimental::decompose_structs API.

The text was updated successfully, but these errors were encountered:

GregoryKimball · 2023-06-07T20:48:05Z

Hello @ttnghia, thinking about the priority of this suggestion, do we have a way to estimate the performance impact of high nesting levels for the cudf::experimental::decompose_structs implementation?

ttnghia · 2023-06-07T20:52:49Z

do we have a way to estimate the performance impact of high nesting levels for the cudf::experimental::decompose_structs implementation?

We can run a benchmark comparing sortings of two non-nullable tables:

A table with highly nested struct+list
A table resulted from calling cudf::structs::detail::flatten_nested_columns on the table above.

The second table is flattened from the first table so it will have all columns having at max 1 nested level.

GregoryKimball · 2023-07-05T21:58:11Z

@divyegala and I discussed this idea today, and we would like to also monitor the impact to peak memory usage when experimenting with this idea.

ttnghia added feature request New feature or request Needs Triage Need team to review and classify labels Mar 29, 2023

ttnghia mentioned this issue Apr 5, 2023

Adopt experimental row comparator for struct min and max operators #10811

Closed

GregoryKimball mentioned this issue Apr 5, 2023

[FEA] Implement full support for nested types #11844

Closed

GregoryKimball added 0 - Backlog In queue waiting for assignment proposal Change current process or code libcudf Affects libcudf (C++/CUDA) code. Performance Performance related issue and removed feature request New feature or request Needs Triage Need team to review and classify labels Jun 7, 2023

GregoryKimball added this to the List and Struct data types and operations milestone Jun 7, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] Unify `cudf::structs::detail::flatten_nested_columns` and `cudf::experimental::decompose_structs` to improve performance for structs comparison #13032

[FEA] Unify `cudf::structs::detail::flatten_nested_columns` and `cudf::experimental::decompose_structs` to improve performance for structs comparison #13032

ttnghia commented Mar 29, 2023

GregoryKimball commented Jun 7, 2023

ttnghia commented Jun 7, 2023

GregoryKimball commented Jul 5, 2023

[FEA] Unify cudf::structs::detail::flatten_nested_columns and cudf::experimental::decompose_structs to improve performance for structs comparison #13032

[FEA] Unify cudf::structs::detail::flatten_nested_columns and cudf::experimental::decompose_structs to improve performance for structs comparison #13032

Comments

ttnghia commented Mar 29, 2023

GregoryKimball commented Jun 7, 2023

ttnghia commented Jun 7, 2023

GregoryKimball commented Jul 5, 2023

[FEA] Unify `cudf::structs::detail::flatten_nested_columns` and `cudf::experimental::decompose_structs` to improve performance for structs comparison #13032

[FEA] Unify `cudf::structs::detail::flatten_nested_columns` and `cudf::experimental::decompose_structs` to improve performance for structs comparison #13032