You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is a known issue, and we are working on it. A workaround is to write the result of split_multi to disk before using it (which is a good idea regardless).
The cause appears to the interaction of several things:
split_multi computes two separate pieces, then merges them together with a TableUnion
The aggregation doesn't care how the mt is keyed, and we propagate that upstream
Because of that, the TableUnion ends up unioning two unkeyed tables
TableUnion, as currently implemented, can only produce a result with a strict partitioner. In the case of no key fields, that means one partition.
The text was updated successfully, but these errors were encountered:
fixes#13407
CHANGELOG: Resolves#13407 in which uses of `union_rows` could reduce
parallelism to one partition resulting in severely degraded performance.
TableUnion was always collapsing to a single partition when the key was
empty. This adds a special case handling, which just concatenates
partitions.
The body of the resulting TableStage is a little hacky: it does a
StreamMultiMerge, but where exactly one input stream is non-empty. I
think that should have fine performance, and I didn’t see any simpler
ways to do it.
For example,
performs the aggregation in one giant partition.
This is a known issue, and we are working on it. A workaround is to write the result of
split_multi
to disk before using it (which is a good idea regardless).The cause appears to the interaction of several things:
split_multi
computes two separate pieces, then merges them together with aTableUnion
TableUnion
ends up unioning two unkeyed tablesTableUnion
, as currently implemented, can only produce a result with a strict partitioner. In the case of no key fields, that means one partition.The text was updated successfully, but these errors were encountered: