[VL] Add support for approx_count_distinct_for_intervals #11599

@zhztheplayer

Description

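`approx_count_distinct_for_intervals` is an essential function for Spark CBO with histograms. Spark uses it when computing column statistics with histograms enabled, and the Velox backend currently has no Substrait mapping for it, so the aggregate falls back.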

Error:

[main] WARN org.apache.spark.sql.execution.GlutenFallbackReporter - Validation failed for plan: ObjectHashAggregate[QueryId=113], due to: 
 - Validation failed with exception from: RegularHashAggregateExecTransformer, reason: Could not find a valid substrait mapping name for approx_count_distinct_for_intervals(s_store_sk#87088L, org.apache.spark.sql.catalyst.expressions.UnsafeArrayData@3915186b, 0.05, 0, 0).

Reference:

https://github.com/apache/spark/blob/ebd5b007fcf203eadcf8b037ab2b99577490f869/sql/core/src/main/scala/org/apache/spark/sql/execution/command/CommandUtils.scala#L401
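
For anyone picking this up, here is a rough sketch of the semantics the native implementation would need to match, as I understand them from Spark's `ApproxCountDistinctForIntervals`: given sorted endpoints `e_0 .. e_N`, it returns one distinct count per interval `[e_0, e_1], (e_1, e_2], ..., (e_{N-1}, e_N]`, skipping nulls and values outside the overall range. Spark estimates each count with HyperLogLog++ (the `0.05` in the log above is the relative standard deviation). The sketch below swaps that for exact sets to keep it short, and the function name `distinct_counts_for_intervals` is made up for illustration.

```python
from bisect import bisect_left


def distinct_counts_for_intervals(values, endpoints):
    """Exact per-interval distinct counts (illustration only).

    Spark keeps one HyperLogLog++ sketch per interval; plain sets are
    used here instead, so the results are exact rather than approximate.
    `endpoints` is assumed to be sorted in ascending order.
    """
    if len(endpoints) < 2:
        raise ValueError("need at least two endpoints")

    # One set of seen values per interval.
    seen = [set() for _ in range(len(endpoints) - 1)]

    for v in values:
        # Nulls and values outside [endpoints[0], endpoints[-1]] are ignored.
        if v is None or v < endpoints[0] or v > endpoints[-1]:
            continue
        # bisect_left returns the first index i with endpoints[i] >= v, so
        # v lies in (endpoints[i-1], endpoints[i]] and belongs to interval
        # i - 1. A value equal to endpoints[0] goes to the first interval.
        idx = bisect_left(endpoints, v)
        seen[max(idx - 1, 0)].add(v)

    return [len(s) for s in seen]
```

For example, with endpoints `[0, 10, 20]`, the value `10` is counted in the first interval and `11` in the second, while `25` is dropped.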

Labels

enhancement (New feature or request)
