[SPARK-34938][SQL][TESTS] Benchmark only legacy interval in `ExtractBenchmark` #32035

MaxGekk · 2021-04-02T06:14:24Z

What changes were proposed in this pull request?

In the PR, I propose to disable ANSI intervals as the result of dates/timestamp subtraction in ExtractBenchmark and benchmark only legacy intervals because EXTRACT( .. FROM ..) doesn't support ANSI intervals so far.

Why are the changes needed?

This fixes the benchmark failure:

[info]   Running case: YEAR of interval
[error] Exception in thread "main" org.apache.spark.sql.AnalysisException: cannot resolve 'year((subtractdates(CAST(timestamp_seconds(id) AS DATE), DATE '0001-01-01') + subtracttimestamps(timestamp_seconds(id), TIMESTAMP '1000-01-01 01:02:03.123456')))' due to data type mismatch: argument 1 requires date type, however, '(subtractdates(CAST(timestamp_seconds(id) AS DATE), DATE '0001-01-01') + subtracttimestamps(timestamp_seconds(id), TIMESTAMP '1000-01-01 01:02:03.123456'))' is of day-time interval type.; line 1 pos 0;
[error] 'Project [extract(YEAR, (subtractdates(cast(timestamp_seconds(id#1456L) as date), 0001-01-01, false) + subtracttimestamps(timestamp_seconds(id#1456L), 1000-01-01 01:02:03.123456, false, Some(Europe/Moscow)))) AS YEAR#1458]
[error] +- Range (1262304000, 1272304000, step=1, splits=Some(1))
[error] 	at org.apache.spark.sql.catalyst.analysis.package$AnalysisErrorAt.failAnalysis(package.scala:42)
[error] 	at org.apache.spark.sql.catalyst.analysis.CheckAnalysis$$anonfun$$nestedInanonfun$checkAnalysis$1$2.applyOrElse(CheckAnalysis.scala:194)

Does this PR introduce any user-facing change?

No

How was this patch tested?

By running the ExtractBenchmark benchmark via:

$ build/sbt "sql/test:runMain org.apache.spark.sql.execution.benchmark.ExtractBenchmark"

MaxGekk · 2021-04-02T06:21:48Z

@HyukjinKwon Could you review this PR, please.

HyukjinKwon · 2021-04-02T06:29:39Z

sql/core/src/test/scala/org/apache/spark/sql/execution/benchmark/ExtractBenchmark.scala

-    withSQLConf(SQLConf.WHOLESTAGE_CODEGEN_ENABLED.key -> "true") {
+    withSQLConf(
+      SQLConf.LEGACY_INTERVAL_ENABLED.key -> "true",
+      SQLConf.WHOLESTAGE_CODEGEN_ENABLED.key -> "true") {


BTW, I asked (offline) to don't generate the benchmark results because I plan to regenerate everything after #32015

HyukjinKwon · 2021-04-02T06:45:14Z

None of tests actually verfiies this changes except comliation and linter which passed.

Merged to master

Enable legacy intervals

a7327e4

github-actions bot added the SQL label Apr 2, 2021

MaxGekk changed the title ~~[SPARK-34938][SQL] Benchmark only legacy interval in ExtractBenchmark~~ [SPARK-34938][SQL][TESTS] Benchmark only legacy interval in ExtractBenchmark Apr 2, 2021

MaxGekk mentioned this pull request Apr 2, 2021

[SPARK-34821][INFRA] Set up a workflow for developers to run benchmark in their fork #32015

Closed

HyukjinKwon reviewed Apr 2, 2021

View reviewed changes

HyukjinKwon approved these changes Apr 2, 2021

View reviewed changes

HyukjinKwon closed this in 1d08451 Apr 2, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-34938][SQL][TESTS] Benchmark only legacy interval in `ExtractBenchmark` #32035

[SPARK-34938][SQL][TESTS] Benchmark only legacy interval in `ExtractBenchmark` #32035

MaxGekk commented Apr 2, 2021

MaxGekk commented Apr 2, 2021

HyukjinKwon Apr 2, 2021

HyukjinKwon commented Apr 2, 2021

[SPARK-34938][SQL][TESTS] Benchmark only legacy interval in ExtractBenchmark #32035

[SPARK-34938][SQL][TESTS] Benchmark only legacy interval in ExtractBenchmark #32035

Conversation

MaxGekk commented Apr 2, 2021

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

MaxGekk commented Apr 2, 2021

HyukjinKwon Apr 2, 2021

Choose a reason for hiding this comment

HyukjinKwon commented Apr 2, 2021

[SPARK-34938][SQL][TESTS] Benchmark only legacy interval in `ExtractBenchmark` #32035

[SPARK-34938][SQL][TESTS] Benchmark only legacy interval in `ExtractBenchmark` #32035