
[SPARK-39709][SQL] The result of executeCollect and doExecute of TakeOrderedAndProjectExec should be the same #37118

Closed
wangyum wants to merge 1 commit into apache:master from wangyum:SPARK-39709

Conversation


@wangyum wangyum commented Jul 7, 2022

What changes were proposed in this pull request?

This PR makes TakeOrderedAndProjectExec's executeCollect use doExecute()'s result.

Why are the changes needed?

To make the result of executeCollect and doExecute of TakeOrderedAndProjectExec the same. For example:

import testImplicits._

Seq((1, 1), (1, 2), (2, 3), (2, 4), (3, 5), (3, 6), (3, 7)).toDF("a", "b")
  .orderBy("a")
  .selectExpr("b")
  .limit(6)
  .show() // or: .collect().foreach(println)

.show() will use doExecute and the result is:

+---+
|  b|
+---+
|  1|
|  2|
|  3|
|  4|
|  5|
|  6|
+---+

.collect().foreach(println) will use executeCollect and the result is:

[2]
[1]
[3]
[4]
[7]
[6]
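The divergence can be modeled with a toy sketch. `RDD.takeOrdered` keeps the smallest `limit` rows per partition in a bounded priority queue and then merges the per-partition results, and among rows with equal sort keys the merge order is not fixed. Below is a minimal Python sketch (not Spark's actual implementation; the two-way partition split and the reversed arrival order are illustrative assumptions):

```python
import heapq

# Rows of (a, b); we sort by `a`, which contains duplicate values.
rows = [(1, 1), (1, 2), (2, 3), (2, 4), (3, 5), (3, 6), (3, 7)]
limit = 6
key = lambda r: r[0]

# doExecute-style path (sketch): sort all rows, then take the first `limit`.
full_sort = [b for _, b in sorted(rows, key=key)[:limit]]

# executeCollect-style path (sketch): each partition keeps its own smallest
# `limit` rows; the per-partition results then arrive in an arbitrary order
# before the final merge.
partitions = [rows[:3], rows[3:]]                  # assumed partitioning
arrived = [r for part in reversed(partitions)      # arrival order not guaranteed
           for r in heapq.nsmallest(limit, part, key=key)]
take_ordered = [b for _, b in sorted(arrived, key=key)[:limit]]

# full_sort == [1, 2, 3, 4, 5, 6] but take_ordered == [1, 2, 4, 3, 5, 6]:
# rows with equal `a` come out in arrival order, not input order.
```

Because the final sort is stable, ties keep their arrival order, so any tie that straddles a partition boundary can come out differently on the two paths.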

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Unit test.

@github-actions github-actions bot added the SQL label Jul 7, 2022

wangyum commented Jul 7, 2022

cc @cloud-fan @HyukjinKwon

val limited = if (orderingSatisfies) {
  child.execute().mapPartitionsInternal(_.map(_.copy()).take(limit)).takeOrdered(limit)(ord)
} else {
  child.execute().mapPartitionsInternal(_.map(_.copy())).takeOrdered(limit)(ord)
}
Contributor

hmm, so RDD.takeOrdered does not return ordered records?

Member Author

No. The result may not match when the sort column contains duplicate values. For example, ordering by column a:

+---+---+
|  a|  b|
+---+---+
|  1|  1|
|  1|  2|
|  2|  3|
|  2|  4|
|  3|  5|
|  3|  6|
+---+---+

@JoshRosen JoshRosen left a comment

In this example, the ordering does not produce a total order: there are multiple rows for each sort key:

scala> Seq((1, 1), (1, 2), (2, 3), (2, 4), (3, 5), (3, 6), (3, 7)).sortBy(_._1).map(_._2)
res5: Seq[Int] = List(1, 2, 3, 4, 5, 6, 7)

scala> Seq((1, 1), (1, 2), (2, 3), (2, 4), (3, 5), (3, 6), (3, 7)).reverse.sortBy(_._1).map(_._2)
res6: Seq[Int] = List(2, 1, 4, 3, 7, 6, 5)

In these cases, the ordering of the inputs to the sort will affect the final outcome.

Spark does not guarantee the order in which shuffle blocks are fetched: doExecute() shuffles the per-partition results into a single reduce partition and then performs a final sort there, but the order in which that final partition fetches data from the mappers is not guaranteed. As a result, the final sort may receive its input in different orders across executions, which can lead to non-deterministic results.
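This fetch-order effect can be sketched in miniature (plain Python, not Spark; the block contents and the enumeration of arrival orders are illustrative assumptions):

```python
import itertools

# Hypothetical sketch: the single reduce task concatenates map outputs in
# whatever order the shuffle blocks arrive. With duplicate sort keys, a
# stable final sort preserves that arrival order among ties, so different
# fetch orders can yield different top-6 results.
map_outputs = [
    [(1, 1), (2, 3), (3, 5)],   # duplicate keys are spread across blocks
    [(1, 2), (2, 4), (3, 6)],
    [(3, 7)],
]

results = set()
for order in itertools.permutations(map_outputs):
    fetched = [row for block in order for row in block]
    top = tuple(b for _, b in sorted(fetched, key=lambda r: r[0])[:6])
    results.add(top)

# More than one distinct tuple in `results` means the output depends on
# the order in which the blocks were fetched.
```

Any per-process determinism (such as the patch's extra pass through doExecute) cannot remove this source of variation, because it originates in the network-level fetch order.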

That effect doesn't show up in the toy examples that use LocalRelation / parallelize, but I think we could demonstrate it with a more realistic example involving multiple input partitions and multiple executors.

As a result, I don't think this patch's changes are sufficient to guarantee deterministic results when the sort order does not totally order the records. Therefore I don't think we should make this change: it adds performance overhead but does not actually solve the problem of differing results under non-total sort orders.


wangyum commented Jul 8, 2022

Thank you @JoshRosen. Makes sense. I will close this PR.


Labels: SQL

3 participants