[SPARK-21478][SQL] Avoid unpersisting related Datasets #18771

vinodkc · 2017-07-29T18:06:58Z

What changes were proposed in this pull request?

When unpersisting a Dataset, unpersist and remove only that Dataset's plan from CacheManager's cachedData.

How was this patch tested?

Added unit tests

gatorsmile · 2017-07-29T18:43:42Z

sql/core/src/main/scala/org/apache/spark/sql/execution/CacheManager.scala

   */
  def uncacheQuery(spark: SparkSession, plan: LogicalPlan, blocking: Boolean): Unit = writeLock {
    val it = cachedData.iterator()
    while (it.hasNext) {
      val cd = it.next()
-      if (cd.plan.find(_.sameResult(plan)).isDefined) {
+      if (plan.sameResult(cd.plan)) {


This is by design, to avoid returning incorrect results. See the original PR: #17097

@gatorsmile Thanks for the update.
I'll close this PR.
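
A minimal sketch of how the two conditions in the diff differ, using a toy plan tree rather than Spark's LogicalPlan. Everything here (Plan, Relation, Filter, UncacheSketch, and structural equality standing in for sameResult) is a hypothetical simplification for illustration, not the CacheManager implementation. With the original condition, uncaching the base plan also matches the cached entry derived from it, which is the behavior the review comment describes as intentional; the proposed condition matches only the base entry.

```scala
// Toy stand-in for a logical plan tree; NOT Spark's LogicalPlan.
sealed trait Plan {
  def children: Seq[Plan]

  // Pre-order search for the first node satisfying `p`
  // (loosely modeled on the `find` call in the diff).
  def find(p: Plan => Boolean): Option[Plan] =
    if (p(this)) Some(this)
    else children.foldLeft(Option.empty[Plan])((acc, c) => acc.orElse(c.find(p)))

  // Assumption: structural equality stands in for Spark's sameResult.
  def sameResult(other: Plan): Boolean = this == other
}

case class Relation(name: String) extends Plan {
  def children: Seq[Plan] = Nil
}

case class Filter(condition: String, child: Plan) extends Plan {
  def children: Seq[Plan] = Seq(child)
}

object UncacheSketch {
  val base: Plan = Relation("t")
  val derived: Plan = Filter("id > 1", base)

  // Pretend both plans have been cached.
  val cached: Seq[Plan] = Seq(base, derived)

  // Original condition: any cached plan that contains a subtree
  // equivalent to `plan` is selected for uncaching.
  def originalMatches(plan: Plan): Seq[Plan] =
    cached.filter(cd => cd.find(_.sameResult(plan)).isDefined)

  // Proposed condition: only a cached plan equivalent to `plan` itself.
  def proposedMatches(plan: Plan): Seq[Plan] =
    cached.filter(cd => plan.sameResult(cd))

  def main(args: Array[String]): Unit = {
    println(s"original: ${originalMatches(base)}")
    println(s"proposed: ${proposedMatches(base)}")
  }
}
```

In this toy model, originalMatches(base) selects both cached entries while proposedMatches(base) selects only the base one, so the patch narrows the set of entries that an unpersist call removes.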

SparkQA · 2017-07-29T19:53:43Z

Test build #80046 has finished for PR 18771 at commit 857b3dd.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

vinodkc added 2 commits July 29, 2017 20:42

Fixed unpersisting related DFs

e187aeb

Updated test cases and condition for unpersist

857b3dd

gatorsmile reviewed Jul 29, 2017


vinodkc closed this Jul 30, 2017

vinodkc deleted the br_SPARK-21478 branch May 25, 2021 07:59
