Skip to content

Conversation

davies
Copy link
Contributor

@davies davies commented Nov 13, 2015

Currently, all the shuffle writer will write to target path directly, the file could be corrupted by other attempt of the same partition on the same executor. They should write to temporary file then rename to target path, as what we do in output committer. In order to make the rename atomic, the temporary file should be created in the same local directory (FileSystem).

This PR is based on #9214 , thanks to @squito

@davies davies changed the title [SPARK-8029] robust shuffle writer [SPARK-8029] robust shuffle writer (for 1.5 branch) Nov 13, 2015
@davies
Copy link
Contributor Author

davies commented Nov 13, 2015

cc @andrewor14

@SparkQA
Copy link

SparkQA commented Nov 13, 2015

Test build #45841 timed out for PR 9686 at commit 7e83298 after a configured wait of 175m.

@SparkQA
Copy link

SparkQA commented Nov 13, 2015

Test build #45847 has finished for PR 9686 at commit a6d569e.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@andrewor14
Copy link
Contributor

retest this please

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@aarondav Do you remember why we did not enable consolidation in this test? Since the title is "consolidated shuffle can write to shuffle group without messing existing offsets/lengths" (introduced in #1678)

@SparkQA
Copy link

SparkQA commented Nov 13, 2015

Test build #45865 has finished for PR 9686 at commit a6d569e.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@andrewor14
Copy link
Contributor

retest this please

@SparkQA
Copy link

SparkQA commented Nov 13, 2015

Test build #45873 has finished for PR 9686 at commit e95fcf5.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@andrewor14
Copy link
Contributor

merged into 1.5

asfgit pushed a commit that referenced this pull request Nov 13, 2015
Currently, all the shuffle writer will write to target path directly, the file could be corrupted by other attempt of the same partition on the same executor. They should write to temporary file then rename to target path, as what we do in output committer. In order to make the rename atomic, the temporary file should be created in the same local directory (FileSystem).

This PR is based on #9214 , thanks to squito

Author: Davies Liu <davies@databricks.com>

Closes #9686 from davies/writer_1.5 and squashes the following commits:

e95fcf5 [Davies Liu] fix test
a6d569e [Davies Liu] fix consolidate
7e83298 [Davies Liu] robust shuffle writer
@SparkQA
Copy link

SparkQA commented Nov 13, 2015

Test build #45879 has finished for PR 9686 at commit e95fcf5.

  • This patch fails PySpark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@andrewor14
Copy link
Contributor

@davies can you close this

@davies davies closed this Nov 13, 2015
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants