[SPARK-17119][Core]allow the history server to delete .inprogress files(configurable) #16293

cnZach · 2016-12-15T07:42:53Z

What changes were proposed in this pull request?

The History Server (HS) currently considers only completed applications when deleting event logs from spark.history.fs.logDirectory (since SPARK-6879). As a result, .inprogress files (from failed jobs, jobs where the SparkContext is never closed, spark-shell exits, etc.) can accumulate over time and impact the HS.

Instead of requiring these files to be deleted manually, this change adds configurable settings that let the user decide whether .inprogress files should also be deleted after a period of time:
spark.history.fs.cleaner.deleteInProgress.enabled
spark.history.fs.cleaner.noProgressMaxAge
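
To make the intended behaviour concrete, here is a minimal sketch of the selection rule described above. It is not the code from this patch (the actual change is in Scala, in FsHistoryProvider, and is not reproduced here). The names `EventLog`, `logs_to_delete`, and all parameters are hypothetical. `delete_in_progress` and `no_progress_max_age_ms` stand in for the two proposed settings, and `completed_max_age_ms` is assumed to play the role of the cleaner's existing maximum age for completed logs.

```python
from dataclasses import dataclass
from typing import List

IN_PROGRESS_SUFFIX = ".inprogress"


@dataclass
class EventLog:
    """Hypothetical stand-in for an event log entry in the log directory."""
    path: str
    last_modified_ms: int


def logs_to_delete(
    logs: List[EventLog],
    now_ms: int,
    completed_max_age_ms: int,
    delete_in_progress: bool,
    no_progress_max_age_ms: int,
) -> List[str]:
    """Return the paths the cleaner would remove under the assumed policy."""
    expired = []
    for log in logs:
        age_ms = now_ms - log.last_modified_ms
        if log.path.endswith(IN_PROGRESS_SUFFIX):
            # In-progress logs are only candidates when the (proposed) flag
            # is enabled and the file has not been modified for longer than
            # the (proposed) no-progress max age.
            if delete_in_progress and age_ms > no_progress_max_age_ms:
                expired.append(log.path)
        elif age_ms > completed_max_age_ms:
            # Completed logs follow the existing age-based rule.
            expired.append(log.path)
    return expired
```

With the flag disabled, the sketch only ever returns completed logs, which corresponds to the behaviour described for the HS at the time of this PR.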

How was this patch tested?

Verified with manual tests.
Unit tests were added in FsHistoryProviderSuite.scala, but I was not able to run ./dev/run-tests for the whole project on my laptop: it failed on SparkSinkSuite and on network-related tests under org.apache.spark.network.* (all due to java.io.IOException: Failed to connect to /<my_laptop_ip>:62343).
[info] SparkSinkSuite:
[info] - Success with ack *** FAILED *** (1 minute)
[info] java.io.IOException: Error connecting to /0.0.0.0:62298
[info] at org.apache.avro.ipc.NettyTransceiver.getChannel(NettyTransceiver.java:261)

doc

monitoring.md is also updated

AmplabJenkins · 2016-12-15T07:47:15Z

Can one of the admins verify this patch?

vanzin · 2016-12-15T21:06:20Z

@cnZach could you close this? This was already implemented in SPARK-8617. I don't think we need a config option.

splinepalash · 2016-12-17T08:04:56Z

Hi Respected contributors,

Thanks for all of your hard work to make this platform really stable.

I'm fairly new to Spark and am building an application with it. I have been having problems starting new Spark jobs when a previous job has failed for some reason. Could anyone please help me understand how I can use this functionality? Is it packaged in any official Spark release?

Best regards,
Palash Gupta

cnZach · 2016-12-19T00:52:37Z

Since some of these changes are already implemented in SPARK-8617, I'm closing this PR.

Yuexin Zhang added 3 commits December 15, 2016 14:19

allow the history server to delete .inprogress files and make it configurable

aa45caa

fix a typo noProgressMaxAg -> noProgressMaxAge

f281d92

fix checkstyle failures in FsHistoryProviderSuite.scala

989422d

cnZach closed this Dec 19, 2016
