Hadoop 2.2.0
Scalding 0.9.0
Cascading 2.5.2
Scala 2.10.4
I'm running into the following failure when processing larger volumes of data:
2015-01-20 21:49:59,752 WARN cascading.flow.FlowStep (pool-2-thread-1): [com.co.scalding...] failure info: Application application_1421789685841_0027 failed 2 times due to Error launching appattempt_1421789685841_0027_000002. Got exception: java.io.EOFException
at java.io.DataInputStream.readFully(DataInputStream.java:197)
at java.io.DataInputStream.readFully(DataInputStream.java:169)
at org.apache.hadoop.security.Credentials.readTokenStorageStream(Credentials.java:189)
at org.apache.hadoop.yarn.server.resourcemanager.amlauncher.AMLauncher.setupTokens(AMLauncher.java:225)
at org.apache.hadoop.yarn.server.resourcemanager.amlauncher.AMLauncher.createAMContainerLaunchContext(AMLauncher.java:197)
at org.apache.hadoop.yarn.server.resourcemanager.amlauncher.AMLauncher.launch(AMLauncher.java:107)
at org.apache.hadoop.yarn.server.resourcemanager.amlauncher.AMLauncher.run(AMLauncher.java:249)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:744)
. Failing the application.
Has this been fixed in a prior release?
The current workaround seems to be to have Scalding/Cascading produce smaller jobs, i.e. one sink/one tap.
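A minimal sketch of what "smaller jobs" could look like, assuming the fields-based Scalding API. The class names, argument keys (`input`, `errors`, `warnings`), and filter predicates are hypothetical and not taken from the failing job. The idea is to replace one job that writes several sinks with separate jobs that each read one source and write one sink:

```scala
import com.twitter.scalding._

// Hypothetical job 1: one source tap, one sink tap.
class ErrorLinesJob(args: Args) extends Job(args) {
  TextLine(args("input"))   // TextLine exposes the fields 'offset and 'line
    .read
    .filter('line) { line: String => line.contains("ERROR") }
    .write(Tsv(args("errors")))
}

// Hypothetical job 2: same source, but submitted as its own flow
// with its own single sink.
class WarnLinesJob(args: Args) extends Job(args) {
  TextLine(args("input"))
    .read
    .filter('line) { line: String => line.contains("WARN") }
    .write(Tsv(args("warnings")))
}
```

Each class would be launched as a separate run, so each flow stays small. Whether this actually avoids the `EOFException` above is an assumption based on what I have observed so far.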
Thank you for taking a look.