All the tasks keep restarting with the error shown in the logs below.
I tried:
- upgrading the node from 0.12.1 to 0.12.2
- spinning up a whole new server and creating a replica.
Completely unrecoverable!
2018-08-13 15:41:48,555 INFO i.d.i.o.ForkingTaskRunner [forking-task-runner-1] Exception caught during execution
java.io.IOException: Stream closed
    at java.io.BufferedInputStream.getBufIfOpen(BufferedInputStream.java:170) ~[?:1.8.0_181]
    at java.io.BufferedInputStream.read1(BufferedInputStream.java:291) ~[?:1.8.0_181]
    at java.io.BufferedInputStream.read(BufferedInputStream.java:345) ~[?:1.8.0_181]
    at java.io.FilterInputStream.read(FilterInputStream.java:107) ~[?:1.8.0_181]
    at com.google.common.io.ByteStreams.copy(ByteStreams.java:175) ~[guava-16.0.1.jar:?]
    at io.druid.indexing.overlord.ForkingTaskRunner$1.call(ForkingTaskRunner.java:452) [druid-indexing-service-0.12.2.jar:0.12.2]
    at io.druid.indexing.overlord.ForkingTaskRunner$1.call(ForkingTaskRunner.java:224) [druid-indexing-service-0.12.2.jar:0.12.2]
    at java.util.concurrent.FutureTask.run(FutureTask.java:266) [?:1.8.0_181]
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) [?:1.8.0_181]
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) [?:1.8.0_181]
    at java.lang.Thread.run(Thread.java:748) [?:1.8.0_181]
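For anyone reading the trace above: the exception itself is raised by the JDK's `BufferedInputStream`, not by Druid-specific logic. The snippet below is only a minimal sketch of how that message arises, assuming nothing about Druid internals. The class name `StreamClosedDemo` and the helper `readAfterClose` are hypothetical names made up for illustration. It does not show why the stream was closed in this case, which is the actual open question.

```java
import java.io.BufferedInputStream;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;

// Hypothetical demo class, not part of Druid.
class StreamClosedDemo {

    // Closes a buffered stream, then tries to read from it.
    // Returns the message of the resulting IOException, or null if none was thrown.
    static String readAfterClose() {
        InputStream in = new BufferedInputStream(
                new ByteArrayInputStream(new byte[] {1, 2, 3}));
        try {
            in.close();
            // Goes through FilterInputStream.read(byte[]) into BufferedInputStream,
            // which checks whether the stream is still open before reading.
            in.read(new byte[8]);
            return null;
        } catch (IOException e) {
            return e.getMessage();
        }
    }

    public static void main(String[] args) {
        System.out.println(readAfterClose());
    }
}
```

In the trace, the read is issued by `ByteStreams.copy` inside `ForkingTaskRunner`, so the stream being copied was apparently closed while the copy was still running. What closed it cannot be determined from the trace alone.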
I even posted a thread here:
https://groups.google.com/forum/#!topic/druid-user/K7YmcBJM6T4
More details:
- The Coordinator+Overlord node is still at 0.12.1 (we haven't upgraded it yet, for fear of more damage).
- Another replica of the MiddleManager (MM) node (we deploy 2) has been running without issue. We are very, very scared that if that replica dies and gets stuck like this, ALL ingestion will stop!
All help is highly appreciated.