Skip to content

Unrecoverable error in KIS 0.12.2 and 0.12.1 #6166

@pdeva

Description

@pdeva

all the tasks keep restarting with this error in the logs:
I tried:

  • upgrading node from 0.12.1 to 0.12.2
  • spinning a whole new server and created a replica.
    Completely unrecoverable!
2018-08-13 15:41:48,555 INFO i.d.i.o.ForkingTaskRunner [forking-task-runner-1] Exception caught during execution
java.io.IOException: Stream closed
	at java.io.BufferedInputStream.getBufIfOpen(BufferedInputStream.java:170) ~[?:1.8.0_181]
	at java.io.BufferedInputStream.read1(BufferedInputStream.java:291) ~[?:1.8.0_181]
	at java.io.BufferedInputStream.read(BufferedInputStream.java:345) ~[?:1.8.0_181]
	at java.io.FilterInputStream.read(FilterInputStream.java:107) ~[?:1.8.0_181]
	at com.google.common.io.ByteStreams.copy(ByteStreams.java:175) ~[guava-16.0.1.jar:?]
	at io.druid.indexing.overlord.ForkingTaskRunner$1.call(ForkingTaskRunner.java:452) [druid-indexing-service-0.12.2.jar:0.12.2]
	at io.druid.indexing.overlord.ForkingTaskRunner$1.call(ForkingTaskRunner.java:224) [druid-indexing-service-0.12.2.jar:0.12.2]
	at java.util.concurrent.FutureTask.run(FutureTask.java:266) [?:1.8.0_181]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) [?:1.8.0_181]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) [?:1.8.0_181]
	at java.lang.Thread.run(Thread.java:748) [?:1.8.0_181]

Even posted a thread here:
https://groups.google.com/forum/#!topic/druid-user/K7YmcBJM6T4

More details:

  • Coord+overlord node is still at 0.12.1 (we havent upgraded it yet, for fear of more damage)
  • another replica of MM node (we deploy 2) has been running without issue. we are very, very scared that if that replica dies and gets stuck like this, ALL ingestion will stop!

All help is highly appreciated.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions