Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Failed to build store metadata causes OutOfMemory exception #18984

Closed
edalford11 opened this issue Jun 20, 2016 · 3 comments
Closed

Failed to build store metadata causes OutOfMemory exception #18984

edalford11 opened this issue Jun 20, 2016 · 3 comments
Labels
discuss :Distributed/Engine Anything around managing Lucene and the Translog in an open shard.

Comments

@edalford11
Copy link

Elasticsearch version: 2.3.2

JVM version: 1.7.0_91

OS version: CentOS 6.7

We received the following error message in the logs

[2016-06-20 19:33:24,465][WARN ][index.store              ] [search1_node1] [v4_messages_12_2013][3]failed to build store metadata. checking segment info integrity (with commit [no])
java.nio.file.NoSuchFileException: /var/lib/elasticsearch/stocktwits/nodes/0/indices/v4_messages_12_2013/3/index/segments_27

which immediately caused the node to reach an OutOfMemory exception, even though it was only using 70% of the heap. After doing some digging it appears that index it is referring to could be corrupted. Is this a cause of running an older version of java?

@clintongormley clintongormley added discuss :Distributed/Store Issues around managing unopened Lucene indices. If it touches Store.java, this is a likely label. labels Jun 21, 2016
@bleskes
Copy link
Contributor

bleskes commented Jun 21, 2016

@edalford11 can you post the rest of the error message? I'm curious to see the stack trace.

After doing some digging it appears that index it is referring to could be corrupted

Can you share what you found?

Is this a cause of running an older version of java?

This should not be the case, but if you can upgrade (which is recommended and will be required as of 5.0) it will tell us more.

@edalford11
Copy link
Author

Turns out that the original log message posted came after the OutOfMemory exception happened so that makes a little more sense.

I seem to be missing log files for when the OutOfMemory exception occurred. I see the logs on all of the other nodes when the event occurred but for some reason the node that actually went out of memory is missing logs, which makes this problem hard to solve.

All of the nodes were 70% and under on the heap usage which makes me confused as to why this happened but without logs I suppose it makes this hard to diagnose. I'll keep digging but this can probably be closed until I get more information.

@bleskes
Copy link
Contributor

bleskes commented Jun 21, 2016

yeah, OOMs are nasty. I'm closing this for now. Please reopen (or open a new ticket) if you find something interesting.

@bleskes bleskes closed this as completed Jun 21, 2016
@clintongormley clintongormley added :Distributed/Engine Anything around managing Lucene and the Translog in an open shard. and removed :Distributed/Store Issues around managing unopened Lucene indices. If it touches Store.java, this is a likely label. labels Feb 13, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
discuss :Distributed/Engine Anything around managing Lucene and the Translog in an open shard.
Projects
None yet
Development

No branches or pull requests

3 participants