Persistent Problems with Shard / Cache corruption #23
Comments
I think I actually know what this is now. When I tail the logs while querying the failing time range, I see an error like 'blah blah metadata.txt not found'. In an ideal world it shouldn't fail.
There is a configuration parameter for the file cache size, specified in /opt/imhotep/imhotep-caching.yaml:
If you have a configuration like that, check the size of /var/data/file_cache and make sure the cache size is well under the available space (minus current cache size)
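A rough way to run that check is sketched below. It assumes the configured cache limit is known in bytes and passed in by hand; `directory_size` and `cache_headroom` are hypothetical helper names, not part of Imhotep, and the script does not read imhotep-caching.yaml.

```python
import os
import shutil


def directory_size(path):
    """Total size in bytes of regular files under path (symlinks are skipped)."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            full = os.path.join(root, name)
            try:
                if not os.path.islink(full):
                    total += os.path.getsize(full)
            except OSError:
                pass  # file disappeared while we were scanning
    return total


def cache_headroom(cache_dir, configured_limit_bytes):
    """Bytes that would remain free on the volume if the cache grew to its limit.

    configured_limit_bytes is an assumption: the cache size from your own config,
    converted to bytes by hand.
    """
    used_by_cache = directory_size(cache_dir)
    free_now = shutil.disk_usage(cache_dir).free
    # Space the cache may still claim before it reaches its configured limit.
    growth = max(configured_limit_bytes - used_by_cache, 0)
    return free_now - growth
```

For example, `cache_headroom("/var/data/file_cache", 50 * 1024**3)` would report the headroom for a hypothetical 50 GiB limit. A result that is negative or close to zero suggests the configured cache size is too large for the volume.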
Adjust the
I got this note from an Indeed developer about our experiences with running out of space for cache:
We have an internal fix for the empty directories cleanup that we need to get done here. Since that seems like a very likely cause, this issue can track delivering that fix.
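Until that fix lands, one way to see whether empty directories are piling up in the cache is to list them. This is only a sketch: `find_empty_dirs` is a hypothetical helper, the cache's on-disk layout is not documented in this thread, and the code only reports directories without deleting anything.

```python
import os


def find_empty_dirs(cache_dir):
    """Return directories under cache_dir that contain no files and no subdirectories."""
    found = []
    for root, dirs, files in os.walk(cache_dir):
        # Skip the cache root itself; only report empty directories inside it.
        if root != cache_dir and not dirs and not files:
            found.append(root)
    return sorted(found)
```

Running `find_empty_dirs("/var/data/file_cache")` when a query starts failing, and comparing the output with the date range that breaks, may help confirm or rule out this cause.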
I am having persistent problems like this:
Every so often, I suddenly get an error:
java.lang.RuntimeException: unable to open session
This error is tied to a specific date range in a dataset.
E.g. if my query includes that date in that dataset, it breaks; if not, it doesn't.
I can solve this by deleting the contents of the cache:
sudo rm -rf /var/data/file_cache/*
and then restarting the daemon and killing all active imhotep processes (a workaround needed because of #19).
However, it sucks that I have to do this manually, and fairly often.
When I look in the logs for the daemon, I see some periodic problem that looks like:
2017-08-21 17:29:06,578 ERROR [CachingLocalImhotepServiceCore] Exception during cleanup of a Closeable, ignoring
java.lang.NullPointerException
at com.indeed.imhotep.io.Shard.close(Shard.java:131)
at com.indeed.util.core.reference.SharedReference.decRef(SharedReference.java:111)
at com.indeed.util.core.reference.SharedReference.close(SharedReference.java:76)
at com.indeed.util.core.io.Closeables2.closeQuietly(Closeables2.java:29)
at com.indeed.imhotep.service.CachingLocalImhotepServiceCore.updateShards(CachingLocalImhotepServiceCore.java:308)
at com.indeed.imhotep.service.CachingLocalImhotepServiceCore.<init>(CachingLocalImhotepServiceCore.java:148)
at com.indeed.imhotep.service.ImhotepDaemon.newImhotepDaemon(ImhotepDaemon.java:758)
at com.indeed.imhotep.service.ImhotepDaemon.main(ImhotepDaemon.java:728)
at com.indeed.imhotep.service.ImhotepDaemon.main(ImhotepDaemon.java:694)
Not sure if related.