This repository has been archived by the owner on Nov 4, 2022. It is now read-only.

ulimit nofiles reached on systems with large amounts of retention #136

Closed
hosom opened this issue May 3, 2016 · 7 comments

Comments

@hosom
Contributor

hosom commented May 3, 2016

I have a system with 35 threads writing 2-6 MB index files and ~250 MB packet files.

This worked great until we passed 20 days of retention, at which point we hit the nofile ulimit, since Stenographer keeps a handle open for every index file. It would be pretty awesome to have some mechanism that supports large amounts of retention by opening and closing old index files as needed.
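
To make "opening/closing old index files as needed" concrete, here is a minimal sketch of a bounded cache of open file handles with least-recently-used eviction. It is an illustration only, not Stenographer's code: `HandleCache`, `NewHandleCache`, `Get`, and `runDemo` are hypothetical names, and the capacity of 4 is an arbitrary value chosen for the demo.

```go
package main

import (
	"container/list"
	"fmt"
	"os"
	"path/filepath"
)

// entry pairs a path with its open handle.
type entry struct {
	path string
	file *os.File
}

// HandleCache (hypothetical) keeps at most capacity files open at once.
type HandleCache struct {
	capacity  int
	order     *list.List               // front = most recently used
	items     map[string]*list.Element // path -> element in order
	Evictions int                      // handles closed to stay under capacity
}

func NewHandleCache(capacity int) *HandleCache {
	if capacity < 1 {
		capacity = 1
	}
	return &HandleCache{
		capacity: capacity,
		order:    list.New(),
		items:    make(map[string]*list.Element),
	}
}

// Get returns an open handle for path, reopening the file if it was evicted.
func (c *HandleCache) Get(path string) (*os.File, error) {
	if el, ok := c.items[path]; ok {
		c.order.MoveToFront(el)
		return el.Value.(*entry).file, nil
	}
	f, err := os.Open(path)
	if err != nil {
		return nil, err
	}
	c.items[path] = c.order.PushFront(&entry{path: path, file: f})
	// Close least recently used handles until we are back under capacity.
	for c.order.Len() > c.capacity {
		old := c.order.Remove(c.order.Back()).(*entry)
		delete(c.items, old.path)
		old.file.Close()
		c.Evictions++
	}
	return f, nil
}

// Len reports how many handles are currently open.
func (c *HandleCache) Len() int { return c.order.Len() }

// Close closes every handle still held by the cache.
func (c *HandleCache) Close() {
	for c.order.Len() > 0 {
		old := c.order.Remove(c.order.Back()).(*entry)
		delete(c.items, old.path)
		old.file.Close()
	}
}

// runDemo creates n small files in a temp dir, requests each one through a
// cache of the given capacity, and reports open handles and evictions.
func runDemo(capacity, n int) (open, evictions int, err error) {
	dir, err := os.MkdirTemp("", "handlecache")
	if err != nil {
		return 0, 0, err
	}
	defer os.RemoveAll(dir)

	c := NewHandleCache(capacity)
	defer c.Close()
	for i := 0; i < n; i++ {
		p := filepath.Join(dir, fmt.Sprintf("index-%03d", i))
		if err := os.WriteFile(p, []byte("x"), 0o600); err != nil {
			return 0, 0, err
		}
		if _, err := c.Get(p); err != nil {
			return 0, 0, err
		}
	}
	return c.Len(), c.Evictions, nil
}

func main() {
	open, evictions, err := runDemo(4, 10)
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Printf("open handles: %d, evictions: %d\n", open, evictions)
}
```

Note that this sketch is not safe for concurrent use, and a handle returned by `Get` can be closed by a later `Get` that triggers eviction. A real implementation would likely need locking and some way to pin handles that are in use.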

@gconnell
Contributor

gconnell commented May 9, 2016

There's an experimental filecache branch uploaded now. Please let me know if it works, and if not please send me any logs you can from stenographer so I can debug.

@hosom
Contributor Author

hosom commented May 16, 2016

Just an update...

We've been running stable since 5/4 on the filecache branch. I'm not experiencing any user-noticeable slowness at the moment. I have steno writing in 1-minute segments with verbose logging enabled, so if anything goes wrong we'll be able to troubleshoot.
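
For a rough sense of scale, the arithmetic below estimates how many index files accumulate with 35 threads and 1-minute segments. It assumes each thread writes exactly one index file per segment, which the thread does not state outright, and it does not claim this was the rotation interval in effect when the limit was first hit. `indexFiles` is a hypothetical helper.

```go
package main

import "fmt"

// indexFiles estimates the number of index files on disk after `days` of
// retention, assuming every thread writes one index file per segment.
func indexFiles(threads, segmentsPerDay, days int) int {
	return threads * segmentsPerDay * days
}

func main() {
	const threads = 35
	const segmentsPerDay = 24 * 60 // one segment per minute
	for _, days := range []int{1, 20} {
		fmt.Printf("%2d day(s): %d index files\n",
			days, indexFiles(threads, segmentsPerDay, days))
	}
}
```

Under those assumptions that is 50,400 files per day and 1,008,000 after 20 days, which is why one open handle per index file stops scaling as retention grows.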

@gconnell
Contributor

Excellent to hear!

Given that, I'll probably pull the filecache branch into master. Thanks for your work testing this out!

@hosom
Contributor Author

hosom commented May 25, 2016

Queries still returning successfully. No errors in the logs when I reviewed them this morning.

@hosom
Contributor Author

hosom commented May 31, 2016

Everything still stable and no errors to report.

@hosom
Contributor Author

hosom commented Jul 12, 2016

We haven't restarted services since we installed the experimental branch. We can still retrieve packets without errors, and we have had no performance issues.

@hosom
Contributor Author

hosom commented Jul 19, 2017

LRU cache has definitely solved this issue.

@hosom hosom closed this as completed Jul 19, 2017

2 participants