Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

sos: addWatch: resource exhausted (No space left on device) #34

Open
stites opened this issue Jun 21, 2018 · 8 comments
Open

sos: addWatch: resource exhausted (No space left on device) #34

stites opened this issue Jun 21, 2018 · 8 comments

Comments

@stites
Copy link

stites commented Jun 21, 2018

When running sos on Ubuntu-17.10 (x64), I occasionally get sos: addWatch: resource exhausted (No space left on device) at first I thought this was only when I was in tmux, but it looks like it happens in some non-deterministic fashion. Memory usage is <25% utilization and disk space is ~66% -- so I'm not sure what kind of space is being referred to in this error. I've never had this problem on Archlinux or Debian (so far).

If someone could point me in a direction, I can try to fix this myself (I imagine it is a bit difficult to simulate).

@stites
Copy link
Author

stites commented Jun 21, 2018

It looks like rebooting will clear up this issue -- is it possible that steeloverseer is missing a clean-up step?

@schell
Copy link
Owner

schell commented Jun 22, 2018

In my experience when linux says "no space left on device" - "device" means "disk". I am by no means a linux expert though. Are you running sos on a separate filesystem? Maybe in a dropbox folder or a mounted drive? Just spit balling. :)

@aboutthomas
Copy link

Possibly relevant: google/cadvisor#1581 (comment).

@stites
Copy link
Author

stites commented Nov 14, 2018

@sjakobi-as My internal simulator wouldn't be surprised if this was the culprit: perhaps when you run steeloverseer in tmux, watches aren't evicted as you would expect because of the top-level process.

I've been avoiding using sos because of this bug. I'll switch back and report what I find.

@stites stites changed the title [Ubuntu] sos: addWatch: resource exhausted (No space left on device) sos: addWatch: resource exhausted (No space left on device) Dec 7, 2018
@stites
Copy link
Author

stites commented Dec 7, 2018

Ah! I think I found the bug. I was using sos and, upon termination (via ctrl-c), saw this:

^CError removing watch: <wd=132>
Error removing watch: <wd=131>
Error removing watch: <wd=130>
Error removing watch: <wd=129>
Error removing watch: <wd=128>
Error removing watch: <wd=127>
Error removing watch: <wd=126>
Error removing watch: <wd=125>
Error removing watch: <wd=124>
Error removing watch: <wd=123>
Error removing watch: <wd=122>
Error removing watch: <wd=121>
Error removing watch: <wd=120>
Error removing watch: <wd=34>

I don't have time to make the fix at the moment, but I think this behavior confirms @sjakobi-as' observation. I'll try to lock down some test cases.

@schell
Copy link
Owner

schell commented Dec 7, 2018

Thanks for your diligence @stites :)

@schell
Copy link
Owner

schell commented Dec 7, 2018

Try providing the directory of the sources (code) you'd like to watch, this will limit soss watching behavior. Seems to work for me.

@stites
Copy link
Author

stites commented Dec 7, 2018

No need! This is reproducible on two of my machines, but the bug is that fsnotify is swallowing errors thrown by hinotify. See haskell-fswatch/hfsnotify#85

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants