New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
runq_overload everyday for few minutes #12919
Comments
Hello, This alarm is raised when there are too many Erlang processes with planned tasks. It can happen for many reasons:
So it can't be pinpointed to a single cause. Therefore, we need to know more about your setup. What authentication and authorization plugins are enabled? Is there any particular pattern to the client behavior? |
runq_overload means you are lacking of computing (CPU) resources during that period, there are 104 processes cannot be scheduled. It is not critical issue, if it happens very often, it is a sign to scale up your node. Do you see CPU spikes? how many cores do you have, are they shared?
do you have other service run on the same node? |
To slightly elaborate on @qzhuyan 's reply:
There are 104 processes that are waiting to be scheduled, to be more precise. Essentially, this alarm tells that the system stopped being soft real-time, and there is CPU time starvation. |
We've provided a high level explanation of the alarm. I close the issue, since no further details were given. |
What happened?
Im getting this alarm runq_overload: VM is overloaded on node: '<node_name>': 104
for around 1-2 mins(duration) everyday.
I have two nodes in my cluster, each node is of 4GB memory
My traffic is almost the same all day,
~420 incoming msgs/sec
~220 outgoing msgs/sec
~32,000 clients
usually my RAM is usage always 1.2GB/4GB, there is no spike in RAM when i receive alerts
But i observed a spike in Disk I/O when there is runq_overload
Please help me in resolving this issue.
What did you expect to happen?
No error everyday, as my system resources are more than enough
How can we reproduce it (as minimally and precisely as possible)?
No response
Anything else we need to know?
No response
EMQX version
OS version
Log files
The text was updated successfully, but these errors were encountered: