-
Notifications
You must be signed in to change notification settings - Fork 525
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Mellanox] PFC watchdog long term solution to reduce false alarm #3036
Conversation
5ae8448
to
429e00d
Compare
429e00d
to
3aabd58
Compare
bdd56a6
to
04dec9e
Compare
04dec9e
to
66a2105
Compare
66a2105
to
e15d569
Compare
Can you please help me understand why we set the criteria to |
Hi @bingwang-ms
|
@neethajohn Can you please help review this PR? Thanks |
@neethajohn kindly reminder. this is supposed to get into 202405 and we wish to close it with prio. |
Changes look good to me. @stephenxs , will the current sonic-mgmt pfcwd functionality tests need enhancement with the new logic ? |
|
Signed-off-by: Stephen Sun <stephens@nvidia.com>
Signed-off-by: Stephen Sun <stephens@nvidia.com>
@yxieca Can you please help approve this cherry-pick? |
The fix has been included in 202405 branch. |
…ic-net#3036) What I did Adjust PFC watchdog detection algorithm to reduce false alarms. In the old PFC watchdog detection algorithm, the PFC watchdog can be triggered if either of the following conditions is satisfied in a detection interval: There are packets accumulated in the queue && there is no packet sent out of the queue && there are PFC frames received in the queue There is no packet accumulated in the queue and there are PFC frames received and blocking more than 80% of the detection interval. The new PFC watchdog detection algorithm merges two conditions into one: The PFC watchdog is triggered only if: There are packets accumulated in the queue && there is no packet sent out of the queue && there are PFC frames received and blocking more than 99% of the detection interval.
Cherry-pick PR to 202311: #3213 |
What I did Adjust PFC watchdog detection algorithm to reduce false alarms. In the old PFC watchdog detection algorithm, the PFC watchdog can be triggered if either of the following conditions is satisfied in a detection interval: There are packets accumulated in the queue && there is no packet sent out of the queue && there are PFC frames received in the queue There is no packet accumulated in the queue and there are PFC frames received and blocking more than 80% of the detection interval. The new PFC watchdog detection algorithm merges two conditions into one: The PFC watchdog is triggered only if: There are packets accumulated in the queue && there is no packet sent out of the queue && there are PFC frames received and blocking more than 99% of the detection interval.
What I did
Adjust PFC watchdog detection algorithm to reduce false alarms.
In the old PFC watchdog detection algorithm, the PFC watchdog can be triggered if either of the following conditions is satisfied in a detection interval:
The new PFC watchdog detection algorithm merges two conditions into one:
The PFC watchdog is triggered only if:
Signed-off-by: Stephen Sun stephens@nvidia.com
Why I did it
There are some rare scenarios in which the PFC watchdog can be mistriggered
Scenarios 1 and 2 are addressed in this PR.
How I verified it
Run PFC watchdog regression test with background traffic.
Details if related