Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Workaround for division by zero crash #385

Merged
merged 1 commit into from Dec 15, 2021
Merged

Workaround for division by zero crash #385

merged 1 commit into from Dec 15, 2021

Conversation

ccascone
Copy link
Contributor

@ccascone ccascone commented Dec 10, 2021

We don't know yet the root cause of #376. The crash has been observed multiple times per day on a few production deployments. This is a workaround to prevent the crash while we investigate the root cause.

We don't know how to reproduce the issue, but we know it happens once in a while (tens of minutes to hours after starting bessd). As a consequence, we assume this workaround will be hit only a few times, causing only a few packets to skip the meter and be marked as green. We can better asses the frequency by looking at the new log message.

@ccascone
Copy link
Contributor Author

ccascone commented Dec 10, 2021

Also, the author of this PR knows very little about C++, DPDK, and BESS... we ask reviewers to please be gentle :)

@ccascone ccascone force-pushed the div-zero branch 2 times, most recently from b826989 to 08135fd Compare December 11, 2021 00:25
@ccascone
Copy link
Contributor Author

ccascone commented Dec 14, 2021

this workaround has been running on 3 deployments for more than 24 hours, no crash has been reported since then. I'm planning to merge this soon, after I figure out why CI is failing.

@amarsri28
Copy link
Contributor

pls look at the comments given.

@ccascone
Copy link
Contributor Author

ccascone commented Dec 14, 2021

@amarsri28 I replied to your comments on #376. It looks like you don't have any explicit concern with this PR, so I'm going ahead and merge this.

Base automatically changed from gate-status to master December 15, 2021 00:23
@ccascone
Copy link
Contributor Author

retest this please

@ccascone ccascone merged commit 103b198 into master Dec 15, 2021
@ccascone ccascone deleted the div-zero branch December 15, 2021 05:53
amarsri28 added a commit to amarsri28/upf-epc that referenced this pull request Feb 28, 2022
ccascone pushed a commit that referenced this pull request Mar 2, 2022
* Revert "Workaround for division by zero crash (#385)"

This reverts commit 103b198.

* Revert "QoS: Use params to profile map, carry pointer"

This reverts commit d2c0793.

Co-authored-by: Maximilian Pudelko <pudelkoM@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants