Usage: Fixup entity exists exception #8726
Codecov Report
All modified and coverable lines are covered by tests ✅
Additional details and impacted files
@@            Coverage Diff            @@
##               4.18    #8726   +/-   ##
=========================================
  Coverage     13.16%   13.16%
  Complexity     9203     9203
=========================================
  Files          2724     2724
  Lines        258137   258137
  Branches      40235    40235
=========================================
  Hits          33989    33989
+ Misses       219841   219840    -1
- Partials       4307     4308    +1
☔ View full report in Codecov by Sentry.
@blueorangutan package
@vishesh92 a [SL] Jenkins job has been kicked to build packages. It will be bundled with KVM, XenServer and VMware SystemVM templates. I'll keep you posted as I make progress.
Packaging result [SF]: ✔️ el7 ✔️ el8 ✔️ el9 ✔️ debian ✔️ suse15. SL-JID 8817
code lgtm
I have never seen an event being processed twice without manual intervention in the database; I have, however, already seen duplicate events. @vishesh92, do you have the steps to reproduce the error?
clgtm
@blueorangutan test
@sureshanaparti a [SL] Trillian-Jenkins test job (centos7 mgmt + kvm-centos7) has been kicked to run smoke tests
[SF] Trillian test result (tid-9352)
@GutoVeronezi I don't have any steps to reproduce the error. We saw the EntityExistsException in the log, and it is probably caused by duplicate events.
I think you just described the way to reproduce it: duplicate any not-yet-processed create/start/register event.
code lgtm
My point here was to try to identify the workflows causing those duplicated events; so far, I have only seen the exception when manually intervening in the database. However, if it is an ACS process that is causing this, it would be good if we could identify and fix it.
That is a good point, but we haven't yet found a way to reproduce it. It may be due to events being fired once and then handled twice, as you could expect in UDP connections, but there is no reproduction scheme yet. This is symptom fighting, admittedly.
Ideally, yes. IMHO, sometimes it is difficult to reproduce an issue and find the root cause. If so, we'd better apply the workaround to avoid subsequent issues.
@GutoVeronezi I spent some time but couldn't find the reason for the duplication of events. Even if we find and fix the root cause, we might not be able to prevent such an occurrence from happening again. So, it would be good to have the check on the usage server itself as well.
@vishesh92
@blueorangutan package
@DaanHoogland a [SL] Jenkins job has been kicked to build packages. It will be bundled with KVM, XenServer and VMware SystemVM templates. I'll keep you posted as I make progress.
Packaging result [SF]: ✔️ el7 ✔️ el8 ✔️ el9 ✔️ debian ✔️ suse15. SL-JID 8903
@blueorangutan test
@weizhouapache a [SL] Trillian-Jenkins test job (centos7 mgmt + kvm-centos7) has been kicked to run smoke tests
[SF] Trillian test result (tid-9439)
Description
While creating helper records, it's possible that an event is processed twice, which can result in a failure due to an EntityExistsException. This PR adds a catch to ensure that the job doesn't fail if the event is already present in the db.
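The idea described above can be illustrated with a minimal, self-contained sketch. This is not the code from this PR: `DuplicateEventSketch`, `HelperStore`, `createHelperRecord`, and the local `EntityExistsException` class are hypothetical stand-ins (the local exception only mimics the one seen in the usage server log), and an in-memory map plays the role of the db. It only shows the pattern of catching the exception so that a second pass over the same event is skipped instead of failing the job.

```java
import java.util.HashMap;
import java.util.Map;

public class DuplicateEventSketch {

    // Hypothetical stand-in for the EntityExistsException seen in the log.
    static class EntityExistsException extends RuntimeException {
        EntityExistsException(String message) {
            super(message);
        }
    }

    // Hypothetical in-memory replacement for the helper table in the db.
    static class HelperStore {
        private final Map<Long, String> rows = new HashMap<>();

        // Rejects a second insert with the same id, like a primary key would.
        void persist(long id, String payload) {
            if (rows.containsKey(id)) {
                throw new EntityExistsException("helper record " + id + " already exists");
            }
            rows.put(id, payload);
        }

        int size() {
            return rows.size();
        }
    }

    // Returns true if a record was created, false if the event was a duplicate.
    // The catch keeps a duplicate event from aborting the whole job.
    static boolean createHelperRecord(HelperStore store, long eventId, String payload) {
        try {
            store.persist(eventId, payload);
            return true;
        } catch (EntityExistsException e) {
            System.err.println("Skipping already processed event: " + e.getMessage());
            return false;
        }
    }

    public static void main(String[] args) {
        HelperStore store = new HelperStore();
        // The same event is handled twice; the second call is skipped.
        System.out.println(createHelperRecord(store, 1L, "VM.CREATE"));
        System.out.println(createHelperRecord(store, 1L, "VM.CREATE"));
        System.out.println(store.size());
    }
}
```

In this sketch the duplicate is logged and ignored, so later events in the same run are still processed. Whether the real job should log, count, or otherwise report such duplicates is a separate design choice not settled by this example.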
Types of changes
Bug Severity
How Has This Been Tested?
How did you try to break this feature and the system with this change?