-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Golden Record: Confidence Level Logic #616
Comments
A solution for determining the overlap counter for point 3 could look like this: When the BPNs for a record change a new uuidv3 is generated and the count for the other BPNs has to be reduced. In order to make this information available to the curation services each business partner record also contains a field for the previous unique ID, if exists. This way the process canself correct the overlaps count on new updates. One problem: The first time before a BPN is assigned we can't really create a meaningful uuidv3. The count would only work after the "second round". Proposal: use a uuidv4 for generating an entirely random ID, that is being used as a placeholder. It will be replaced on the next update when a uuidv3 can be generated. |
I think there is an even simpler solution: The cleaning service should keep a mapping between UUID and BPN. |
Good approach but keep in mind that there are duplicates in the Gate, which link to the same BPNA. They are sometimes even intended. So the gate needs to keep track of these non-intended / intended duplicates and assign the same UUID for counting to all these duplicates. |
Exactly, such duplicates in the same Gate are the reason why I poposed UUIDv3 in the first place, as that naturally leads to the same UUID for records that are in the same Gate and have the same BPNs. I think it's not possible to prevent reassigning unique IDs and employ a self correcting algorithm under these circumstances. But maybe someone has a simpler solution. |
You're right that's something I didn't consider yet. I could see a somewhat simpler variation: So this works similar to the initial proposition with the previous UUID, but tracks BPN changes related to RecordUUID. Let's discuss if we prefer this or not. Using the Gate owner's BPNL for calculating the hash has some potential risk: It might be possible to guess the owner's BPNL from the hash by brute-force, because the BPN is known and for the owner's BPNL there is just a relatively small list of options. |
This is an issue for discussing the logic to calculate the confidence level business partner records.
Description
Sharing members would like to see how good the quality of a given golden record is. Quality meaning the confidence of the golden record process on how likely the data of that record is correct.
The confidence level should be calculated from three different factors:
The responsibility to create such a final confidence level lies with the cleaning service. However, the BPDM system should be able to hold the confidence level as well as provide enough information to calculate such a number.
Thoughts on Factors
Dependencies
This story depends on creating the data model changes to include fields to store and show the confidence level: #631
The text was updated successfully, but these errors were encountered: