Duplicate epikeys across unique token_person_id (HES-APC) #51
Replies: 2 comments 9 replies
Hi Yasmin
Howdy, after further investigation we believe we have replicated some of your issues. Under certain conditions, we've found duplicated keys in APC 2021 and APC 2022. The good news is that the raw data itself isn't the issue; rather, the code used to access the raw data returned the data in scientific (e) notation, which caused the duplicate epikeys. Could you kindly try running the following line of code:
Could you then confirm whether the EPIKEYs are displayed in e notation, please?
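To illustrate the mechanism described in this reply, here is a minimal Python sketch of how e notation can collapse distinct keys into one. The epikey values, the `to_e_notation` helper, and the choice of 7 significant digits are all assumptions made for the example; they are not taken from the HES-APC data or from the code used to access it.

```python
# Hypothetical epikeys: distinct 12-digit integers, made up for illustration.
epikeys = [501234567891, 501234567892, 501234567893]

def to_e_notation(value, sig_digits=7):
    """Format a number in scientific notation with a limited number of
    significant digits (7 is an assumed, typical display precision)."""
    return f"{float(value):.{sig_digits - 1}e}"

# Keys as they would appear if exported or displayed in e notation.
as_text = [to_e_notation(k) for k in epikeys]

# Reading the e-notation text back in cannot recover the lost digits.
round_tripped = [int(float(s)) for s in as_text]

print(as_text)
print("distinct before:", len(set(epikeys)))
print("distinct after: ", len(set(round_tripped)))
```

In this sketch the three distinct keys become indistinguishable once formatted, which is the kind of spurious duplication described above. Handling the key as text or as a 64-bit integer, rather than as a floating-point number, avoids the loss of digits.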
Hello!
I have been working through Matt's 'deliveries cohort' scripts (https://github.com/UCL-CHIG/deliveries-cohort) within my ECHILD project, for the years 2010-2022.
When trying to identify and de-duplicate delivery episodes, I find that a substantial number of "epikey" values are duplicated in my dataset across unique "token_person_id" values. Most of these occur within financial years, i.e. the same "epikey" is linked to different "token_person_id" values within a single financial year. This affects just over 10% of my token_person_id values.
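A check of this kind can be sketched in plain Python as follows. The row layout (financial year, epikey, token_person_id), the sample values, and the `shared_epikeys` helper are hypothetical and only illustrate the idea; they are not part of the deliveries-cohort scripts.

```python
from collections import defaultdict

# Hypothetical rows: (financial_year, epikey, token_person_id); values are made up.
rows = [
    ("2021", "501234567891", "A"),
    ("2021", "501234567891", "B"),  # same epikey, different person, same year
    ("2021", "501234567892", "C"),
    ("2022", "501234567891", "A"),  # same epikey again, but in another year
    ("2022", "501234567893", "D"),
]

def shared_epikeys(rows):
    """Return {(year, epikey): person ids} for keys linked to more than
    one person within the same financial year."""
    people = defaultdict(set)
    for year, epikey, person in rows:
        people[(year, epikey)].add(person)
    return {key: ids for key, ids in people.items() if len(ids) > 1}

dupes = shared_epikeys(rows)
affected = set().union(*dupes.values()) if dupes else set()
all_people = {person for _, _, person in rows}

print(dupes)
print(f"{len(affected)} of {len(all_people)} people affected")
```

Epikeys are held as strings here so that the comparison is made on the full key rather than on a rounded numeric value.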
Is this a known issue / am I missing something simple / or does this seem to be an issue with something I've done?
Any advice much appreciated!
Thank you,
Yasmin