-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merge APS investigation outcomes into DETECT data #39
Comments
Initial review of the APS subject identifier data appeared to indicate that the variable (client_id) would be readily valid, with a few "failed matches" noted due to multiple (client_id) values associated with certain (case_id) values (each case should only belong to a single subject). Data largely appeared to be clean and ready to use. However, further examination has found significant typographical and other errors in the identifier data fields and a much larger degree of "failed matches" for subject-id values. As such, the data requires significant cleaning in preparation for fuzzy-matching algorithm application and a within-set APS subject ID would need to be created. Cleaning of APS data is underway. Name fields:
Potentially valuable information (such as "female" if a name was given as "unknown female" or a suffix trimmed from a name value) is being shifted to a comment field, so it is available in manual review of fuzzy-match pairs. Additionally, some exploration of address values has been completed.
|
As of today, further progress has been made in cleaning/standardizing the APS Client data Name fields are clean! 🎉
Address fields are pending only street address cleaning/validation:
|
As of today, further progress has been made in cleaning/standardizing the APS Client data. Address fields are pending only street address cleaning/validation completion. Street Addresses are a bear.
|
As of today, further progress has been made in cleaning/standardizing the APS Client data Address fields are pending only street address and street unit cleaning/validation completion. Street Addresses are a bear.
|
As of today, further progress has been made in cleaning/standardizing the APS Client data Address fields are pending only street address and street unit cleaning/validation completion. Street Addresses are a bear.
|
As of today, further progress has been made in cleaning/standardizing the APS Client data. Separation of secondary address values should be complete (within reason) at this time, though QC checks are designed to help catch other potential remaining values. Address fields are pending only street address and street unit cleaning/validation completion. Street Addresses are a bear.
|
Overview
On 2024-03-21, Catherine sent us a new batch of APS data. We need to merge the APS outcomes with our DETECT screenings for a publication.
We want to link APS investigation outcomes to DETECT screenings completed by MedStar during the R01 phase of DETECT. The DETECT data we want to use for linking is
participant_import.rds
.Links
Tasks
The text was updated successfully, but these errors were encountered: