R scripts to produce high-level characterization data to benchmark across DAPs and with external resources.
Table of Contents
Aims of Level 3 quality checks:
1. To perform high-level data characterization of the ConcePTION CDM instance for each DAP and benchmark across DAPs and with external resources.
a. Assessing medication use in study population.
b. Assessing vaccine exposure in study population.
c. Calculation of incidence rates of events during study period.
d. Assessing outcomes in relation to drug exposure for signal generation and signal evaluation.
e. Assessing lifestyle factors in study population.
f. Assessing pregnancy status in study population.
g. Assessing sub-populations (or populations of interest) within study population.
Level 3 checks will quantify population and person time in each data source for the source and study population as a whole as well as for subpopulations of interest. Examples of this type of check include: counts of codes extracted to identify each event and exposure of interest, counts of medication prescription and vaccine administrations etc.
The level 3 checks are divided in 8 major steps:
- Source and study population.
- Medicines.
- Vaccines.
- Diagnoses.
- Pregnancy.
- Populations of interest.
- Health-seeking behaviour and lifestyle factors.
- EUROCAT indicators (For pregnancy-related studies).
Follow the steps below to run Level 3 checks in your data.
R version 4.1.0 (2021-05-18)
- Download the ZIP folder and extract the contents.
- Create a main folder with the name of your project(if you already have done so for Level 1/2 checks skip this step).
- Inside the main folder create the folder
Data characterisation
. Put the extracted folder inside. - Inside the main folder create a folder named
CDMInstances
, which will be used to store the .csv files representing the CDM tables(if you already have done so for Level 1/2 checks skip this step). - Inside the
CDMInstances
folder create a folder with the name of your project and inside the latter put all your .csv files(if you already have done so for Level 1/2 checks skip this step). - In the folder
Level_3_checks_to_be_deployed_v1.0
, go to thep_steps
folder and replace the file called study_parameters.csv with the file study_parameters.csv containing the pre-specified parameters applicable to your DAP. Do not make changes to the name of the file. Instructions on how to fill in this file are given in the file itself, look at the tabs. - Open the to_run.R script. We recommend to run the script to_run.R by chunks, start with study source population until line. If reports are not generated, please go
p_step
folder, open the R script named Main_01_studysourcepopulation.R and run line 65, this will save the temporary file. This error is due to the size of the reports but the study source population is generated, so subsequent reports will be generated. Please continue with following chuncks. - Once the to_run.R script is finish, please open the script to_run_results.R. Select all by using ctrl+A(Windows) or cmmd+A(Mac) and run.
- After everything is complete, see outputs/result in the
g_ouputs
folder.
Folder structure
Main folder
Uploading to anDREa
- In a web browser, Go To: mydre.org.
- Click on 'Click here to login'. Pick an account and enter password.
- Click on Workspaces in upper left and then double click on the project workspace.
- Click on Files tab at top.
- Double click on 'inbox' folder.
- Click on 'Level3'.
- Create a folder by clicking on the folder icon with + on it.
- Click on the folder you created.
- Click on cloud icon to upload files.
- Click on select and upload.
- Open the
ForDashboard
folder which is located insideLevel_3_to_be_deployed1.0/g_output/
. Hold down control and select all files within your prepared folder (can only do one folder at a time). - Click on open.
- When it asks to confirm: "Would like to upload the inbox?" select 'OK'.
- Note: It may take many minutes for your upload to complete. You should receive an email once they are uploaded.
- If you find that your files are not in the corresponding level directory, check if the files are in the inbox and move them to the corresponding level directory.
Level 1 checks: Checking the integrity of the ETL procedure.
Level 2 checks: Checking the logical relationship of the CDM tables.
Level 3 checks: Benchamrking across DAPs and external sources.
The current version of the script is 2.0. Released on July 26th, 2023.
ConcePTION aims to build an ecosystem that can use Real World Data (RWD) to generate Real World Evidence (RWE) that may be used for clinical and regulatory decision making, closing the big information gap of medication safety in pregnancy. As part of WP7, level checks were design to assess the quality of the data supporting RWE. Level checks described here has been successfully implemented in VAC4EU, EMA-tendered risk minimization studies, COVID vaccines effectiveness study, Post-Authorization Safety Studies, and CONSIGN. For details, please find the scientific pre-print article on INSIGHT here
The ConcePTION project has received funding from the Innovative Medicines Initiative 2 Joint Undertaking under grant agreement No 821520. This Joint Undertaking receives support from the European Union’s Horizon 2020 research and innovation programme and EFPIA
Vjola Hoxhaj - v.hoxhaj@umcutrecht.nl
Roel Elbers - R.J.H.Elbers@umcutrecht.nl
Ema Alsina - palsinaaer@gmail.com
Project Link: https://github.com/UMC-Utrecht-RWE/INSIGHT-Level3