OHDSI/ClinicalTrialsWGETL

Clinical trial data conventions for the OMOP Common Data Model

The goal of this repository is to apply the clinical trial data conventions to 'real' data. As a first attempt, the PhUSE dataset was chosen. It is a CDISC-compliant synthetic SDTM dataset.

For more information about the clinical trial conventions, please visit our wiki or download the PDF version.

Clinical Trials Working Group

About the group - see here.

Run ETL

The ETL process is based on the Apache Spark™ analytics engine running in a Docker container, so the only thing you need is Docker. You can find instructions on how to install Docker on your system at the official site. After installing Docker, run the conversion in three easy steps:

  1. Clone the repository into a folder on your machine:
$ git clone https://github.com/OHDSI/ClinicalTrialsWGETL.git
  2. Download all the necessary vocabularies from Athena and put them into the vocab/omop folder.
  3. Finally, from the root folder of the repository, run the following command:
$ docker-compose run --rm --service-ports phuse_etl

After the conversion is done, the resulting CDM tables (in CSV format) are in the data/cdm folder.
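To check that the run produced output, a small helper along the following lines may be useful. This is only a sketch and not part of the repository: the function name `summarize_cdm` is hypothetical, and it assumes the CDM tables are written as flat `*.csv` files directly inside data/cdm. The reported line counts include a header line if the files have one.

```shell
# Hypothetical helper (not part of the repository).
# Prints "<file name>: <line count> lines" for each CSV file in a folder.
# Assumption: CDM tables are flat *.csv files directly inside the folder.
summarize_cdm() {
    dir="${1:-data/cdm}"          # default to the output folder named above
    found=0
    for f in "$dir"/*.csv; do
        [ -e "$f" ] || continue   # the glob matched nothing
        found=1
        lines=$(wc -l < "$f" | tr -d ' ')
        echo "$(basename "$f"): $lines lines"
    done
    if [ "$found" -eq 0 ]; then
        echo "no CSV files found in $dir" >&2
        return 1
    fi
}
```

After defining the function in your shell, calling `summarize_cdm` from the root repository folder lists the files in data/cdm. A non-zero exit status means no CSV files were found there.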

License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

About

[under development] ETL materials to support proposal for CDM enhancements for clinical trial data
