Skip to content

An r package for clustering patient cohorts in the OMOP CDM format by medical history or creating topic concept sets

License

Notifications You must be signed in to change notification settings

jreps/patientCluster

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

35 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

patientCluster

Introduction

An R package for clustering cohort patients by medical history or creating data-driven medical topics (clusters of concept_ids corresponding to the same medical topic).

There is also a javascript D3 interactive visulisation available for viewing the results.

Features

  • Aids the creation of topics (sets of similar concept ids)
  • Takes a cohort as input and feature definitions (set of individual concept_ids or topics).
  • Extracts the necessary data from a database in OMOP Common Data Model format.
  • Performs kmeans, generalised low rank models or concensus clustering
  • Includes functions for evaluating clusters and exporting into JSON format

Screenshots

Javascript Cluster Screenshot
Example Javascript Plot

Technology

patientCluster is an R package, with some functions implemented using h2o (http://h2o-release.s3.amazonaws.com/h2o/rel-lambert/5/docs-website/Ruser/Rinstall.html).

System Requirements

Requires R (version ? or higher). Installation on Windows requires RTools. Libraries used in patientCluster require Java.

Dependencies

  • h2o
  • DatabaseConnector
  • SqlRender

Getting Started

  1. On Windows, make sure RTools is installed.
  2. The DatabaseConnector, h2o and SqlRender packages require Java. Java can be downloaded from http://www.java.com.
  3. Install h2o as describe here: http://h2o-release.s3.amazonaws.com/h2o/rel-lambert/5/docs-website/Ruser/Rinstall.html
  4. In R, use the following commands to download and install patientCluster:
install.packages("devtools")
library(devtools)
install_github("ohdsi/SqlRender") 
install_github("ohdsi/DatabaseConnector") 
install_github("ohdsi/patientCluster") 

library("patientCluster")
library("h2o")
h2o.init(nthreads = -1, max_mem_size = '16g')

Getting Involved

License

patientCluster is licensed under Apache License 2.0

Development

patientCluster is being developed in R Studio.

Beta

About

An r package for clustering patient cohorts in the OMOP CDM format by medical history or creating topic concept sets

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages