Skip to content


Switch branches/tags

Name already in use

A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?

Latest commit


Git stats


Failed to load latest commit information.
Latest commit message
Commit time

Clinical Trials Browser

This repository contains code and other files related to visualizing and mining the clinical trials registry data maintained by the National Institutes of Health at and published by the Clinical Trials Transformation Intiative as the Aggregate Analysis of database.

This work began as class projects for Information Visualization and Presentation and Data Mining and Analytics in Intelligent Business Services courses at University of California, Berkeley's School of Information.

Data preparation

The database and vizdata folders of this repository contain the SQL and Python code to create the database tables, upload the data, and extract and transform it for the D3-based visualization. All Python files in the repository are in the form of IPython notebooks running Python 2.7.6.

Data visualization

The completed visualization can be viewed on our I School group website.

Upon loading the visualization, the user is presented with a help overlay that explains the main interaction mechanisms and components of the visualization. This can be accessed at any time using the help icon in the upper right hand corner. The initial bubble diagram is organized based on Medical Subject Headings (MeSH) subcategories, and each bubble can be clicked to drill down into the defined MeSH polyhierarchy. A navigation section on the top left describes the current view (total number of trials and patients enrolled), along with breadcrumbs showing the current location within the hierarchy and which selections have been made. A link takes users to the registry, showing details of all trials based on the user’s current selection.

The main bubble visualization is complemented by a series of charts that provide additional trial details and comparisons. Each chart compares the current selection characteristics to all interventional trials in the registry. This can be viewed by phase of study, study enrollment and completion status, primary and secondary financial sponsor, continent where the trial was performed, and temporal trends.

The files to support this web visualization are in the assets, css, and js folders of this repository, as well as the the HTML file in the main directory.

Data mining

The data mining project is summarized in a report, and the code and other supporting files are in the datamining folder of this repository.


No description, website, or topics provided.







No releases published


No packages published

Contributors 4