# Open-source Electronic Health Records (EHR) data

1\. **MIMIC-III (Medical Information Mart for Intensive Care) Database**

*   **Description**: MIMIC-III is one of the most popular open-access EHR datasets, containing de-identified health data of over 40,000 ICU patients. It includes detailed information on patient demographics, diagnoses (ICD-9 codes), medications, laboratory test results, and more.
*   **Website**: MIMIC-III Database
*   **Data Access**: To access the data, you must become a credentialed PhysioNet user, complete the required human subjects research ethics training (offered through the **CITI Program**), and sign the data use agreement, after which you can download the dataset.
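
Once access is granted, the tables are delivered as CSV files. The following is a minimal sketch of a first look at the diagnosis codes, assuming the `DIAGNOSES_ICD` table layout (`SUBJECT_ID`, `HADM_ID`, `SEQ_NUM`, `ICD9_CODE`). The sample rows are made up for illustration and the helper name `top_icd9_codes` is hypothetical.

```python
import csv
import io
from collections import Counter

# Made-up rows in the assumed layout of the DIAGNOSES_ICD table (not real patient data).
SAMPLE_CSV = """ROW_ID,SUBJECT_ID,HADM_ID,SEQ_NUM,ICD9_CODE
1,10,100,1,4019
2,10,100,2,25000
3,11,101,1,4019
4,12,102,1,41401
"""


def top_icd9_codes(csv_file, n=3):
    """Return the n most frequent ICD-9 codes as (code, count) pairs."""
    reader = csv.DictReader(csv_file)
    # Skip rows where the code is empty.
    counts = Counter(row["ICD9_CODE"] for row in reader if row["ICD9_CODE"])
    return counts.most_common(n)


if __name__ == "__main__":
    # With the real export, pass an opened file instead of the in-memory sample.
    print(top_icd9_codes(io.StringIO(SAMPLE_CSV)))
```

Using the standard-library `csv` module keeps the sketch dependency-free; for the full tables a dataframe library or a database load is likely more practical.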

* * *

2\. **PhysioNet**

*   **Description**: PhysioNet offers various health-related datasets, including MIMIC-III, and other datasets related to patient monitoring and health records. PhysioNet provides access to EHR-like data, clinical notes, lab results, and other critical information.
*   **Website**: [PhysioNet](https://physionet.org/)
*   **Data Access**: The data is available for free but may require registration or special access through an application process.

* * *


3\. **OHDSI (Observational Health Data Sciences and Informatics)**

*   **Description**: OHDSI provides an open-source platform for health data, including EHR data. The OHDSI network builds upon open-source tools for analyzing large-scale health data. Their **OMOP Common Data Model** is widely used for harmonizing EHR data from diverse sources.
*   **Website**: [OHDSI](https://www.ohdsi.org/)
*   **Data Access**: The OHDSI network provides tools and methodologies for working with EHR data but doesn’t offer a single downloadable dataset. Instead, it offers guidance and tools to work with EHR datasets structured in the OMOP model.
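
To make the idea of harmonizing into the OMOP model more concrete, here is a minimal sketch that maps one source record into rows shaped like the OMOP `person` and `condition_occurrence` tables. The source record layout, the function name `to_omop`, and the small lookup dictionary are hypothetical stand-ins; in practice the code-to-concept mapping comes from the OHDSI vocabulary tables, and the condition concept ID shown here is illustrative and should be verified there. Only a subset of the columns of each table is modeled.

```python
from dataclasses import dataclass
from datetime import date

# Standard OMOP gender concept IDs (male, female).
GENDER_CONCEPTS = {"M": 8507, "F": 8532}

# Stand-in for a vocabulary lookup (source ICD-9 code -> standard concept ID).
# The value is illustrative; verify real mappings against the OHDSI vocabularies.
ICD9_TO_CONCEPT = {"401.9": 320128}


@dataclass
class Person:
    person_id: int
    gender_concept_id: int
    year_of_birth: int


@dataclass
class ConditionOccurrence:
    person_id: int
    condition_concept_id: int
    condition_start_date: date
    condition_source_value: str


def to_omop(record):
    """Map a hypothetical source record to OMOP-shaped rows.

    Codes without a known mapping get concept ID 0 (no matching concept).
    """
    person = Person(
        person_id=record["patient_id"],
        gender_concept_id=GENDER_CONCEPTS.get(record["sex"], 0),
        year_of_birth=record["birth_year"],
    )
    condition = ConditionOccurrence(
        person_id=record["patient_id"],
        condition_concept_id=ICD9_TO_CONCEPT.get(record["icd9"], 0),
        condition_start_date=date.fromisoformat(record["diagnosed_on"]),
        condition_source_value=record["icd9"],  # keep the original code for traceability
    )
    return person, condition


if __name__ == "__main__":
    source = {"patient_id": 1, "sex": "F", "birth_year": 1970,
              "icd9": "401.9", "diagnosed_on": "2015-06-01"}
    print(to_omop(source))
```

Keeping the original code in `condition_source_value` alongside the mapped concept ID means an unmapped or wrongly mapped record can still be traced back to its source.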


* * *

4\. **Synthea (Synthetic Health Data)**

*   **Description**: Synthea is an open-source synthetic patient population generator that creates realistic but completely synthetic EHR-like data, including demographics, medical histories, conditions (coded primarily in SNOMED CT), medications, and lab results. Because no real patients are involved, the data is useful for testing healthcare applications and systems without privacy restrictions.
*   **Website**: [Synthea](https://synthea.mitre.org/)
*   **Data Access**: You can download large synthetic datasets from their website.
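
As a rough illustration of working with such data, the sketch below groups conditions by patient. It assumes the column layout of Synthea's CSV export (`patients.csv` keyed by `Id`, `conditions.csv` with a `PATIENT` reference and a `DESCRIPTION`), trimmed to a few columns. The rows are made up for illustration and the function name `conditions_by_patient` is hypothetical.

```python
import csv
import io
from collections import defaultdict

# Made-up rows in the assumed layout of Synthea's CSV export (columns trimmed).
PATIENTS_CSV = """Id,BIRTHDATE,GENDER
p1,1980-02-14,F
p2,1955-07-30,M
"""

CONDITIONS_CSV = """START,STOP,PATIENT,ENCOUNTER,CODE,DESCRIPTION
2010-01-05,,p1,e1,44054006,Diabetes
2012-03-09,,p1,e2,38341003,Hypertension
2001-11-20,,p2,e3,38341003,Hypertension
"""


def conditions_by_patient(patients_file, conditions_file):
    """Return {patient id: [condition descriptions]} for every known patient."""
    known_ids = [row["Id"] for row in csv.DictReader(patients_file)]
    grouped = defaultdict(list)
    for row in csv.DictReader(conditions_file):
        grouped[row["PATIENT"]].append(row["DESCRIPTION"])
    # Patients without any recorded condition map to an empty list.
    return {pid: grouped.get(pid, []) for pid in known_ids}


if __name__ == "__main__":
    result = conditions_by_patient(io.StringIO(PATIENTS_CSV), io.StringIO(CONDITIONS_CSV))
    for patient_id, conditions in result.items():
        print(patient_id, conditions)
```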


* * *

5\. **eICU Collaborative Research Database**

*   **Description**: The eICU database is another large-scale, de-identified dataset containing data from over 200,000 ICU admissions in the United States. It includes information about patient vital signs, medications, diagnoses (ICD-9 codes), and other clinical events.
*   **Website**: [eICU Database](https://eicu-crd.mit.edu/)
*   **Data Access**: Access to the eICU database requires completion of a human subjects research training program (CITI Program).


* * *

6\. **i2b2 (Informatics for Integrating Biology and the Bedside)**

*   **Description**: i2b2 is primarily an open-source clinical data warehouse and query platform. It is also known for the de-identified clinical datasets released through its NLP shared-task challenges (now maintained as n2c2). The data covers clinical diagnoses, medications, lab results, and patient outcomes, often related to specific disease categories.
*   **Website**: [i2b2](https://www.i2b2.org/)
*   **Data Access**: i2b2 offers public datasets but also provides resources to create your own research database. Access is typically granted to researchers after agreeing to specific use terms.


* * *

7\. **The Cancer Imaging Archive (TCIA)**

*   **Description**: While primarily focused on **medical imaging**, TCIA also includes linked clinical data in the form of EHR-like data, including patient demographics, diagnoses, treatment information, and outcomes. This is particularly valuable for cancer-related research.
*   **Website**: [The Cancer Imaging Archive (TCIA)](https://www.cancerimagingarchive.net/)
*   **Data Access**: Open access with registration.


* * *

8\. **openEHR**

*   **Description**: openEHR is an open standard for EHR data management. While not a data source itself, openEHR provides open specifications and **open-source tools** that allow for the creation, management, and sharing of health data. It includes **archetypes** (reusable, formally defined models of clinical concepts) and data models for representing healthcare information, which can be integrated with other sources of open health data.
*   **Website**: [openEHR](https://www.openehr.org/)


* * *

9\. **ClinicalTrials.gov**

*   **Description**: While not strictly an EHR dataset, ClinicalTrials.gov provides access to data related to clinical trials, which often include information about patient demographics, interventions, and outcomes. It’s a valuable source for data on medical conditions, treatments, and results.
*   **Website**: [ClinicalTrials.gov](https://clinicaltrials.gov/)
*   **Data Access**: The data can be accessed via the website, and some detailed datasets are available for download.
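
For programmatic access, a minimal sketch of a condition search is shown below. It assumes the ClinicalTrials.gov v2 REST API, with the `query.cond` and `pageSize` parameters and a response that nests identifiers under `protocolSection.identificationModule`. Check these details against the current API documentation before relying on them. The function names are hypothetical and the sample payload is made up so that the parsing step can be tried offline.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumed base URL of the v2 studies endpoint.
BASE_URL = "https://clinicaltrials.gov/api/v2/studies"


def build_search_url(condition, page_size=5):
    """Build a search URL for studies matching a condition."""
    return BASE_URL + "?" + urlencode({"query.cond": condition, "pageSize": page_size})


def extract_ids_and_titles(payload):
    """Pull (NCT ID, brief title) pairs out of a parsed API response."""
    results = []
    for study in payload.get("studies", []):
        ident = study.get("protocolSection", {}).get("identificationModule", {})
        results.append((ident.get("nctId"), ident.get("briefTitle")))
    return results


def search(condition, page_size=5):
    """Perform the HTTP request and parse the result (needs network access)."""
    with urlopen(build_search_url(condition, page_size)) as response:
        return extract_ids_and_titles(json.load(response))


# Made-up payload in the assumed response shape (not a real study).
SAMPLE_PAYLOAD = {
    "studies": [
        {"protocolSection": {"identificationModule": {
            "nctId": "NCT00000000", "briefTitle": "Example asthma study"}}}
    ]
}

if __name__ == "__main__":
    print(build_search_url("asthma"))
    print(extract_ids_and_titles(SAMPLE_PAYLOAD))
```

Separating URL construction and response parsing from the network call makes both steps easy to check without hitting the live service.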


* * *

10\. **The National Database for Autism Research (NDAR, now part of the NIMH Data Archive)**

*   **Description**: NDAR provides access to datasets related to autism research, including clinical data, EHR-like data, and genetic data. It allows for studying health conditions and treatment outcomes.
*   **Website**: [NDAR](https://ndar.nih.gov/)
*   **Data Access**: Access is available to researchers after completing a registration process.




# Other resources

1\. **UK Biobank**

*   **Description**: The UK Biobank is a large-scale biomedical research dataset containing de-identified health and genetic information from about 500,000 participants. It includes data on demographics, lifestyle, health conditions, genetics, and medical imaging, making it a valuable resource for genomics, epidemiology, and disease prevention studies.
*   **Website**: [UK Biobank](https://www.ukbiobank.ac.uk/)
*   **Data Access**: Access to the data is granted to approved researchers after completing an application process, which includes a research proposal and ethical approval.

* * *


2\. **Truven Health Analytics (IBM Watson Health, now Merative)**

*   **Description**: Truven Health Analytics, acquired by IBM in 2016 and folded into IBM Watson Health, offers a wide range of commercial health data solutions, including claims data, clinical data, and real-world evidence. It is primarily used by pharmaceutical companies, healthcare providers, and researchers for analysis, drug development, and treatment outcomes. In 2022 the Watson Health business was sold and now operates as Merative.
*   **Website**: IBM Watson Health (now Merative)
*   **Data Access**: Available through commercial agreements with Merative (formerly IBM Watson Health).

* * *

3\. **Optum**

*   **Description**: Optum, part of UnitedHealth Group, provides a large set of healthcare data covering insurance claims, patient data, and clinical data. This resource is used by researchers, payers, and health systems to analyze healthcare utilization, treatment effectiveness, and more.
*   **Website**: [Optum](https://www.optum.com/)
*   **Data Access**: Commercial agreements for access to healthcare datasets.

* * *




4\. **Cerner (now part of Oracle)**

*   **Description**: Cerner provides healthcare IT solutions and EHR systems and also offers access to a range of health data. Its commercial database includes de-identified patient data, clinical outcomes, and other healthcare-related information, primarily for research, analytics, and decision-making.
*   **Website**: [Cerner](https://www.cerner.com/)
*   **Data Access**: Available through agreements for health systems, academic researchers, and commercial entities.

* * *



5\. **IQVIA**

*   **Description**: IQVIA is a global leader in healthcare data analytics, offering insights from EHR data, clinical trials, and real-world evidence. They provide access to large datasets for health research, clinical development, and market analysis.
*   **Website**: [IQVIA](https://www.iqvia.com/)
*   **Data Access**: Available to commercial organizations, health providers, and researchers via subscription.

* * *

6\. **Veradigm (formerly Allscripts)**

*   **Description**: Veradigm offers access to a large commercial dataset containing EHR data, claims data, and patient outcomes data. The dataset is useful for healthcare providers, pharmaceutical companies, and researchers conducting observational studies and health outcomes research.
*   **Website**: [Veradigm](https://www.veradigm.com/)
*   **Data Access**: Commercial licenses and partnerships.

* * *


7\. **Health Catalyst**

*   **Description**: Health Catalyst provides a data platform for healthcare organizations and research groups, offering access to various healthcare datasets, including EHR data, clinical outcomes, and claims data.
*   **Website**: [Health Catalyst](https://www.healthcatalyst.com/)
*   **Data Access**: Access through agreements with healthcare organizations and researchers.


* * *

8\. **LexisNexis Risk Solutions (Healthcare)**

*   **Description**: LexisNexis offers healthcare data solutions including claims, clinical, and health risk data. They provide commercial access to healthcare datasets for providers, payers, and pharmaceutical companies.
*   **Website**: LexisNexis Health
*   **Data Access**: Commercial access through subscription or agreements.

* * *


9\. **Medicare & Medicaid Data (CMS)**

*   **Description**: The Centers for Medicare & Medicaid Services (CMS) offers data related to the Medicare and Medicaid populations, including claims data, patient outcomes, and utilization. Aggregate and public-use files are openly available, while record-level research files are restricted. There are also commercial products built around this data that offer enhanced analysis and visualization tools.
*   **Website**: [CMS](https://www.cms.gov/)
*   **Data Access**: Free public access to certain data sets; record-level research files require a data use agreement with CMS (requests are supported by ResDAC), and more advanced analytics are available through third-party providers.

* * *


10\. **GE Healthcare**

*   **Description**: GE Healthcare provides access to clinical and diagnostic data through its commercial platforms. This includes imaging data, clinical records, and health management solutions used across healthcare systems globally.
*   **Website**: [GE Healthcare](https://www.gehealthcare.com/)
*   **Data Access**: Available through commercial agreements with healthcare institutions.

* * *


11\. **Clarity AI (Clinical Data Solutions)**

*   **Description**: Clarity AI provides access to clinical data analytics and solutions. It aggregates and anonymizes clinical data for use in healthcare decision-making, drug development, and clinical research.
*   **Website**: [Clarity AI](https://www.clarity.ai/)
*   **Data Access**: Commercial access via subscription or licensing.

* * *
