Sally - Heart Disease Risk Assessment Tool

Sally is a specialized heart disease risk assessment tool developed using the All of Us Research Program database. It's designed to help healthcare providers quickly assess a patient's risk of heart disease.

Setup

Clone this repository
Install requirements: pip install -r requirements.txt
Run the Jupyter Notebook model_training.ipynb to train the model
Run the Flask app: python app.py
Open a web browser and go to http://localhost:5000

Working with Concept IDs in the All of Us Database

The All of Us Research Program stores patient data using concept IDs, which are unique identifiers for medical metrics (e.g., blood pressure, BMI) or conditions (e.g., heart disease). If you're pulling data from this database:

Understand Concept IDs: Each health metric or condition has a specific concept ID. Example:

- 3004249 = Systolic Blood Pressure
- 316139 = Heart Failure

Writing Queries: Use SQL queries with these concept IDs to pull relevant data. Example query:

    SELECT person_id, measurement_concept_id, value_as_number
    FROM your_workspace.measurement
    WHERE measurement_concept_id = 3004249

Navigating the Database:

- Use the Google BigQuery console to explore the schema.
- Reference the All of Us data dictionary for concept IDs to find more health metrics.

Quick Tip: When working with multiple IDs, use an IN clause to pull several measurements at once:

    WHERE measurement_concept_id IN (3004249, 3018586, 3043111)

These steps should help others navigate the database and extract the right data for their models without the struggle we went through.
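
If you build these queries from a notebook, the IN pattern can be sketched in Python. This is a minimal sketch, not code from this repository: the function name `build_measurement_query` is hypothetical, `your_workspace.measurement` is the placeholder table name from the example query, and the `value_as_number` column is assumed from the OMOP measurement table, so confirm it against your workspace's schema.

```python
from typing import Iterable


def build_measurement_query(concept_ids: Iterable[int],
                            table: str = "your_workspace.measurement") -> str:
    """Return a SELECT that filters the measurement table by concept ID."""
    # Coerce to int so only numeric IDs can reach the SQL text,
    # then de-duplicate and sort so the output is deterministic.
    ids = sorted({int(cid) for cid in concept_ids})
    if not ids:
        raise ValueError("at least one concept ID is required")
    id_list = ", ".join(str(cid) for cid in ids)
    # value_as_number is an assumed column name (OMOP measurement table).
    return (
        "SELECT person_id, measurement_concept_id, value_as_number\n"
        f"FROM {table}\n"
        f"WHERE measurement_concept_id IN ({id_list})"
    )


# The three IDs from the quick tip above.
query = build_measurement_query([3004249, 3018586, 3043111])
print(query)
```

Keeping the ID list in one place makes it easier to add a metric later without editing the SQL by hand.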

Usage

Open the Sally web interface in your browser
Enter the patient's health metrics:
- Age, gender, race, and ethnicity
- Blood pressure (systolic and diastolic)
- BMI
- Cholesterol levels (total, HDL, LDL)
- Glucose level
- Waist circumference
- Triglycerides
Click "Assess Risk" to get the prediction
Review the risk assessment and recommendations

Customization

Sally is built on the All of Us database but can be adapted for other datasets. To use a different dataset:

Prepare your data in a similar format to the All of Us data
Modify the data loading and preprocessing steps in the Jupyter Notebook
Retrain the model with your data
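
As a rough illustration of the first step, the sketch below renames another dataset's columns to the names the training notebook is assumed to expect. Every column name here is hypothetical (the notebook's actual feature names are not listed in this README), and the sample row is made up.

```python
import csv
import io

# Hypothetical mapping from another dataset's column names to the names
# the training notebook is assumed to expect; adjust both sides to your data.
COLUMN_MAP = {
    "sbp": "systolic_bp",
    "dbp": "diastolic_bp",
    "body_mass_index": "bmi",
}


def rename_columns(rows, column_map):
    """Yield each row with mapped column names; unmapped columns pass through."""
    for row in rows:
        yield {column_map.get(name, name): value for name, value in row.items()}


# A tiny in-memory CSV standing in for the new dataset (made-up values).
sample = io.StringIO("sbp,dbp,body_mass_index,age\n128,82,27.4,54\n")
converted = list(rename_columns(csv.DictReader(sample), COLUMN_MAP))
print(converted[0])
```

Note that `csv.DictReader` returns every value as a string, so numeric conversion would still need to happen in the notebook's preprocessing step.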

Troubleshooting

If you encounter any issues:

Ensure all dependencies are correctly installed
Check that the model file (.joblib) is in the correct directory
Verify that your input data is within expected ranges
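
The three checks above can be sketched as small Python helpers. This is an assumption-laden sketch, not part of the app: the helper names are hypothetical, the package names `flask` and `joblib` are guesses at what requirements.txt contains, and the numeric ranges are illustrative plausibility bounds, not the ranges the model was trained on and not clinical guidance.

```python
import importlib.util
from pathlib import Path

# Illustrative plausibility bounds only (assumptions, see the note above).
EXPECTED_RANGES = {
    "age": (18, 120),
    "systolic_bp": (50, 300),
    "diastolic_bp": (30, 200),
    "bmi": (10, 80),
}


def missing_packages(names):
    """Return the top-level packages from `names` that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def model_files(directory="."):
    """Return the .joblib files found directly inside `directory`."""
    return sorted(Path(directory).glob("*.joblib"))


def out_of_range(metrics, ranges=EXPECTED_RANGES):
    """Return the names of supplied metrics that fall outside their range."""
    return [
        name
        for name, (low, high) in ranges.items()
        if name in metrics and not low <= metrics[name] <= high
    ]


print(missing_packages(["flask", "joblib"]))  # empty list when both are installed
print(model_files())                          # empty list when no model file is present
print(out_of_range({"age": 54, "systolic_bp": 400, "bmi": 27.4}))
```

Run it from the directory that contains app.py so that `model_files()` looks in the same place the app does.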

Contributing

We welcome contributions! Please see CONTRIBUTING.md for details on how to submit pull requests.

License

This project is licensed under the MIT License - see the LICENSE.md file for details.

Acknowledgments

All of Us Research Program for the dataset

For any questions or support, please open an issue in this repository.

