Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create validation module #7

Merged
merged 24 commits into from
Jan 4, 2023
Merged

Create validation module #7

merged 24 commits into from
Jan 4, 2023

Conversation

roshankern
Copy link
Member

This PR is ready for review! In this PR, the final model is validated using the cell health dataset.

After using the final model to derive phenotypic class probabilities for the classify cell health data features, these probabilities are averaged across perturbation and cell line to create create 357 classifiction profiles (119 CRISPR guides x 3 cell lines).

As part of Predicting cell health phenotypes using image-based morphology profiling, Way et al derived cell health indicators, and averaged these indicators across across CRISPR guide/cell line to create 357 Cell Health label profiles.

We use pandas.DataFrame.corr to find the Pearson correlation coefficient between the classifiction profiles and the Cell Health label profiles. A Pearson correlation closer to -1/+1 shows a stronger inverse/direct relationship.

These correlations can be used to validate the model's success in classifying the cell health data. For example, the vb_percent_all_apoptosis indicator has a relatively high Pearson correlation with the model's apoptosis probabilities, implying a relatively strong direct linear relationship.

In future PRs, we hope to apply this process to the shuffled baseline model to provide a baseline for the final model's correlations. Also, we hope to add other methods of validation, such as applying the model to other datasets.

roshankern and others added 15 commits December 9, 2022 15:51
* finish download module changes

* download notebook

* rerun split data module

* rerun download module

* rerun train_model

* rerun evaluation module

* rerun interpretation module

* combine datasets

* combine datasets

* split changes

* update format

* format update

* format

* finish split data

* combine datasets, remove holdout

* formatting

* rerun pipelines

* remove folded class

* rerun pipeline

* Update utils/download_utils.py

Co-authored-by: Dave Bunten <ekgto445@gmail.com>

* PR fixes

* module docstrings

Co-authored-by: Dave Bunten <ekgto445@gmail.com>
@roshankern roshankern requested a review from d33bs January 4, 2023 21:06
Copy link
Member

@d33bs d33bs left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice work! I left a few comments and thoughts throughout. Please don't hesitate to let me know if you have any questions or if I may clarify at all.

5.validate_model/validate_model.sh Outdated Show resolved Hide resolved
utils/validate_utils.py Outdated Show resolved Hide resolved
utils/validate_utils.py Outdated Show resolved Hide resolved
utils/validate_utils.py Outdated Show resolved Hide resolved
utils/validate_utils.py Outdated Show resolved Hide resolved
utils/validate_utils.py Outdated Show resolved Hide resolved
utils/validate_utils.py Outdated Show resolved Hide resolved
utils/validate_utils.py Show resolved Hide resolved
roshankern and others added 9 commits January 4, 2023 15:32
Co-authored-by: Dave Bunten <ekgto445@gmail.com>
Co-authored-by: Dave Bunten <ekgto445@gmail.com>
Co-authored-by: Dave Bunten <ekgto445@gmail.com>
Co-authored-by: Dave Bunten <ekgto445@gmail.com>
Co-authored-by: Dave Bunten <ekgto445@gmail.com>
Copy link
Member

@d33bs d33bs left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great work on the changes - LGTM!

@roshankern roshankern merged commit bcf6872 into WayScience:main Jan 4, 2023
@roshankern roshankern deleted the create-validation-module branch January 4, 2023 22:53
@roshankern roshankern restored the create-validation-module branch January 4, 2023 22:55
@roshankern roshankern deleted the create-validation-module branch January 4, 2023 23:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants