New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Updating LOIO analysis to include all features spaces #46

Merged

gwaybio merged 7 commits into WayScience:main from gwaybio:loio-all

Oct 30, 2023

Member

gwaybio commented Oct 24, 2023

I also add shuffled data to this analysis. Dependent on merging #40 first. After merging #40, I will update visualization

gwaybio added 2 commits

October 23, 2023 20:33


          update loio analysis to all feature spaces and shuffled

c26b4ca


          update lOIO to all feature spaces and track lfs

ea4bb92

review-notebook-app bot commented Oct 24, 2023

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

gwaybio added 3 commits

October 26, 2023 13:24


          rename loio analysis file to avoid painful/impossible merge

de5348e


          Merge remote-tracking branch 'upstream/main' into loio-all

ccebbe0


          add back initial loio probabilities notebook

3a21370

gwaybio requested a review from MattsonCam

October 26, 2023 20:45

MattsonCam reviewed

View reviewed changes

3.evaluate_model/get_LOIO_probabilities_all_featurespaces_no_retraining.ipynb

		@@ -0,0 +1,1258 @@
		{

Member

MattsonCam Oct 30, 2023 •

edited

Loading

May also consider removing this

Reply via ReviewNB

Member Author

gwaybio Oct 30, 2023

Yeah, I agree. I will remove!

3.evaluate_model/get_LOIO_probabilities_all_featurespaces_no_retraining.ipynb

		@@ -0,0 +1,1258 @@
		{

Member

MattsonCam Oct 30, 2023 •

edited

Loading

Very well documented, easy to follow

Reply via ReviewNB

Member Author

gwaybio Oct 30, 2023

Great! It is maybe worth noting that this is mostly Roshan's code ported over to this new file.

3.evaluate_model/get_LOIO_probabilities_all_featurespaces_no_retraining.ipynb

		@@ -0,0 +1,1258 @@
		{

Member

MattsonCam Oct 30, 2023 •

edited

Loading

Line #60.            straified_k_folds = StratifiedKFold(n_splits=10, shuffle=False)

Consider removing this code

Reply via ReviewNB

Member Author

gwaybio Oct 30, 2023

Great point! this must have stuck around and I forgot to delete. Thanks for noting. (I will also delete the GridSearchCV object call)

3.evaluate_model/get_LOIO_probabilities_all_featurespaces_no_retraining.ipynb

		@@ -0,0 +1,1258 @@
		{

Member

MattsonCam Oct 30, 2023 •

edited

Loading

Line #62.            # create logistic regression model with following parameters

Same thing here, consider removing this code

Reply via ReviewNB

Member Author

gwaybio Oct 30, 2023

💯 thanks!

3.evaluate_model/get_LOIO_probabilities_all_featurespaces_no_retraining.ipynb

		@@ -0,0 +1,1258 @@
		{

Member

MattsonCam Oct 30, 2023 •

edited

Loading

Line #68.            grid_search_cv = GridSearchCV(

Consider removing this code

Reply via ReviewNB

3.evaluate_model/get_LOIO_probabilities_all_featurespaces_no_retraining.ipynb

		@@ -0,0 +1,1258 @@
		{

Member

MattsonCam Oct 30, 2023 •

edited

Loading

Line #94.                        C=model.C,

Obviously it works, but I'm surprised you can access the C parameter, since it is not an attribute on the 1.1.1 sklearn logistic regression documentation

Reply via ReviewNB

Member Author

gwaybio Oct 30, 2023

Interesting! Thanks for digging into this and raising. I confirmed that I am using 1.1.1 (see below), but maybe it is through the saving and/or loading with joblib process that exposes the attribute again? Maybe this process also exposes parameters (C is a parameter in 1.1 docs) 🤷

3.evaluate_model/get_LOIO_probabilities_all_featurespaces_no_retraining.ipynb

		@@ -0,0 +1,1258 @@
		{

Member

MattsonCam Oct 30, 2023 •

edited

Loading

Line #7.    # define combinations to test over

May also consider storing these in a dictionary

Reply via ReviewNB

Member Author

gwaybio Oct 30, 2023 •

edited

Loading

Yeah, it might be more elegant to do so. However, it is a relatively minor change that would cause additional compute time that likely isn't worth pursuing. Roshan also uses itertools.product() on these combinations, so it is efficiently iterating over them with limited code, which is one of the primary benefits of dictionary storage. I will skip doing this one, thanks!

MattsonCam approved these changes

View reviewed changes

Member

MattsonCam left a comment

It LGTM @gwaybio! Just some minor comments. Let me know if you have any questions!

gwaybio added 2 commits

October 30, 2023 13:40


          Merge remote-tracking branch 'upstream/main' into loio-all

31ce737


          software gardening in response to PR comments

f4e668e

Member Author

gwaybio commented Oct 30, 2023

Thanks for the review @MattsonCam - I will go ahead and merge!

gwaybio merged commit 9066225 into WayScience:main

gwaybio deleted the loio-all branch

October 30, 2023 19:58

gwaybio mentioned this pull request

Updating LOIO visualization #47

Merged

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment