SGD learning rate tuning + pytorch implementation, simulations #81

Merged: 24 commits from sgd_simulations into greenelab:master on Jun 1, 2023

Conversation

@jjc2718 (Member) commented on May 26, 2023

Sorry in advance - this is kind of a large PR with no obvious way to split it up into smaller ones. I'll summarize the main changes below; if you want to review each of them individually, feel free to do that and approve once you've finished them all.

Main changes:

  • Tune learning rate for SGD optimizer: see changes to 01_stratified_classification/run_stratified_lasso_penalty.py, 01_stratified_classification/scripts/run_lasso_lr_compare.sh, and parts of pancancer_evaluation/prediction/classification.py and pancancer_evaluation/utilities/classify_utilities.py. This turns out to matter quite a bit for SGD's performance: once we use a slightly more sophisticated approach to tuning the learning rate (a constant learning rate plus a grid search, in this case), we get much better performance, on par with liblinear. (See the first sketch after this list.)

  • Try a pytorch implementation of SGD: we did this primarily to make sure the SGD performance/regularization dynamics weren't specific to the sklearn implementation. These changes are in 01_stratified_classification/run_stratified_nn.py and pancancer_evaluation/prediction/classification.py (primarily the train_mlp_lr function). We probably won't end up using these results for much in the paper, but it was a useful sanity check. (See the second sketch after this list.)

  • Try SGD and liblinear on some simulated data: these are the 01_stratified_classification/sgd_params/sim.ipynb and 01_stratified_classification/sgd_params/sim_lr.ipynb notebooks. I used these to iterate quickly on the learning rate changes and to compare our results to L2 regularization, but the results turn out to be somewhat different on real data, so I'm not sure how applicable this simulation approach is to the problem we're trying to address in our paper. I want to keep these scripts around for future reference, though. (See the third sketch after this list.)
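
To make the first bullet concrete, here is a minimal, hypothetical sketch of tuning a constant SGD learning rate with a grid search; the actual code in run_stratified_lasso_penalty.py and classification.py may use a different parameter grid and CV setup:

```python
# Hypothetical sketch: grid search over a constant learning rate for
# sklearn's SGDClassifier (not the repo's exact code).
from sklearn.linear_model import SGDClassifier
from sklearn.model_selection import GridSearchCV

sgd = SGDClassifier(
    loss="log_loss",           # logistic regression objective
    penalty="l1",              # lasso penalty, matching the stratified classification setup
    alpha=1e-4,                # example regularization strength
    learning_rate="constant",  # fixed step size instead of sklearn's default schedule
    max_iter=1000,
)

# Example grid of constant learning rates (eta0); the real grid is an assumption here.
param_grid = {"eta0": [1e-4, 1e-3, 1e-2, 1e-1, 1.0]}

search = GridSearchCV(sgd, param_grid, scoring="roc_auc", cv=5)
# search.fit(X_train, y_train)  # X_train / y_train are assumed to exist
```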
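For the pytorch bullet, here is a minimal sketch of what an SGD-trained model with an explicit L1 penalty could look like; train_mlp_lr in classification.py is the actual implementation and may be structured differently:

```python
# Hypothetical sketch of a pytorch training loop using torch.optim.SGD with an
# explicit L1 penalty (weight_decay in pytorch's SGD is L2, so L1 is added by hand).
import torch
import torch.nn as nn

def train_sgd_l1(X, y, lr=0.01, l1_lambda=1e-4, n_epochs=100):
    """X: float tensor of shape (n_samples, n_features); y: float tensor of 0/1 labels."""
    model = nn.Linear(X.shape[1], 1)  # logistic regression as a single linear layer
    optimizer = torch.optim.SGD(model.parameters(), lr=lr)
    loss_fn = nn.BCEWithLogitsLoss()
    for _ in range(n_epochs):
        optimizer.zero_grad()
        logits = model(X).squeeze(1)
        loss = loss_fn(logits, y) + l1_lambda * model.weight.abs().sum()
        loss.backward()
        optimizer.step()
    return model
```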
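And for the simulation bullet, a hypothetical sketch of the kind of comparison the sim notebooks run; the actual data generator and parameter settings in sim.ipynb / sim_lr.ipynb may differ:

```python
# Hypothetical sketch: compare liblinear and constant-learning-rate SGD on
# synthetic classification data (not the notebooks' exact setup).
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression, SGDClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, n_features=500,
                           n_informative=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

models = {
    "liblinear": LogisticRegression(penalty="l1", solver="liblinear", C=1.0),
    "sgd": SGDClassifier(loss="log_loss", penalty="l1", alpha=1e-4,
                         learning_rate="constant", eta0=0.01),
}

for name, clf in models.items():
    clf.fit(X_train, y_train)
    auc = roc_auc_score(y_test, clf.decision_function(X_test))
    print(f"{name}: test AUC = {auc:.3f}")
```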

I have a summary of the main plots/conclusions in these slides, in case they help a bit with putting the results in context: https://docs.google.com/presentation/d/1LRBq_ciFeS503J8-GeH51l-1p4RTdWJPn_LcGhHNjgM/edit?usp=sharing. Let me know if you have questions!

@review-notebook-app commented: Check out this pull request on ReviewNB to see visual diffs and provide feedback on the Jupyter notebooks.

@jjc2718 requested a review from @arielah on May 26, 2023 at 19:11
@arielah left a comment:

I'm glad you checked this out! Some interesting results in here, and they'll definitely be useful for the paper.

@jjc2718 merged commit 0564770 into greenelab:master on Jun 1, 2023 (1 check passed).
@jjc2718 deleted the sgd_simulations branch on Jun 1, 2023 at 17:24.