Lora implementation in finetuning and evaluation #638

caroteu · 2024-06-20T09:04:26Z

No description provided.

caroteu · 2024-06-24T11:30:11Z

finetuning/specialists/lora/train_covid_if.py

@anwai98 This is the script I used for finetuning on the covid_if data. In the data I sent you, I refer to it as 'my impl'.

caroteu · 2024-06-24T11:48:57Z

finetuning/specialists/lora/train_covid_if.py

+            joint_model_params.append(params)
+
+    optimizer = torch.optim.Adam(joint_model_params, lr=1e-5)
+    scheduler = torch.optim.lr_scheduler.ReduceLROnPlateau(optimizer, mode="min", factor=0.9, patience=10)


@anwai98 The patience for the learning rate scheduler is set to 10 here but to 3 in the 'Resource Efficient Impl'. Could that be the reason for the performance difference?

There are a few more changes. We had a lower learning rate and a different optimizer as well (https://github.com/caroteu/micro-sam/blob/dcd6ecdc5ef600e07670db27ccfba54e81f156f7/finetuning/specialists/resource-efficient/covid_if_finetuning.py#L135-L136)

Also, 100 epochs in the other experiments probably translates to a bit more than 10k iterations here.

That could lead to a small performance difference. (A very severe drop in performance would be a different discussion.)
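For reference, the epoch-to-iteration comparison mentioned above is simple arithmetic. The sketch below is illustrative only: the function name, the training-set size, and the batch size are hypothetical and are not taken from either training script. They were picked so that the result lands at "a bit more than 10k".

```python
import math


def epochs_to_iterations(n_epochs, n_samples, batch_size, drop_last=False):
    """Convert an epoch budget into a number of optimizer iterations.

    One iteration processes one batch; one epoch visits every sample once.
    All arguments are placeholders, not values from the actual scripts.
    """
    if drop_last:
        # An incomplete final batch is discarded.
        per_epoch = n_samples // batch_size
    else:
        # An incomplete final batch still counts as one iteration.
        per_epoch = math.ceil(n_samples / batch_size)
    return n_epochs * per_epoch


# Hypothetical numbers: 210 training samples, batch size 2, 100 epochs.
print(epochs_to_iterations(n_epochs=100, n_samples=210, batch_size=2))
```

If the two workflows use different iteration budgets, comparing them at a matched number of iterations would remove this factor from the comparison.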

The optimizer is actually the same: I ran this with Adam and lr=1e-5 too, to make it consistent with the other workflow. Both sets of results I sent you use Adam with lr=1e-5.
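With the optimizer and learning rate matched, the scheduler patience is one remaining difference. A rough way to see how much it matters is to simulate the plateau rule in plain Python. This is a simplified sketch of the behaviour of `ReduceLROnPlateau` with `mode="min"` and its default relative threshold, not the PyTorch implementation itself: it ignores cooldown, `min_lr`, and `eps`. The helper name `simulate_plateau_lr` and the flat validation curve are assumptions for illustration, and it assumes both runs use `factor=0.9`, which is only confirmed for the script in this PR.

```python
import math


def simulate_plateau_lr(metrics, lr=1e-5, factor=0.9, patience=10, threshold=1e-4):
    """Return (final_lr, n_reductions) for a sequence of validation metrics.

    Simplified plateau rule (lower metric is better): the learning rate is
    multiplied by `factor` once more than `patience` consecutive steps pass
    without a relative improvement larger than `threshold`.
    """
    best = math.inf
    num_bad = 0
    n_reductions = 0
    for value in metrics:
        if value < best * (1.0 - threshold):
            best = value
            num_bad = 0
        else:
            num_bad += 1
        if num_bad > patience:
            lr *= factor
            n_reductions += 1
            num_bad = 0
    return lr, n_reductions


# Hypothetical worst case: the validation metric never improves for 100 steps.
flat_metric = [0.5] * 100
for patience in (10, 3):
    final_lr, n_reductions = simulate_plateau_lr(flat_metric, patience=patience)
    print(f"patience={patience}: {n_reductions} reductions, final lr={final_lr:.2e}")
```

On a fully flat curve the shorter patience decays the learning rate far more often, so the two runs can end up training at noticeably different learning rates even though they start from the same value. Whether that explains the observed gap depends on the real validation curves.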

caroteu added 12 commits June 19, 2024 15:47

include lora in livecell training logic

79bd2fa

implemented training scripts for mouse embryo and covid if (non-stable)

111a994

removed use_lora from evaluation scripts

6b435eb

clean up for pr

e4db246

changed qos specification in batch script

2eaaf26

changed user in batch script

6dda34c

Merge branch 'lora-rank-study' into covid_if

81df896

removed mistake in submit evaluation

0bcd7df

lora implementation in evaluation scripts

df6a130

changed checkpoint handling in evaluation

bd00611

corrected checkpoint argument in evaluate amg

313c0ed

removed decoder initialization from covid_if training

cbf31ab

changed optimizer and naming of functions

c2ef267
