compatibility with lora #6

Closed
chmxu opened this issue Jan 18, 2024 · 2 comments

chmxu commented Jan 18, 2024

Hi, I wonder if you have tried using LoRA to finetune the model? It should need less GPU memory than the current full-finetuning strategy.

zwcolin (Collaborator) commented Jan 18, 2024

Hi Chengming,

Yes, we have tried training with LoRA but encountered performance degradation at the default ranks, so we did not end up with a LoRA-finetuned version.

That said, it might still be possible to finetune with LoRA, although many hyperparameters (e.g., the rank and which attention weights to adapt) would likely need careful experimentation specifically for our setup, which makes the attention map itself an objective to optimize.

Best,
Zirui
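
For reference, below is a minimal sketch of what a LoRA setup along these lines could look like using the `peft` library, assuming the base model loads as a standard Hugging Face transformer. The model path, rank, and `target_modules` names are illustrative assumptions, not settings validated for this project:

```python
# Hypothetical sketch: wrapping a base model with LoRA adapters via `peft`.
# The rank, alpha, and target module names are illustrative and would need
# to be tuned for this specific setup.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained("path/to/base-model")  # placeholder path

lora_config = LoraConfig(
    r=16,              # rank; default ranks reportedly degraded performance here
    lora_alpha=32,     # scaling factor, commonly set to ~2x the rank
    lora_dropout=0.05,
    # Which attention projections to adapt is one of the hyperparameters
    # mentioned above; q/k/v/o projections are a common starting point.
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    bias="none",
)

model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # shows the reduced trainable-parameter count
```

Sweeping `r` and the set of adapted projections is essentially the hyperparameter experimentation described above.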

chmxu commented Jan 18, 2024

Thank you!

chmxu closed this as completed Jan 18, 2024