You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In Section 3.3, you specify that IA^3 adds a total of d_k + d_v + d_ff parameters.
However, if I look at this line, you seem to be allocating 2 * d vectors for each linear layer (multi_lora_a, multi_lora_b) and multiplying multi_lora_a with the input and multi_lora_b with the transformed input.
Hi :)
I was reading your interesting paper https://arxiv.org/pdf/2205.05638.pdf.
In Section 3.3, you specify that IA^3 adds a total of d_k + d_v + d_ff parameters.
However, if I look at this line, you seem to be allocating 2 * d vectors for each linear layer (multi_lora_a, multi_lora_b) and multiplying multi_lora_a with the input and multi_lora_b with the transformed input.
t-few/src/models/lora.py
Line 43 in 9dbc9cc
Am I missing something?
Thank you for your clarification :-)
The text was updated successfully, but these errors were encountered: