LoRA A and LoRA B dimensions mentioned in the paper are different from the implementation here. #983

s3pi · 2023-09-30T07:45:42Z

System Info

PEFT library

Who can help?

@sayakpaul

Information

The official example scripts
My own modified scripts

Tasks

An officially supported task in the examples folder
My own task or dataset (give details below)

Reproduction

(lora_A): ModuleDict(
  (default): Linear(in_features=20, out_features=8, bias=False)
)
(lora_B): ModuleDict(
  (default): Linear(in_features=8, out_features=2000, bias=False)
)

Expected behavior

The dimensions of the LoRA A and LoRA B layers should be the opposite of those implemented here.

s3pi · 2023-09-30T11:36:47Z

Why does the paper say BA while in this implementation it is lora_A * lora_B?

ChrisHayduk · 2023-09-30T18:50:38Z

The paper seems to have some inconsistency in which matrix it names A and which it names B. Take the main image from the paper for example:

In the above image, A is a d x r matrix and B is an r x k matrix (where d is the input dimension, k is the output dimension, and r is the LoRA adapter rank). In this case k = d. This suggests that AB is a d x k matrix, matching the dimensions of W, as desired. This is what PEFT has implemented.

However, later in the paper, the authors state that B is a d x r matrix and A is an r x k matrix, reversing their dimensionalities compared to the above image. This is why they use BA throughout the paper.

Both implementations are equivalent, just with different naming schemes.
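
To make the equivalence concrete, here is a minimal, dependency-free sketch. It is not PEFT's code: plain nested lists stand in for tensors, the helper names (matmul, transpose, linear, lora_A_weight, lora_B_weight) are made up for illustration, and the sizes are shrunk from the 20 / 8 / 2000 in the printout above. The one thing it relies on is that torch.nn.Linear stores its weight with shape (out_features, in_features) and computes x @ W^T, so a Linear(in_features=d, out_features=r) holds an r x d weight.

```python
# Illustrative sketch only: nested lists stand in for tensors.

def matmul(X, Y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*Y)] for row in X]

def transpose(X):
    """Swap rows and columns."""
    return [list(col) for col in zip(*X)]

def linear(inp, weight):
    """Bias-free linear layer using the nn.Linear convention: y = x @ W^T."""
    return matmul(inp, transpose(weight))

d_in, r, d_out = 4, 2, 3  # hypothetical small sizes

# Weights laid out as (out_features, in_features), like nn.Linear.
lora_A_weight = [[i + j for j in range(d_in)] for i in range(r)]   # r x d_in
lora_B_weight = [[i - j for j in range(r)] for i in range(d_out)]  # d_out x r

x = [[1, 2, 3, 4]]  # a single input as a row vector, 1 x d_in

# Implementation order: apply lora_A first, then lora_B.
out_impl = linear(linear(x, lora_A_weight), lora_B_weight)  # 1 x d_out

# Paper notation: delta_W = B A, applied to x as a column vector.
delta_W = matmul(lora_B_weight, lora_A_weight)          # d_out x d_in
out_paper = transpose(matmul(delta_W, transpose(x)))    # 1 x d_out

shape = (len(delta_W), len(delta_W[0]))
print(out_impl == out_paper, shape)
```

Under these assumptions, calling lora_A and then lora_B on a row vector gives the same result as multiplying the stored weights in the paper's BA order and applying that to the column vector, so only the naming and the row/column convention differ.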

github-actions · 2023-10-30T15:03:36Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

BenjaminBossan closed this as completed Oct 30, 2023
