
Add LoRA to BLOOM #3

Merged
merged 5 commits into bloom on Jun 29, 2022

Conversation

haileyschoelkopf
Collaborator

This PR adds LoRA to adapter-transformers' BLOOM implementation.

I have verified that with no LoRA adapter added to the model (just using the pfeiffer+inv config for adapters), the performance is exactly the same (with a fixed seed) as before this change, so it can be safely merged for us to try out.
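For reference, a minimal sketch of that equivalence check, assuming the standard adapter-transformers API and this fork's BLOOM support; the checkpoint name, seed, and prompt are illustrative, not the exact setup used:

```python
# Hypothetical version of the regression check described above: with a fixed
# seed and only a pfeiffer+inv adapter (no LoRA), the forward pass should be
# identical before and after this change.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

torch.manual_seed(42)  # fixed seed; newly added adapters are randomly initialized

tok = AutoTokenizer.from_pretrained("bigscience/bloom-560m")  # placeholder checkpoint
model = AutoModelForCausalLM.from_pretrained("bigscience/bloom-560m")

model.add_adapter("lang", config="pfeiffer+inv")  # adapters only, no LoRA
model.set_active_adapters("lang")

inputs = tok("BLOOM is a multilingual language model.", return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits

# Compare against the same forward pass on the pre-change branch; the tensors
# should match exactly if the new LoRA code path is inert.
print(logits.sum().item())
```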

TODO:

  • test madx_run_clm.py using a LoRA adapter and determine appropriate config values (see the sketch below).
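As a starting point for that TODO, a sketch of adding a LoRA adapter to BLOOM with adapter-transformers' LoRAConfig; the rank/alpha values and adapter name are placeholders, not the config this PR settles on:

```python
# Minimal sketch, assuming the standard adapter-transformers LoRA API applies
# to this fork's BLOOM implementation; r/alpha are hypothetical values.
from transformers import AutoModelForCausalLM
from transformers.adapters import LoRAConfig

model = AutoModelForCausalLM.from_pretrained("bigscience/bloom-560m")  # placeholder checkpoint

lora_config = LoRAConfig(r=8, alpha=16)  # config values still to be determined
model.add_adapter("bloom_lora", config=lora_config)
model.train_adapter("bloom_lora")  # freeze the base model; train only LoRA weights

print(model.adapter_summary())  # sanity-check trainable parameter counts
```

madx_run_clm.py would then only need to select a config like this when it sets up the adapter.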

@haileyschoelkopf haileyschoelkopf changed the base branch from master to bloom June 28, 2022 14:51
@yongzx
Owner

yongzx commented Jun 29, 2022

@haileyschoelkopf can you separate prefix tuning from this PR (by creating another PR)?

I have tested LoRA and the implementation looks correct! Once I have the training curves from madx_run_clm.py (currently running), I will merge this PR for LoRA.

@haileyschoelkopf
Collaborator Author

I've just done this! I'll open a separate PR for prefix tuning.
