
[LLM] Add Prefix Tuning, PTuning, LoRA, AdaLoRA and Adaption Prompt for LLM fine-tuning #3386

Merged
merged 38 commits into master from llm-lora on May 16, 2023

Conversation

@tgaddair (Collaborator) commented May 6, 2023

This PR adds support for parameter-efficient fine-tuning strategies: Prompt Tuning, Prefix Tuning, P-Tuning, LoRA, AdaLoRA, and Adaption Prompt.

To use them with Ludwig's LLM model type, specify the strategy name via the tuner field in the config:

  • Prompt Tuning: prompt_tuning
  • Prefix Tuning: prefix_tuning
  • P-Tuning: p_tuning
  • LoRA: lora
  • AdaLoRA: adalora
  • Adaption Prompt: adaption_prompt

Here's an example config:

model_type: llm
model_name: facebook/opt-66b
tuner: lora
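
For completeness, here's a fuller sketch with input and output features filled in (the column names below are illustrative placeholders, not part of this PR):

model_type: llm
model_name: facebook/opt-66b
input_features:
  - name: prompt        # placeholder input column
    type: text
output_features:
  - name: response      # placeholder output column
    type: text
tuner: lora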

Things to note:

  1. Prompt Tuning, Prefix Tuning, and P-Tuning require the num_virtual_tokens parameter to be set to a non-zero value (see the sketch after this list).
  2. LoRA is only supported for the following model types: t5, mt5, bart, opt, roberta, and deberta-v2.
  3. Adaption Prompt is only supported for Llama model types and requires the adapter_len and adapter_layers parameters to be set (see the sketch after this list).
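
To illustrate notes 1 and 3, here are two minimal config sketches. The placement of these parameters (shown at the top level alongside tuner) is an assumption, and the model names and values are purely illustrative:

# Prompt Tuning / Prefix Tuning / P-Tuning: num_virtual_tokens must be non-zero
model_type: llm
model_name: facebook/opt-350m      # illustrative checkpoint
tuner: prompt_tuning
num_virtual_tokens: 8              # illustrative non-zero value

# Adaption Prompt: Llama models only; adapter_len and adapter_layers are required
model_type: llm
model_name: <path-to-a-llama-checkpoint>   # placeholder
tuner: adaption_prompt
adapter_len: 4                     # illustrative value
adapter_layers: 1                  # illustrative value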

Co-authored by @arnavgarg1

@github-actions bot commented May 7, 2023

Unit Test Results

  6 files  ±0     6 suites  ±0   1h 31m 56s ⏱️ +16m 12s
 33 tests  ±0   28 ✔️ −1    4 💤 ±0   0 ±0   1 🔥 +1
100 runs   +1   87 ✔️ ±0   12 💤 ±0   0 ±0   1 🔥 +1

For more details on these errors, see this check.

Results for commit 29b6b0c. ± Comparison against base commit e2cde3a.

♻️ This comment has been updated with latest results.

@arnavgarg1 arnavgarg1 changed the title Added LoRA for LLM fine-tuning Added Prefix Tuning, PTuning, LoRA, AdaLoRA and Adaption Prompt for LLM fine-tuning May 8, 2023
@arnavgarg1 arnavgarg1 self-requested a review May 8, 2023 17:49
@arnavgarg1 arnavgarg1 changed the title Added Prefix Tuning, PTuning, LoRA, AdaLoRA and Adaption Prompt for LLM fine-tuning [LLM] Added Prefix Tuning, PTuning, LoRA, AdaLoRA and Adaption Prompt for LLM fine-tuning May 9, 2023
@arnavgarg1 arnavgarg1 changed the title [LLM] Added Prefix Tuning, PTuning, LoRA, AdaLoRA and Adaption Prompt for LLM fine-tuning [LLM] Add Prefix Tuning, PTuning, LoRA, AdaLoRA and Adaption Prompt for LLM fine-tuning May 15, 2023
@tgaddair (Collaborator, Author) commented

LGTM!

@tgaddair tgaddair merged commit cf250bc into master May 16, 2023
13 of 15 checks passed
@tgaddair tgaddair deleted the llm-lora branch May 16, 2023 00:09