LISA Finetuning Example #10743
Conversation
enable_xetla = self.q_proj.enable_xetla
if self.q_proj.qtype == SYM_INT4 or self.q_proj.qtype == FP8E5:
    enable_xetla = self.q_proj.enable_xetla
else:
Are you sure all other dtypes do not need xetla? Maybe change else to elif BF16.
Here are the qtypes supported by xetla:
supported_qtype = self.q_proj.qtype == SYM_INT4 and full_attn
supported_qtype = supported_qtype or self.q_proj.qtype == FP8E5
On XPU, the method returns True only when q_proj.qtype is among the supported qtypes (SYM_INT4/FP8E5) and enable_xetla is True:
return device.type == "xpu" and enable_xetla and supported_qtype
So if qtype is other types, just set enable_xetla to False.
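Putting the two fragments together, the gate can be sketched as a standalone check. This is a hypothetical sketch: QType and use_xetla are illustrative names, and the real code reads these values off q_proj rather than taking them as arguments.

```python
from enum import Enum, auto

class QType(Enum):
    # Illustrative stand-ins for the real qtype constants
    SYM_INT4 = auto()
    FP8E5 = auto()
    BF16 = auto()

def use_xetla(device_type, qtype, enable_xetla, full_attn=True):
    # SYM_INT4 is supported only with full attention; FP8E5 unconditionally
    supported_qtype = (qtype == QType.SYM_INT4 and full_attn) or qtype == QType.FP8E5
    # xetla is used only on XPU, only when enabled, only for supported qtypes
    return device_type == "xpu" and enable_xetla and supported_qtype
```

Any other qtype (e.g. BF16) makes supported_qtype False regardless of the flag, which is why the else branch can simply set enable_xetla to False.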
lgtm
Description
Add an example of the new LISA fine-tuning algorithm on IPEX-LLM with Llama 2 7B.
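For context, LISA's core idea is to keep most transformer layers frozen and periodically re-sample a small subset to train. A minimal sketch of that selection step (the function name, parameters, and seeding scheme are illustrative assumptions, not the example's actual code):

```python
import random

def sample_active_layers(num_layers, n_active, step, interval, seed=0):
    """Return the indices of the layers to unfreeze for the current step.

    The same subset is kept for `interval` consecutive steps, then
    re-sampled. Everything here is an illustrative sketch of LISA's
    layer selection, not the IPEX-LLM API.
    """
    window = step // interval  # which re-sampling window this step falls in
    rng = random.Random(seed * 1_000_003 + window)  # deterministic per window
    return sorted(rng.sample(range(num_layers), n_active))
```

In a training loop, one would call this every step and toggle requires_grad on the returned layers, leaving the rest frozen.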
A 'BF16Linear' object has no 'enable_xetla' attribute, so 'enable_xetla' applies only when q_proj.qtype is SYM_INT4 or FP8E5.
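One defensive way to handle the missing attribute is a getattr fallback, sketched here with stand-in classes (BF16LinearStub and Int4LinearStub are illustrative, not the real IPEX-LLM modules):

```python
class BF16LinearStub:
    """Stand-in for BF16Linear, which carries no `enable_xetla` attribute."""

class Int4LinearStub:
    """Stand-in for a quantized linear layer that does carry the flag."""
    enable_xetla = True

def read_enable_xetla(proj):
    # Fall back to False when the projection type has no `enable_xetla`,
    # matching the behaviour discussed above for non-SYM_INT4/FP8E5 layers.
    return getattr(proj, "enable_xetla", False)
```

This avoids the AttributeError while keeping xetla disabled for BF16 layers.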
2. User API changes
3. Summary of the change
LISA Finetuning Example
How to run
4. How to test?