[LLM] Fix llm models extension issue #955

changwangss · 2023-12-18T05:57:51Z

Type of Change

Fix the LLM models extension test issue:

Fix the transformers version for phi.
Add requirements for baichuan, qwen, and codegen.

Support the codegen SmoothQuant (sq) workflow. Because codegen uses a remote tokenizer, pass trust_remote_code so that the correct tokenizer is loaded. This requires transformers 4.35.2.
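For readers who want to see what the trust_remote_code point means in practice, here is a minimal sketch. It assumes the tokenizer is loaded through `transformers.AutoTokenizer`; the helper names (`parse_version`, `matches_required`, `load_codegen_tokenizer`) are hypothetical and are not taken from this PR. Whether 4.35.2 is an exact pin or a minimum is not stated above, so the sketch only reports a mismatch and does not enforce anything.

```python
from typing import Tuple

# Version named in the PR description; treated here as a reference value only.
REQUIRED_TRANSFORMERS = "4.35.2"


def parse_version(text: str) -> Tuple[int, ...]:
    """Turn '4.35.2' into (4, 35, 2), stopping at the first non-numeric part."""
    parts = []
    for piece in text.split("."):
        if not piece.isdigit():
            break  # e.g. the 'dev0' in '4.36.0.dev0'
        parts.append(int(piece))
    return tuple(parts)


def matches_required(installed: str, required: str = REQUIRED_TRANSFORMERS) -> bool:
    """Return True when the installed version equals the referenced one."""
    return parse_version(installed) == parse_version(required)


def load_codegen_tokenizer(model_name: str = "Salesforce/codegen25-7b-multi"):
    """Load a tokenizer whose implementation lives in the model repository.

    Hypothetical helper: transformers is imported lazily so the version
    helpers above can be used without it installed.
    """
    import transformers
    from transformers import AutoTokenizer

    if not matches_required(transformers.__version__):
        print(
            f"transformers {transformers.__version__} differs from "
            f"{REQUIRED_TRANSFORMERS}, the version named in this PR"
        )
    # trust_remote_code=True lets transformers run the tokenizer code shipped
    # with the model repository instead of falling back to a built-in class.
    return AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
```

Calling `load_codegen_tokenizer()` downloads and executes code from the model repository, so it should only be used with repositories you trust.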
FP32 with limit 20:

numactl -m 0 -C 0-55 python run_generation.py --model Salesforce/codegen25-7b-multi --accuracy --batch_size 20 --n_samples 20 --allow_code_execution --temperature 0.2 --do_sample --tasks "humaneval" --trust_remote_code True

Evaluating generations...
{
  "humaneval": {
    "pass@1": 0.5725,
    "pass@10": 0.649981867977224
  },
  "config": {
    "model": "llama",
    "temperature": 0.2,
    "n_samples": 20
  }
}
{'humaneval': {'pass@1': 0.5725, 'pass@10': 0.649981867977224}, 'config': {'model': 'llama', 'temperature': 0.2, 'n_samples': 20}}
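For context on the pass@1 and pass@10 figures: HumanEval-style harnesses commonly report the unbiased pass@k estimator, 1 - C(n-c, k) / C(n, k), averaged over problems, where n is the number of samples per problem (20 here) and c is the number that pass the tests. The sketch below assumes that estimator is what produced the numbers above; the per-problem counts in the usage lines are made up for illustration and are not from this run.

```python
from math import comb
from typing import Sequence


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k for one problem: 1 - C(n - c, k) / C(n, k)."""
    if n - c < k:
        # Fewer than k failing samples: every k-subset contains a pass.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_k(correct_counts: Sequence[int], n: int, k: int) -> float:
    """Average the per-problem estimate over all evaluated problems."""
    return sum(pass_at_k(n, c, k) for c in correct_counts) / len(correct_counts)


# Hypothetical counts of passing samples for four problems, n = 20 each.
example_counts = [20, 10, 0, 5]
print(mean_pass_at_k(example_counts, n=20, k=1))
print(mean_pass_at_k(example_counts, n=20, k=10))
```

With k = 1 the estimator reduces to c / n, so pass@1 is simply the average fraction of passing samples per problem.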

INT8 SmoothQuant (sq) with alpha 0.5, calib_iters 5, limit 20:

python run_generation.py --model Salesforce/codegen25-7b-multi --alpha 0.5 --accuracy --batch_size 20 --n_samples 20 --allow_code_execution --temperature 0.2 --do_sample --tasks "humaneval" --limit 20 --trust_remote_code True --sq --int8 --calib_iters 5

Evaluating generations...
{
  "humaneval": {
    "pass@1": 0.3575,
    "pass@10": 0.4497261252679209
  },
  "config": {
    "model": "llama",
    "temperature": 0.2,
    "n_samples": 20
  }
}
{'humaneval': {'pass@1': 0.3575, 'pass@10': 0.4497261252679209}, 'config': {'model': 'llama', 'temperature': 0.2, 'n_samples': 20}}
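To make the gap between the two runs easier to read, the sketch below computes the absolute and relative drop from the FP32 results to the INT8 results, using the figures reported above. It assumes both runs cover the same set of problems; the function name `accuracy_drop` is hypothetical.

```python
from typing import Dict

# Metrics copied from the two evaluation outputs in this PR description.
fp32 = {"pass@1": 0.5725, "pass@10": 0.649981867977224}
int8_sq = {"pass@1": 0.3575, "pass@10": 0.4497261252679209}


def accuracy_drop(
    baseline: Dict[str, float], quantized: Dict[str, float]
) -> Dict[str, Dict[str, float]]:
    """Return absolute and relative drops for every metric in the baseline."""
    drops = {}
    for metric, base_value in baseline.items():
        absolute = base_value - quantized[metric]
        drops[metric] = {
            "absolute": absolute,
            "relative": absolute / base_value,  # fraction of the baseline lost
        }
    return drops


for metric, drop in accuracy_drop(fp32, int8_sq).items():
    print(f"{metric}: -{drop['absolute']:.4f} absolute, -{drop['relative']:.1%} relative")
```

With only 20 problems and sampling at temperature 0.2, differences of this kind carry noticeable variance, so they are best read as a rough indication.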


Signed-off-by: changwangss <chang1.wang@intel.com>

VincyZhang · 2023-12-18T07:55:00Z

https://inteltf-jenk.sh.intel.com/view/ITREX-release/job/ITREX-1.3-release-test/44/

changwangss · 2023-12-19T04:16:01Z

https://inteltf-jenk.sh.intel.com/view/ITREX-release/job/ITREX-1.3-release-test/47/

VincyZhang · 2023-12-19T05:44:57Z

https://inteltf-jenk.sh.intel.com/job/ITREX-1.3-release-test/49/


changwangss · 2023-12-21T09:17:42Z

Local test for qwen:

python -u ./run_generation.py --model Qwen/Qwen-7B --alpha 0.9 --output_dir saved_results_qwen --trust_remote_code True --int8 --benchmark --batch_size 1 --_commit_hash f7bc352f27bb1c02ee371a4576942a7d96c8bb97 --calib_iter 10 --sq


VincyZhang · 2023-12-21T10:38:10Z

CI all pass, ready for merge.

fix extension issue

e2e6e5d

Signed-off-by: changwangss <chang1.wang@intel.com>

changwangss requested review from VincyZhang and thuang6 December 18, 2023 07:18

thuang6 approved these changes Dec 19, 2023


VincyZhang approved these changes Dec 19, 2023


changwangss added 2 commits December 19, 2023 02:28

fix qwen,falcon,baichuan

4d2fee9

Signed-off-by: changwangss <chang1.wang@intel.com>

fix qwen

4e174d2

Signed-off-by: changwangss <chang1.wang@intel.com>

changwangss requested a review from PenghuiCheng as a code owner December 19, 2023 17:30

fix falcon and offline validate with transformers 4.33

7f7fc98

Signed-off-by: changwangss <chang1.wang@intel.com>

VincyZhang added the ITREX-1.3 label Dec 20, 2023

hshen14 reviewed Dec 20, 2023


examples/huggingface/pytorch/code-generation/quantization/run_generation.py

changwangss added 6 commits December 20, 2023 15:14

fix bloom generate and qwen version

b934747

Signed-off-by: changwangss <chang1.wang@intel.com>

fix baichuan

7188e90

Signed-off-by: changwangss <chang1.wang@intel.com>

fix baichuan

030384f

Signed-off-by: changwangss <chang1.wang@intel.com>

fix name

33a80da

Signed-off-by: changwangss <chang1.wang@intel.com>

add qwen commit

726d952

Signed-off-by: changwangss <chang1.wang@intel.com>

Merge branch 'main' into wangchang/req

31c3f16

Signed-off-by: Wang, Chang <chang1.wang@intel.com>

VincyZhang reviewed Dec 21, 2023


intel_extension_for_transformers/transformers/utils/utility.py (outdated)

VincyZhang and others added 2 commits December 21, 2023 14:09

Update requirements.txt

84ea275

Signed-off-by: VincyZhang <wenxin.zhang@intel.com>

Merge branch 'wangchang/req' of https://github.com/intel/intel-extens…

d4f2609

…ion-for-transformers into wangchang/req

Merge branch 'main' into wangchang/req

a764bd4

Signed-off-by: VincyZhang <wenxin.zhang@intel.com>

VincyZhang merged commit 1967445 into main Dec 21, 2023

VincyZhang deleted the wangchang/req branch December 21, 2023 10:38
