[LLM Runtime] Add Script for PPL Evaluation #685
Conversation
intel_extension_for_transformers/llm/runtime/graph/application/main_pybind.cpp (review thread resolved)
airMeng left a comment:
add a note on homepage readme
intel_extension_for_transformers/llm/runtime/graph/scripts/perplexity.py (review threads resolved)
intel_extension_for_transformers/llm/runtime/graph/application/main_pybind.cpp (review threads resolved, one outdated)
Do we have PPL data for PyTorch and for the cpp graph FP32/INT4?
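For context (not part of this PR), the PyTorch side of such a comparison is usually a standard Hugging Face perplexity loop. A minimal sketch, assuming wikitext-2 as the dataset and opt-1.3b as a placeholder model (both are illustrative choices, not necessarily what the internal test uses):

```python
# Minimal sketch of an FP32 PyTorch perplexity baseline with Hugging Face
# transformers; the model and dataset names are illustrative placeholders.
import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "facebook/opt-1.3b"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.float32)
model.eval()

text = "\n\n".join(load_dataset("wikitext", "wikitext-2-raw-v1", split="test")["text"])
input_ids = tokenizer(text, return_tensors="pt").input_ids

max_len, stride = 2048, 2048
nlls, n_tokens = [], 0
with torch.no_grad():
    for begin in range(0, input_ids.size(1) - 1, stride):
        chunk = input_ids[:, begin : begin + max_len]
        # For causal LMs, passing labels=input_ids returns the mean next-token NLL.
        loss = model(chunk, labels=chunk).loss
        nlls.append(loss * (chunk.size(1) - 1))
        n_tokens += chunk.size(1) - 1

ppl = torch.exp(torch.stack(nlls).sum() / n_tokens)
print(f"FP32 PyTorch PPL: {ppl.item():.4f}")
```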
Force-pushed from 856b6cd to 9e30626.
intel_extension_for_transformers/llm/runtime/graph/application/main_pybind.cpp (review thread outdated, resolved)
Force-pushed from c29121b to 57943ec.
gpt-neox-20b and opt-1.3b are checked in this internal extension test: https://inteltf-jenk.sh.intel.com/job/ITREX-cpp-graph-extension-WIP/33/artifact/report.html
a32543254 left a comment:
LGTM
Signed-off-by: zhenwei-intel <zhenwei.liu@intel.com>
Signed-off-by: VincyZhang <wenxin.zhang@intel.com>
hshen14 left a comment:
What's the purpose of introducing a PPL script here? Can we just reuse lm-eval-harness to measure PPL?
It is for internal usage of the LLM Runtime, to keep track of inference accuracy with minimal added test time. It also works as a usage example of the LLM Runtime (llama.cpp also has a PPL script in its repo).
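For readers of this thread, the core computation such a script performs is backend-independent: PPL is the exponential of the mean per-token negative log-likelihood. A minimal sketch, where `runtime_logits` is a hypothetical stand-in for whatever produces next-token scores (it is not the actual LLM Runtime API):

```python
# Sketch of the core PPL computation a perplexity script performs, independent
# of the backend producing the logits. `runtime_logits` is a hypothetical
# callable, not the real pybind interface added in this PR.
import numpy as np

def perplexity(token_ids, runtime_logits):
    """token_ids: 1-D int array; runtime_logits(ids) -> [len(ids), vocab] scores."""
    logits = runtime_logits(token_ids)                 # raw scores per position
    # log-softmax for numerical stability
    logits = logits - logits.max(axis=-1, keepdims=True)
    log_probs = logits - np.log(np.exp(logits).sum(axis=-1, keepdims=True))
    # position i predicts token i+1
    nll = -log_probs[np.arange(len(token_ids) - 1), token_ids[1:]]
    return float(np.exp(nll.mean()))
```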

Type of Change: Feature
API not changed
Description
- Add a perplexity (PPL) evaluation script (scripts/perplexity.py)
- Add a cal_diff.py script for future CI @zhenwei-intel (test_llm_runtime.py); see the illustrative sketch after this list
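The contents of cal_diff.py are not shown in this conversation. Purely as an illustration of what a diff check for CI might compute between two backends' outputs (the function name, signature, and tolerances below are assumptions, not the PR's code):

```python
# Illustrative only: one plausible shape for a cal_diff-style CI check that
# compares reference and test logits; everything here is a hypothetical sketch.
import numpy as np

def cal_diff(ref_logits: np.ndarray, test_logits: np.ndarray, rtol=1e-2, atol=1e-1):
    """Report max abs diff and cosine similarity between reference and test logits."""
    max_abs = float(np.abs(ref_logits - test_logits).max())
    cos = float(
        np.dot(ref_logits.ravel(), test_logits.ravel())
        / (np.linalg.norm(ref_logits) * np.linalg.norm(test_logits))
    )
    ok = bool(np.allclose(ref_logits, test_logits, rtol=rtol, atol=atol))
    return {"max_abs_diff": max_abs, "cosine_similarity": cos, "within_tolerance": ok}
```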
Expected Behavior & Potential Risk
N/A
How has this PR been tested?
Internal extension test:
Dependency Change?
No