This repository was archived by the owner on Sep 23, 2025. It is now read-only.

Conversation


@minmingzhu minmingzhu commented Jun 16, 2024

After trying several approaches, I settled on the tokenize_func from neural_chat:

  1. Using the transformers tokenizer with padding and attention_mask, and copying input_ids to labels.
  2. Using the transformers tokenizer with padding and attention_mask, and masking the input portion of the prompt in labels.
  3. Using the transformers tokenizer with padding and attention_mask, and masking the response portion of the prompt in labels.
  4. Using neural_chat's tokenize_func for padding, filling attention_mask and labels, and masking the input portion of the prompt in labels.
| Case   | Average | ARC   | HellaSwag | MMLU  | TruthfulQA | Winogrande |
|--------|---------|-------|-----------|-------|------------|------------|
| Case 1 | 62.116  | 55.97 | 78.39     | 60.75 | 45.86      | 69.61      |
| Case 2 | 58.99   | 54.18 | 72.85     | 60.23 | 42.18      | 65.51      |
| Case 3 | 59.064  | 53.24 | 73.21     | 60.85 | 41.56      | 66.46      |
| Case 4 | 63.03   | 53.67 | 81.47     | 60.27 | 43.02      | 76.72      |
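A minimal sketch of the Case 4 label construction, assuming token-id lists and a right-padding scheme (the helper name and interface are hypothetical; the actual neural_chat tokenize_func differs in details). Prompt tokens and padding are masked with `-100`, which PyTorch's `CrossEntropyLoss` ignores, so loss is computed only on the response:

```python
IGNORE_INDEX = -100  # default ignore_index of torch.nn.CrossEntropyLoss


def tokenize_example(prompt_ids, response_ids, max_len, pad_id):
    """Build input_ids, attention_mask, and labels for causal-LM finetuning.

    labels mirror input_ids, but the prompt (input) tokens and the padding
    are replaced with IGNORE_INDEX so only response tokens contribute to loss.
    """
    # Concatenate prompt and response, truncating to max_len.
    input_ids = (prompt_ids + response_ids)[:max_len]

    # Mask the prompt portion; keep response token ids as supervision targets.
    n_prompt = min(len(prompt_ids), max_len)
    labels = [IGNORE_INDEX] * n_prompt + response_ids[: max(0, max_len - len(prompt_ids))]

    # attention_mask: 1 for real tokens, 0 for padding.
    attention_mask = [1] * len(input_ids)

    # Right-pad everything to max_len.
    pad = max_len - len(input_ids)
    input_ids += [pad_id] * pad
    attention_mask += [0] * pad
    labels += [IGNORE_INDEX] * pad

    return {"input_ids": input_ids, "attention_mask": attention_mask, "labels": labels}


example = tokenize_example(prompt_ids=[1, 2, 3], response_ids=[4, 5], max_len=8, pad_id=0)
# input_ids      -> [1, 2, 3, 4, 5, 0, 0, 0]
# attention_mask -> [1, 1, 1, 1, 1, 0, 0, 0]
# labels         -> [-100, -100, -100, 4, 5, -100, -100, -100]
```

Case 1 corresponds to skipping the masking step (labels = input_ids), which is why its loss also rewards reproducing the prompt.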

@minmingzhu minmingzhu changed the title Fix evaluation [Finetune] Fix evaluation Jun 17, 2024
Signed-off-by: minmingzhu <minming.zhu@intel.com>
@minmingzhu minmingzhu merged commit 0b44ac4 into intel:main Jun 28, 2024
