Fix bf16 inference accuracy for mistral, phi3, dbrx #833
Conversation
Thanks @eaidova
Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>
Force-pushed from 6cc926b to 492ad59
@echarlaix could you please merge? These changes also contribute to supporting the llama model changes in transformers 4.43.
A couple of tests are failing, can you take a look before we merge?
@echarlaix all of them are unrelated: gpt-bigcode failed due to an issue in the optimum patcher (not on our level), and optimum-cli failed due to the removal of the bloomz config from the default configs (which changes the group size, so the config is no longer suitable for testing with a small model) - I'll take a look separately at how we can update this test. For mistral/dbrx, I'm looking into it.
Thanks for fixing! Will take care of the fix for gpt-bigcode.
* Fix bf16 inference accuracy for mistral, phi3, dbrx
* reuse inv_freq
* Apply suggestions from code review
* make dim and base optional
* fix model patcher for dbrx and add bitwise fix for mistral

---------

Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>
What does this PR do?
I found that the range of models requiring the changes from #783 is not limited to llama and gemma; the same issue, depending on the transformers version, also affects phi3, mistral, and dbrx.
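For context on why reusing a full-precision `inv_freq` matters: rotary-embedding frequencies lose precision when round-tripped through bfloat16, and the angle error grows with the token position. The following is a minimal pure-Python sketch of that effect (an illustration only, not the actual patch, which lives in this repo's model patchers); the `to_bf16` helper and the `dim`/`base` values are assumptions chosen to mirror typical RoPE defaults.

```python
import struct

def to_bf16(x: float) -> float:
    """Round a Python float to bfloat16 precision (float32 with a 7-bit mantissa)."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # round-to-nearest-even on the 16 bits that bf16 discards
    bits = (bits + 0x7FFF + ((bits >> 16) & 1)) & 0xFFFF0000
    return struct.unpack("<f", struct.pack("<I", bits))[0]

dim, base = 64, 10000.0

# inv_freq computed once in full precision (the value worth reusing)
inv_freq_fp32 = [base ** (-2 * i / dim) for i in range(dim // 2)]

# the same values after a round-trip through bf16
inv_freq_bf16 = [to_bf16(f) for f in inv_freq_fp32]

# at a large position the rotation-angle error becomes visible (in radians)
pos = 4096
worst = max(abs(pos * a - pos * b) for a, b in zip(inv_freq_fp32, inv_freq_bf16))
print(f"worst-case angle error at position {pos}: {worst:.4f} rad")
```

The per-element relative error of bf16 is only about 2^-9, but multiplied by a position index in the thousands it can shift rotation angles by whole radians, which is consistent with the accuracy degradation this PR addresses.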