[PTQ][OV] BF16 support #2307

KodiaqQ · 2023-12-07T20:21:49Z

Changes

Added BF16 type support
Added FakeQuantize (FQ) parameter generation based on the data type
Extended the list of supported input data types for OpenVINO with ov.Tensor
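For readers unfamiliar with the type: BF16 keeps the sign bit, the 8 exponent bits, and the top 7 mantissa bits of an IEEE 754 FP32 value. The sketch below illustrates that relationship using only the standard library. It is not the code added in this PR; the helper names `float32_to_bf16_bits` and `bf16_bits_to_float32` are hypothetical, and round-to-nearest-even is assumed as the rounding mode.

```python
import struct


def float32_to_bf16_bits(value: float) -> int:
    """Return the 16-bit BF16 pattern for `value` (hypothetical helper).

    Assumes round-to-nearest-even when dropping the low 16 bits of FP32.
    """
    bits = struct.unpack("<I", struct.pack("<f", value))[0]
    # NaN: exponent all ones and a non-zero mantissa. Force a mantissa bit
    # so that truncation cannot turn the NaN into infinity.
    if (bits & 0x7F800000) == 0x7F800000 and (bits & 0x007FFFFF):
        return ((bits >> 16) | 0x0040) & 0xFFFF
    # Add 0x7FFF plus the lowest kept bit: ties round to an even result.
    rounding_bias = 0x7FFF + ((bits >> 16) & 1)
    return ((bits + rounding_bias) >> 16) & 0xFFFF


def bf16_bits_to_float32(bits16: int) -> float:
    """Widen a BF16 bit pattern to FP32 by zero-filling the low 16 bits."""
    return struct.unpack("<f", struct.pack("<I", (bits16 & 0xFFFF) << 16))[0]


if __name__ == "__main__":
    for x in (1.0, -2.0, 3.14159265):
        b = float32_to_bf16_bits(x)
        print(f"{x!r} -> 0x{b:04X} -> {bf16_bits_to_float32(b)!r}")
```

Because BF16 shares the FP32 exponent range, widening is lossless, while narrowing loses mantissa precision only.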

Reason for changes

BF16 support

Related tickets

126782

Tests

Updated existing tests with BF16
manual/post_training_weight_compression/99 - no regressions (failure due to CI issue)
manual/post_training_quantization/421 - no regressions (failure due to CI issue)

codecov · 2023-12-07T20:23:52Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 91.20%. Comparing base (f4bd077) to head (9471ac8).

Additional details and impacted files

@@           Coverage Diff            @@
##           develop    #2307   +/-   ##
========================================
  Coverage    91.19%   91.20%           
========================================
  Files          483      483           
  Lines        46443    46435    -8     
========================================
- Hits         42355    42351    -4     
+ Misses        4088     4084    -4
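
The percentages in the diff can be rechecked from the hit and line counts. The sketch below is only an arithmetic check; the function name `coverage_percent_truncated` is hypothetical. The base value 42355 / 46443 is about 91.198%, so the reported 91.19% matches truncation to two decimals rather than rounding. That is an inference from these numbers, not a documented statement about Codecov.

```python
def coverage_percent_truncated(hits: int, lines: int) -> str:
    """Format hits/lines as a percentage truncated to two decimals."""
    hundredths = hits * 10000 // lines  # integer math avoids float rounding
    return f"{hundredths // 100}.{hundredths % 100:02d}%"


# (hits, lines) taken from the coverage diff above
base = (42355, 46443)
head = (42351, 46435)

if __name__ == "__main__":
    for name, (hits, lines) in (("base", base), ("head", head)):
        misses = lines - hits
        print(name, coverage_percent_truncated(hits, lines), "misses:", misses)
```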

Files	Coverage Δ
nncf/openvino/engine.py	`96.55% <ø> (ø)`
nncf/openvino/graph/model_transformer.py	`94.84% <100.00%> (+0.82%)`	⬆️
nncf/openvino/graph/node_utils.py	`98.80% <100.00%> (-0.02%)`	⬇️
nncf/openvino/graph/transformations/commands.py	`97.67% <100.00%> (+0.11%)`	⬆️
nncf/openvino/quantization/quantize_ifmodel.py	`100.00% <100.00%> (ø)`
.../algorithms/weight_compression/openvino_backend.py	`98.84% <100.00%> (-0.01%)`	⬇️

Flag	Coverage Δ
COMMON	`41.93% <0.00%> (+<0.01%)`	⬆️
ONNX	`34.19% <0.00%> (+<0.01%)`	⬆️
OPENVINO	`40.98% <100.00%> (-0.01%)`	⬇️
TENSORFLOW	`29.39% <0.00%> (+<0.01%)`	⬆️
TORCH	`65.11% <7.54%> (+0.01%)`	⬆️

Flags with carried forward coverage won't be shown.

Components	Coverage Δ
common	`93.54% <ø> (ø)`
torch	`93.65% <ø> (ø)`
tensorflow	`93.26% <ø> (ø)`
onnx	`93.06% <ø> (ø)`
openvino	`94.62% <100.00%> (+0.10%)`	⬆️
ptq	`90.50% <100.00%> (-0.01%)`	⬇️

KodiaqQ · 2023-12-12T18:04:36Z

@alexsu52, @l-bat, please review.

alexsu52 · 2024-06-25T09:11:19Z

I would suggest checking the accuracy and performance of the weight compression algorithms for FP32 and FP16 precision.

Here are the results of weight compression validation on the local machine (i9-10980XE):

Model	Backend	Metric name	Metric value (develop)	Metric value (bf16 branch)	Compr. time (develop)	Compr. time (bf16 branch)
tinyllama_data_aware	OV	Similarity	0.83853	0.83853	00:01:26	00:01:26
tinyllama_data_free	OV	Similarity	0.72057	0.72057	00:00:44	00:00:44

Also, here are the numbers from examples/llm_compression/openvino/tiny_llama. [Images: results on develop and on the BF16 branch]

There were no degradations observed. @alexsu52, how did you reproduce the issue?

I used a model with FP16 weights.

KodiaqQ · 2024-07-10T10:05:36Z

@andrey-churkin, @l-bat, @kshpv, @alexsu52, @daniil-lyakhov, @andreyanufr, review, please.

AS accepted the merge of this PR.

KodiaqQ · 2024-07-12T12:37:43Z

FP16 model as input (bloomz-560m). [Images: results on develop and on the branch]

### Changes
- Fixed shared memory warnings issue.
- Reverted cast for data type while creating constant.
### Reason for changes
- Leftover from #2307.
- Bugfix.
### Related tickets
- 147255
### Tests
- N/A

### Changes
- Fixed shared memory warnings issue.
- Reverted cast for data type while creating constant.
### Reason for changes
- Leftover from openvinotoolkit#2307.
- Bugfix.
### Related tickets
- 147255
### Tests
- N/A

(cherry picked from commit b328e4d)

### Changes
- Fixed `model_extraction_command` issue introduced in #2307.
### Reason for changes
- Bugfix.
### Related tickets
- 147065
### Tests
- Updated

KodiaqQ added 5 commits December 7, 2023 12:30

Added BF16 & ov.Tensor support

26b6c73

Add FQ params dtype conversion

5f02d99

Update tests for BF16

2bcfca9

Fix tests

65ce6dc

Fix bf16 tests

4f91018

github-actions bot added the NNCF OpenVINO label Dec 7, 2023

Added const with types

69e2297

KodiaqQ mentioned this pull request Dec 8, 2023

[254-llm-chatbot] NNCF is not yet supported OpenVINO data type: bf16. openvinotoolkit/openvino_notebooks#1514

Closed

KodiaqQ requested review from alexsu52, l-bat and andrey-churkin December 11, 2023 13:10

KodiaqQ marked this pull request as ready for review December 11, 2023 13:10

KodiaqQ requested a review from a team as a code owner December 11, 2023 13:10

l-bat reviewed Dec 12, 2023

nncf/openvino/graph/model_transformer.py (outdated, resolved)

KodiaqQ added 2 commits December 12, 2023 17:45

Apply comment

de85c1d

Disable tests

058a6e1

KodiaqQ added 2 commits December 12, 2023 19:09

Added PrePostProcessor for FP32 outputs

12447d9

Remove BF16 from testing

627ff67

KodiaqQ marked this pull request as draft December 13, 2023 11:04

KodiaqQ added 5 commits January 16, 2024 12:43

Merge remote-tracking branch 'openvinotoolkit/develop' into nm/bf16_s…

6f011d9

…upport

Adjust to develop

f3c8ed8

Adjust BF16 suport in tests

c97c616

Added opset.constant with shared_memory option

ccd0b91

Merge remote-tracking branch 'openvinotoolkit/develop' into nm/bf16_s…

7f670a0

…upport

github-actions bot added the NNCF PTQ label Jan 23, 2024

KodiaqQ added 3 commits January 23, 2024 10:08

Added cast to fp32

91bb312

Merge openvinotoolkit/develop into nm/bf16_support

ff8f0ca

Removed PrePostProcessor usage

ca6ff73

KodiaqQ marked this pull request as ready for review June 25, 2024 07:17

KodiaqQ marked this pull request as draft June 25, 2024 14:51

KodiaqQ added 2 commits July 10, 2024 11:13

Limit .get_data usage

8ffe8ae

Merge remote-tracking branch 'openvinotoolkit/develop' into nm/bf16_s…

5405bc9

…upport

KodiaqQ marked this pull request as ready for review July 10, 2024 09:24

KodiaqQ requested review from daniil-lyakhov and andreyanufr July 10, 2024 09:24

KodiaqQ added 3 commits July 10, 2024 11:31

Limit shared_memory usage

5f4062b

Fix WC

cfa7ce9

Fix test_get_const_value

f2add1f

andreyanufr approved these changes Jul 10, 2024

kshpv reviewed Jul 11, 2024

nncf/openvino/graph/node_utils.py (resolved)

KodiaqQ requested a review from kshpv July 11, 2024 09:36

Apply comment

5725636

kshpv approved these changes Jul 11, 2024

daniil-lyakhov reviewed Jul 11, 2024

nncf/openvino/graph/node_utils.py (resolved)

daniil-lyakhov approved these changes Jul 11, 2024

andrey-churkin approved these changes Jul 12, 2024

nncf/openvino/engine.py (outdated, resolved)

l-bat approved these changes Jul 12, 2024

Apply minor comments

3e531c4

KodiaqQ merged commit 6926cf1 into openvinotoolkit:develop Jul 12, 2024
12 checks passed

This was referenced Jul 18, 2024

Fix for model transformer command #2823

Merged

Fix shared_memory issue with cast #2834

Merged

alexsu52 pushed a commit that referenced this pull request Jul 23, 2024

Fix for model transformer command (#2823)

5f5562d
