
Add support for INT8 quantization of fusion_gru op #27330

Closed

wojtuss opened this issue Sep 15, 2020 · 5 comments


wojtuss commented Sep 15, 2020

Please enable INT8 quantization of the fusion_gru op using the oneDNN-based INT8 kernel.
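For background, here is a minimal NumPy sketch of the symmetric INT8 quantization scheme such a kernel builds on. It is illustrative only: the helper names and tensor shapes are hypothetical, not the Paddle or oneDNN API.

```python
import numpy as np

def int8_scale(tensor):
    # Symmetric quantization: map the largest absolute value in the
    # tensor onto the INT8 range [-127, 127].
    return 127.0 / np.abs(tensor).max()

def quantize_int8(tensor, scale):
    # Scale, round to the nearest integer, and clip to INT8.
    return np.clip(np.round(tensor * scale), -127, 127).astype(np.int8)

def dequantize(q_tensor, scale):
    # Recover an FP32 approximation of the original values.
    return q_tensor.astype(np.float32) / scale

# Hypothetical GRU gate weight matrix (shape chosen for illustration).
w_gates = np.random.randn(64, 192).astype(np.float32)
scale = int8_scale(w_gates)
w_int8 = quantize_int8(w_gates, scale)
error = np.abs(w_gates - dequantize(w_int8, scale)).max()
print(f"max per-element quantization error: {error:.6f}")
```

The INT8 kernel performs the gate matrix multiplications on quantized values and applies the scales to recover FP32 outputs, which is what makes VNNI-capable hardware faster at this workload.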

wojtuss self-assigned this Sep 15, 2020

wojtuss commented Sep 17, 2020

After enabling INT8 quantization of the fusion_gru op (a PR will be submitted soon), we achieved the following accuracy results for the QAT GRU model on a CLX 6248 (with VNNI support):

Metric       qat (fp32)   int8       diff
Precision    0.89198      0.89221     0.00023
Recall       0.89449      0.89412    -0.00037
F1 score     0.89323      0.89316    -0.00007

lidanqing-intel (Contributor) commented:

PR #27481 submitted


wojtuss commented Nov 30, 2020

Support for INT8 quantization of the GRU model has been added to Paddle. The only remaining piece is a version of oneDNN with additional GRU INT8 optimizations; the oneDNN upgrade will be submitted in PR #28420.


wojtuss commented Jan 4, 2021

PR #28420 has been merged, so support for INT8 quantization of the GRU model is complete.

wojtuss closed this as completed Jan 4, 2021
