
Add INT8 Stable Diffusion through Optimum #1324

Open
wants to merge 5 commits into base: main

Conversation

hshen14
Contributor

@hshen14 hshen14 commented Nov 17, 2022

8-bit quantization is useful for improving inference performance. This PR adds INT8 quantization for Stable Diffusion through the Optimum-Intel quantization API, which is built on top of Intel Neural Compressor. The sample code is implemented in Optimum-Intel.
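For context, the core idea behind 8-bit quantization is mapping floating-point values onto the int8 range using a scale and zero point. The sketch below illustrates that arithmetic in plain Python; it is only an illustration of the underlying math, not the Optimum-Intel or Intel Neural Compressor API (the real integration lives in the Optimum-Intel sample code referenced above).

```python
# Minimal sketch of affine (asymmetric) INT8 quantization: floats are
# mapped to int8 via a per-tensor scale and zero point, then mapped back.
# Illustration only -- NOT the Optimum-Intel / Neural Compressor API.

def quantize(values, qmin=-128, qmax=127):
    """Quantize a list of floats to int8 with a per-tensor scale/zero-point."""
    lo, hi = min(values), max(values)
    scale = (hi - lo) / (qmax - qmin) or 1.0  # avoid zero scale for constants
    zero_point = round(qmin - lo / scale)
    q = [max(qmin, min(qmax, round(v / scale) + zero_point)) for v in values]
    return q, scale, zero_point

def dequantize(q, scale, zero_point):
    """Recover approximate float values from the int8 representation."""
    return [(v - zero_point) * scale for v in q]

weights = [-1.5, -0.3, 0.0, 0.7, 2.0]
q, s, z = quantize(weights)
approx = dequantize(q, s, z)
```

The round trip loses at most about one quantization step of precision per value, which is why INT8 inference can stay close to FP32 accuracy while using a quarter of the memory bandwidth per weight.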

@hshen14
Contributor Author

hshen14 commented Nov 17, 2022

@patrickvonplaten please review this one. Thanks.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

@anton-l
Member

anton-l commented Nov 17, 2022

cc @echarlaix @michaelbenayoun

@anton-l
Member

anton-l commented Nov 17, 2022

Discussed with @echarlaix offline; it seems the neural-compressor + optimum integration will refactor its API quite soon. Should we hold off on the promotion until then?

@echarlaix
Contributor

Hi @hshen14,

Let's wait for the neural-compressor and optimum-intel refactoring before increasing visibility!

@hshen14
Contributor Author

hshen14 commented Nov 18, 2022

Hi @hshen14,

Let's wait for the neural-compressor and optimum-intel refactoring before increasing visibility!

Thanks @anton-l @echarlaix. Sure, let's do that.

Contributor

@patrickvonplaten patrickvonplaten left a comment


@echarlaix waiting until you give me the green light here :-)

@github-actions

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@github-actions github-actions bot added the "stale" label (Issues that haven't received updates) on Dec 24, 2022
@hshen14
Contributor Author

hshen14 commented Dec 24, 2022

Optimum-Intel is currently being upgraded to the INC v2.0 API. We will revisit this PR after the upgrade is done.

@github-actions

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@Thomas-MMJ

bump to keep issue open

@hshen14
Contributor Author

hshen14 commented Jan 19, 2023

@echarlaix, do you think it's a good time to revisit this? Thanks.

@echarlaix
Contributor

Sure, I will work on it and open a PR on diffusers once everything is finalized. Does that work for you, @hshen14?

@patrickvonplaten patrickvonplaten added the "wip" label and removed the "stale" label (Issues that haven't received updates) on Mar 2, 2023
@CrazyBoyM

great job.

@hshen14
Contributor Author

hshen14 commented May 26, 2023

Sure, I will work on it and open a PR on diffusers once everything is finalized. Does that work for you, @hshen14?

That would work perfectly! Thanks @echarlaix

@Ender436

Is INT8 quantization still in the works? I would find it extremely helpful on some of the devices I'm trying to use, especially when running on CPU.

@patrickvonplaten
Contributor

cc @yiyixuxu @sayakpaul @DN6 here

@sayakpaul
Member

I think the better person to tag here would be @echarlaix.


9 participants