Get Flux working on Apple Silicon #1264

Merged 1 commit into lllyasviel:main on Sep 13, 2024
Conversation

conornash (Contributor)

Should fix #1103

vicento commented Aug 19, 2024

we would love to see Flux work on Silicon Mac

l0stl0rd commented Aug 22, 2024

Yep, waiting impatiently to try the quantized (Q) versions, or maybe the NF4.

l0stl0rd commented Aug 22, 2024

> Should fix #1103

Also, shouldn't it be possible to set it to bfloat16 too, or offer it as an option? If I remember right, it is supported on MPS starting with PyTorch 2.3.0. I did try it once in Invoke.

OK, apparently bfloat16 only works on M2 or newer, but it would still be nice to have it ;)
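A minimal sketch (not from the PR) of how one might probe for bfloat16 support at runtime, falling back to float16 on machines or PyTorch builds that reject it:

    import torch

    def pick_mps_dtype():
        # Prefer bfloat16 on MPS when the hardware/OS/PyTorch combination accepts it,
        # otherwise fall back to float16; use float32 when MPS is unavailable.
        if not torch.backends.mps.is_available():
            return torch.float32
        try:
            torch.zeros(1, dtype=torch.bfloat16, device="mps")
            return torch.bfloat16
        except (RuntimeError, TypeError):
            return torch.float16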

conornash (Contributor, Author)

> Should fix #1103
>
> Also, shouldn't it be possible to set it to bfloat16 too, or offer it as an option? If I remember right, it is supported on MPS starting with PyTorch 2.3.0. I did try it once in Invoke.
>
> OK, apparently bfloat16 only works on M2 or newer, but it would still be nice to have it ;)

Tried using bfloat16 on my M3, but got the following error:

RuntimeError: "arange_mps" not implemented for 'BFloat16'
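A minimal sketch of the kind of workaround adopted later in this thread: build the arange in a dtype the MPS kernel supports and cast afterwards. The names dim and pos stand in for the values used in Flux's rotary-embedding helper (an assumption for illustration; running it requires an Apple Silicon machine).

    import torch

    dim = 64
    pos = torch.zeros(1, 256, device="mps")
    # arange itself runs in float32, which MPS supports ...
    scale = torch.arange(0, dim, 2, dtype=torch.float32, device=pos.device) / dim
    # ... and only the result is cast, avoiding "arange_mps" not implemented for 'BFloat16'
    scale = scale.to(torch.bfloat16)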

conornash (Contributor, Author)

@DenOfEquity is there anything I should do to get this approved?

DenOfEquity (Collaborator)

I guess none of the active collaborators/maintainers can actually test what's going on with MPS. There's also some confusion in the linked issue about whether it works at all, or only with some models. But it doesn't/can't break anything, so if it helps at least sometimes I'm calling it progress.
I'm also curious whether it couldn't be float32 all the time - that works for me, with 100% identical results (sample size of 1).

DenOfEquity merged commit 8bd7e05 into lllyasviel:main on Sep 13, 2024
MigCurto

Models I've tested: Schnell, Dev, and GGUF all work OK with this "fix". Arguably the difference is that GGUF is the same as Dev but with less VRAM usage, while Schnell is just different: it resolves in fewer steps but looks like another thing:
[Screenshot 2024-09-13 at 23 04 16]

NF4 won't work for obvious reasons (bitsandbytes not being ported to Mac).

l0stl0rd commented Sep 14, 2024

For me it seems none of the FP8 checkpoints work; I get:
Trying to convert Float8_e4m3fn to the MPS backend but it does not have support for that dtype.

Even if I select the FP16 T5.
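For context, a minimal sketch (not from the PR) of the failure mode, assuming the FP8 checkpoint stores its weights as float8_e4m3fn: MPS has no float8 kernels, so such tensors have to be up-cast before being moved to the device.

    import torch

    w = torch.randn(4, 4).to(torch.float8_e4m3fn)  # how an FP8 checkpoint stores a weight
    # w.to("mps")                                  # raises: MPS backend does not support Float8_e4m3fn
    w_fp16 = w.to(torch.float16)                   # up-cast on CPU first
    w_mps = w_fp16.to("mps")                       # then move to the MPS device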

The GGUF version I tried did not work either; error: Unsupported type byte size: UInt16

The full FP16 Flux works, but it is horribly slow on my M3 Pro: about 6 minutes for 20 steps.

Also not sure why bfloat16 does not work, even with PyTorch 2.4.1 or nightly.

Update: or maybe it does work, but the code needs to be different.

l0stl0rd commented Sep 14, 2024

Actually, if I do this:

    if pos.device.type == "mps":
        scale = torch.arange(0, dim, 2, dtype=torch.float16, device=pos.device) / dim

and use torch nightly (I still need to recheck 2.4.1), then bfloat16 seems to work, as I get this:
K-Model Created: {'storage_dtype': torch.bfloat16, 'computation_dtype': torch.bfloat16}

It took 5:40 min on Torch 2.6 nightly.

It works with PyTorch 2.4.1 too.

However, it does not really matter, as the FP8 and the Q4_1 GGUF still give the same error.
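For reference, a sketch of where a guard like that would sit, modeled on the reference Flux rope helper; the exact function body in this repository may differ, so treat everything outside the dtype guard as an assumption for illustration.

    import torch

    def rope(pos: torch.Tensor, dim: int, theta: int) -> torch.Tensor:
        # MPS cannot run arange in bfloat16 (or float64), so build the scale in float16 there.
        dtype = torch.float16 if pos.device.type == "mps" else torch.float32
        scale = torch.arange(0, dim, 2, dtype=dtype, device=pos.device) / dim
        omega = 1.0 / (theta ** scale)
        out = pos.unsqueeze(-1) * omega  # (..., n, dim // 2)
        out = torch.stack([torch.cos(out), -torch.sin(out), torch.sin(out), torch.cos(out)], dim=-1)
        return out.reshape(*out.shape[:-1], 2, 2).float()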


Successfully merging this pull request may close this issue: Flux doesn't work on Macbook Pro M1 Max
5 participants