

m1/2 gpu support #101

Closed
stoneLee81 opened this issue Jul 1, 2023 · 5 comments

@stoneLee81

Feature request

It seems that only NVIDIA GPUs are supported; chatglm/chatglm2 can't run in an Apple-silicon environment.

Motivation

No response

Other

No response

@alew3

alew3 commented Jul 1, 2023

+1 for this!

CPU works, but GPU-only models currently look for NVIDIA GPUs, which are not supported on the Mac.

@aarnphm
Member

aarnphm commented Jul 1, 2023

I can enable chatglm on CPU, but there is really no point, since inference is just way too slow on CPU anyway.

@alew3

alew3 commented Jul 2, 2023

@aarnphm Is there any chance of GPU support for the Mac via Metal? TensorFlow and PyTorch already support Metal (though I'm not sure how extensively). Since the Mac's GPU shares unified memory with the CPU, it would be much more viable to run an LLM locally; the only alternative is buying a professional NVIDIA card.
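For context, PyTorch has exposed a Metal (MPS) backend on Apple silicon since v1.12, so code that hard-codes CUDA can often be generalized with a runtime device check. A minimal sketch (the `pick_device` helper is hypothetical, not part of this project), assuming PyTorch 1.12+:

```python
# Minimal device-selection sketch (assumption: PyTorch >= 1.12, which added
# the "mps" backend that runs tensor ops on Apple-silicon GPUs via Metal).
def pick_device() -> str:
    try:
        import torch
    except ImportError:
        return "cpu"  # PyTorch not installed: nothing to accelerate
    # Older PyTorch builds have no torch.backends.mps attribute at all.
    mps = getattr(torch.backends, "mps", None)
    if mps is not None and mps.is_available():
        return "mps"   # Apple-silicon GPU (unified memory, via Metal)
    if torch.cuda.is_available():
        return "cuda"  # NVIDIA GPU
    return "cpu"       # fallback

print(pick_device())
```

Model code would then move tensors with `model.to(pick_device())` instead of assuming `"cuda"`. Note this only dispatches existing ops to Metal; as discussed below in the thread, it does not by itself add optimized kernels for a given model.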

@aarnphm

aarnphm commented Jul 2, 2023

Metal support is orthogonal to this. Metal won't make model inference faster unless we implement the attention layers in bare-metal code.

@alew3
Copy link

alew3 commented Jul 3, 2023

thanks for the feedback, much appreciated!
