
FEAT: Refactor device related code and add initial Intel GPU support #968

Merged
5 commits merged into xorbitsai:main from intel-gpu-support on Feb 21, 2024

Conversation

@notsyncing (Contributor) commented Feb 2, 2024

Refactors most device-related code into device_utils and adds initial Intel GPU support. Only the PyTorch backend is covered: vllm and ctransformers are not supported, and 8-bit/4-bit quantization is also unsupported.

You will need intel-extension-for-pytorch to run it: https://intel.github.io/intel-extension-for-pytorch/xpu/latest/tutorials/installation.html

Tested on Llama2-chat and ChatGlm3.

(device_map support requires huggingface/accelerate#2383)
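
For readers unfamiliar with how Intel GPUs plug into PyTorch, below is a minimal, hypothetical sketch of the kind of helper a device_utils module might expose. The function names (`get_available_device`, `move_model_to_available_device`) are illustrative only and not necessarily what this PR adds; it assumes torch is installed and treats intel-extension-for-pytorch as optional.

```python
# Hypothetical sketch of a device_utils-style helper; names are illustrative.
import torch


def get_available_device() -> str:
    """Pick the best available accelerator, falling back to CPU."""
    if torch.cuda.is_available():
        return "cuda"
    # Intel GPUs show up as the "xpu" device once intel-extension-for-pytorch
    # (ipex) has been imported.
    try:
        import intel_extension_for_pytorch  # noqa: F401

        if hasattr(torch, "xpu") and torch.xpu.is_available():
            return "xpu"
    except ImportError:
        pass
    if hasattr(torch.backends, "mps") and torch.backends.mps.is_available():
        return "mps"
    return "cpu"


def move_model_to_available_device(model: torch.nn.Module) -> torch.nn.Module:
    """Move a model to whichever device get_available_device() selects."""
    return model.to(get_available_device())
```

With a helper like this, callers can write `model = move_model_to_available_device(model)` instead of branching on CUDA/XPU/MPS at every call site.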

@XprobeBot XprobeBot added this to the v0.8.2 milestone Feb 2, 2024
@notsyncing force-pushed the intel-gpu-support branch 2 times, most recently from 19332e9 to f7b70f9 (February 2, 2024 07:20)
@XprobeBot XprobeBot modified the milestones: v0.8.2, v0.8.4 Feb 2, 2024
@aresnow1 (Contributor) commented Feb 2, 2024

Wow, impressive work!

@aresnow1 (Contributor) commented Feb 2, 2024

Lint failed. We use pre-commit (https://pre-commit.com/) to lint before committing; you can install it locally and commit again.
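
For reference, the usual pre-commit workflow looks like this (standard commands from the pre-commit docs; the hooks that actually run are defined by the repository's .pre-commit-config.yaml):

```bash
pip install pre-commit
pre-commit install          # set up the git hooks for this clone
pre-commit run --all-files  # run every configured hook once
git commit -am "fix lint"   # later commits are checked automatically
```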

@notsyncing (Contributor, Author) commented:
@aresnow1 All lint errors are fixed, and it has been tested again on ChatGlm3.

@qinxuye (Contributor) commented Feb 5, 2024

I fixed the lint issues for you. Besides flake8, we use black to format the code and isort to sort the imports.

@aresnow1 (Contributor) left a review comment

LGTM overall, some details need to be confirmed.

Review threads: xinference/device_utils.py (outdated, resolved), xinference/model/llm/pytorch/core.py (resolved)
@notsyncing force-pushed the intel-gpu-support branch 3 times, most recently from 3a9ceb9 to 88f2214 (February 5, 2024 12:08)
@XprobeBot XprobeBot modified the milestones: v0.8.5, v0.9.0 Feb 6, 2024
@notsyncing force-pushed the intel-gpu-support branch 2 times, most recently from 88f0935 to 7dafcf1 (February 12, 2024 14:11)
@aresnow1 (Contributor) commented:
Hi, could you rebase main branch and push again?

@notsyncing (Contributor, Author) commented:

> Hi, could you rebase main branch and push again?

Rebased and pushed.

@aresnow1 merged commit 9d81afc into xorbitsai:main on Feb 21, 2024 (9 of 12 checks passed)
@aresnow1 (Contributor) commented:
Looks good to me, thanks for your contribution.
