Add Intel Arc iGPU support (Meteor Lake) #1859

Open
sulliwane opened this issue May 5, 2024 · 0 comments

Feature request

After reading the blog post https://huggingface.co/blog/phi2-intel-meteor-lake, I followed its instructions. My goal is to run Phi-2 100% locally.

I successfully ran the Python script that applies 4-bit quantization to Microsoft's 2.7-billion-parameter Phi-2 model; it produced a 1.7 GB openvino_model.bin model file.

My plan was to run the Hugging Face text-generation-inference (TGI) service locally, run Hugging Face chat-ui locally as well, and point chat-ui at TGI on 127.0.0.1:8080/generate_stream.

But when I try to start TGI on my Intel® Core™ Ultra 7 155H with the command below, Docker fails with: could not select device driver "" with capabilities: [[gpu]].

docker run --rm --network none --gpus all -p 8080:8080 -v /home/victor/llm-ai/huggingface/phi-2-openvino:/data --pull always ghcr.io/huggingface/text-generation-inference:latest --model-id /data/openvino_model.bin

Status: Image is up to date for ghcr.io/huggingface/text-generation-inference:latest
docker: Error response from daemon: could not select device driver "" with capabilities: [[gpu]].

I guess the TGI Docker image is missing support for the Intel Arc iGPU?

If that's the case, how did this guy shoot this video?
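
A note on the error itself, in case it helps triage: `--gpus all` asks Docker for a GPU device driver, which in practice is provided by the NVIDIA Container Toolkit, so the request fails on a machine that only has an Intel GPU. Intel GPUs are normally exposed to a container by passing `/dev/dri` through with `--device`. The sketch below rewrites the command from this issue that way. The helper names `find_render_nodes` and `swap_gpu_flag` are hypothetical, and the sketch only changes the flag: whether the TGI image contains a backend that can use the device (or load an OpenVINO model) is exactly the open question here.

```python
import glob
import shlex


def find_render_nodes(pattern="/dev/dri/renderD*"):
    """Return the DRM render nodes visible on this host (empty list if none)."""
    return sorted(glob.glob(pattern))


def swap_gpu_flag(args, device="/dev/dri"):
    """Replace `--gpus <value>` (or `--gpus=<value>`) with `--device <device>`.

    Assumption: passing the device through is only a prerequisite; it does not
    add Intel GPU support to the software inside the image.
    """
    out = []
    i = 0
    while i < len(args):
        if args[i] == "--gpus":
            out += ["--device", device]
            i += 2  # skip the flag and its separate value
        elif args[i].startswith("--gpus="):
            out += ["--device", device]
            i += 1
        else:
            out.append(args[i])
            i += 1
    return out


if __name__ == "__main__":
    # The command reported in this issue, unchanged apart from the GPU flag.
    original = (
        "docker run --rm --network none --gpus all -p 8080:8080 "
        "-v /home/victor/llm-ai/huggingface/phi-2-openvino:/data --pull always "
        "ghcr.io/huggingface/text-generation-inference:latest "
        "--model-id /data/openvino_model.bin"
    )
    print("render nodes on this host:", find_render_nodes())
    print(shlex.join(swap_gpu_flag(shlex.split(original))))
```

If `find_render_nodes()` returns an empty list, the host itself does not expose the iGPU through a DRM render node, and no container flag will help until the host driver is set up.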

Motivation

Leverage the Intel Arc iGPU on Meteor Lake for local inference with TGI.

Your contribution

Happy to test any proposed solution on my hardware.
