Fix GPU usage by removing device from the Transformers class wrapper to use the device/device_map directly exposed by HF Transformers in kwargs
#569
## The issue

Using the `device` parameter already provided by the `Transformers` wrapper doesn't work as intended: it first loads the model (by default on CPU) and only then pushes the model to the `device`. This is inefficient and hasn't been working at all on my end.

Example code snippet for the above:
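A minimal sketch of the old behavior, assuming the wrapper loads the model via `AutoModelForCausalLM.from_pretrained` and only then moves it with `.to()` (the model name is illustrative):

```python
from transformers import AutoModelForCausalLM

# What the wrapper's `device` parameter effectively did: the full model is
# materialized on CPU first, then every tensor is copied over to the GPU.
model = AutoModelForCausalLM.from_pretrained("gpt2")  # loaded on CPU
model = model.to("cuda:0")  # pushed to the device only afterwards
```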
## The fix

Removing the `device` parameter altogether fixes this and removes the confusion of having a `device` parameter that isn't actually used by HF Transformers directly but only by the wrapper itself. It also doesn't affect anything else, since `self.device = self.model_obj.device` already reads the device from the model directly, and the tests still pass.

It is now only necessary to pass `device_map` from HF Transformers directly through `kwargs`, and the model is loaded correctly.

Example code snippet for the above:
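A minimal sketch of the fixed usage, assuming the wrapper simply forwards `kwargs` to `from_pretrained` (`device_map` requires `accelerate`; the model name is illustrative):

```python
from transformers import AutoModelForCausalLM

# `device_map` is handled by HF Transformers itself, so the weights are
# placed on the target device(s) at load time, with no post-hoc .to() copy.
model = AutoModelForCausalLM.from_pretrained("gpt2", device_map="auto")
print(model.device)  # the wrapper now reads its device from the model
```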