Support passing model parameters in pipelines #500

Closed
davidmezzetti opened this issue Jul 8, 2023 · 0 comments
Currently, all Hugging Face-based pipelines share the following constructor signature.

def __init__(self, path=None, quantize=False, gpu=True, model=None):

This allows setting the model path and target GPU device but prevents many necessary parameters from being passed to the underlying models. For example, many newly released LLMs require the trust_remote_code parameter to be set. Running 4-bit and 8-bit quantization is another example.

The current workaround is to create the models with the Hugging Face auto model classes and pass them to the pipelines. This works in Python but not with application YAML.
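To illustrate the limitation: a YAML configuration can only carry scalar values such as strings and booleans, so there is no way to express a model object constructed in Python. A hypothetical configuration (field names below are illustrative, not txtai's actual schema) would need to pass everything as plain values:

```yaml
# Illustrative pipeline configuration: only scalar values can be
# expressed here, not a model instance built with the auto model classes
llm:
  path: example/model
  trust_remote_code: true
  torch_dtype: torch.float16
```

Note that torch_dtype above must be written as a string, which is exactly why string-to-object resolution is needed.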

This issue will do the following:

  • Add a kwargs parameter to all constructors and pass those parameters through to the underlying pipeline
  • Resolve arguments from strings as necessary (for example, torch_dtype)
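The two steps above can be sketched as follows. This is a minimal illustration, not txtai's actual implementation: the modelargs attribute and the generic resolve helper are hypothetical names, and the dotted-string lookup stands in for whatever resolution logic the pipeline ultimately uses.

```python
import importlib

def resolve(value):
    """Resolve a dotted string such as "torch.float16" to the object it names.

    Non-string values and unresolvable strings are returned unchanged.
    """
    if isinstance(value, str) and "." in value:
        module, _, attr = value.rpartition(".")
        try:
            return getattr(importlib.import_module(module), attr)
        except (ImportError, AttributeError):
            return value
    return value

class Pipeline:
    def __init__(self, path=None, quantize=False, gpu=True, model=None, **kwargs):
        self.path = path
        # Pass any extra keyword arguments through to the underlying model,
        # resolving string-encoded values (e.g. torch_dtype="torch.float16")
        self.modelargs = {key: resolve(value) for key, value in kwargs.items()}

# Extra parameters now flow through the constructor; "math.pi" stands in
# for a value like "torch.float16" so the example runs without torch
pipeline = Pipeline(path="example/model", trust_remote_code=True, torch_dtype="math.pi")
```

This keeps the existing constructor arguments intact while letting YAML-supplied scalars reach the model as real Python objects.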
@davidmezzetti davidmezzetti added this to the v5.6.0 milestone Jul 8, 2023
@davidmezzetti davidmezzetti self-assigned this Jul 8, 2023