Pipeline

Introduction
Examples

2.1. Pipeline Inference for INT8 Model

2.2. Pipeline Inference for Executor Backend

Introduction

The pipeline is inherited from huggingface/transformers pipeline, it is simple to use any model from Hub for inference on any language, computer vision, speech, and multimodal tasks. Two features for int8 model inference and model inference on executor backend have been added to the extension.

Examples

Pipeline Inference for INT8 Model

Initialize a pipeline instance with a model name and specific task.

from intel_extension_for_transformers.optimization.pipeline import pipeline
text_classifier = pipeline(
    task="text-classification",
    model="Intel/distilbert-base-uncased-finetuned-sst-2-english-int8-static",
    framework="pt",
    device=torch.device("cpu"),
)

Pass your input text to the pipeline instance for inference.

outputs = text_classifier("This is great !")
# output: [{'label': 1, 'score': 0.9998425245285034}]

Pipeline Inference for Executor Backend

For executor, we only accept ONNX model now for pipeline. Users can get ONNX model from PyTorch model with our existing API. Right now, pipeline for executor only supports text-classification task.

Initialize a pipeline instance with an ONNX model, model config, model tokenizer and specific backend. The MODEL_NAME is the pytorch model name you used for exporting the ONNX model.

from intel_extension_for_transformers.optimization.pipeline import pipeline
from transformers import AutoConfig, AutoTokenizer

config = AutoConfig.from_pretrained(MODEL_NAME)
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
text_classifier = pipeline(
    task="text-classification",
    config=config,
    tokenizer=tokenizer,
    model='fp32.onnx',
    model_kwargs={'backend': "executor"},
)

Pass your input text to the pipeline instance for inference.

outputs = text_classifier(
    "But believe it or not , it 's one of the most "
    "beautiful , evocative works I 've seen ."
)
# output: [{'label': 'POSITIVE', 'score': 0.9998886585235596}]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pipeline.md

pipeline.md

Pipeline

Introduction

Examples

Pipeline Inference for INT8 Model

Pipeline Inference for Executor Backend

Files

pipeline.md

Latest commit

History

pipeline.md

File metadata and controls

Pipeline

Introduction

Examples

Pipeline Inference for INT8 Model

Pipeline Inference for Executor Backend