# Summarize CLI - Notebook

If you prefer working in a notebook rather than with a command-line tool, this is for you! Each step (deployment, querying, removing resources) runs in its own cell below. As always, remember to run the final cell, which deletes your resources, to avoid being charged for an endpoint you are no longer using.

In [1]:
from sagemaker.huggingface import HuggingFaceModel
import sagemaker

role = sagemaker.get_execution_role()
# Hub Model configuration. https://huggingface.co/models
hub = {"HF_MODEL_ID": "google/pegasus-xsum", "HF_TASK": "summarization"}

# create Hugging Face Model Class
huggingface_model = HuggingFaceModel(
    transformers_version="4.17.0",
    pytorch_version="1.10.2",
    py_version="py38",
    env=hub,
    role=role,
)

## Deploy model to SageMaker

In [2]:
print("deploying model to SageMaker. this may take a few minutes...")
# deploy model to SageMaker Inference
predictor = huggingface_model.deploy(
    initial_instance_count=1,  # number of instances
    instance_type="ml.m5.xlarge",  # ec2 instance type
)

deploying model to SageMaker. this may take a few minutes...
-----!

## Query for Inference

In [3]:
text = "replace this with the text you want to summarize."

predictor.predict({"inputs": text})[0].get("summary_text")

'If you have a question about this or any other article on this website, please use the form below.'
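The query cell indexes the response with `[0].get("summary_text")`, which implies the endpoint returns a non-empty list of dictionaries, each carrying a `summary_text` key. As a minimal sketch, a small helper can make that assumption explicit and fail with a clearer message when the response has a different shape. The name `extract_summary` is hypothetical and not part of the SageMaker SDK.

```python
def extract_summary(response):
    """Return the first summary string from a summarization response.

    Assumes the response looks like [{"summary_text": "..."}], the shape
    implied by the query cell. `extract_summary` is a hypothetical helper,
    not part of the SageMaker SDK.
    """
    # The query cell indexes [0], so an empty or non-list response is an error.
    if not isinstance(response, list) or not response:
        raise ValueError(f"expected a non-empty list, got {response!r}")
    first = response[0]
    # Each item is expected to be a dict holding the generated summary.
    if not isinstance(first, dict) or "summary_text" not in first:
        raise ValueError(f"missing 'summary_text' in {first!r}")
    return first["summary_text"]
```

With a live endpoint, this would be called as `extract_summary(predictor.predict({"inputs": text}))`.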

## Delete Resources

In [5]:
# delete model
predictor.delete_model()

# delete endpoint - also deletes the endpoint configuration
predictor.delete_endpoint()

Exception: One or more models cannot be deleted, please retry. 
Failed models: huggingface-pytorch-inference-2023-02-26-22-17-21-612
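Because both deletion calls share one cell, an exception raised by `delete_model()`, like the one shown here, stops the cell before `delete_endpoint()` runs, so the endpoint may still be up afterwards. One possible guard, sketched here under the assumption that each step can be attempted independently, is to run every cleanup step and collect the failures instead of stopping at the first one. The helper name `cleanup` is hypothetical; only `delete_model` and `delete_endpoint` come from the notebook.

```python
def cleanup(predictor):
    """Attempt every cleanup step and return (step name, exception) pairs for failures.

    `cleanup` is a hypothetical helper. It only assumes that `predictor`
    exposes the `delete_model` and `delete_endpoint` methods used in the
    notebook cell.
    """
    failures = []
    # Same order as the notebook cell: model first, then endpoint.
    for step_name in ("delete_model", "delete_endpoint"):
        try:
            getattr(predictor, step_name)()
        except Exception as exc:
            # Record the failure and keep going so later steps still run.
            failures.append((step_name, exc))
    return failures
```

`cleanup(predictor)` returns an empty list when both steps succeed. Any entries it does return name the steps worth retrying or checking by hand in the AWS console.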