In [1]:
!pip install transformers sentencepiece




In [2]:
from transformers import T5Tokenizer, T5ForConditionalGeneration

MODEL_NAME = "t5-small"

# Load the pre-trained T5 model and tokenizer once, so that repeated calls
# to summarize_text() do not reload the weights.
model = T5ForConditionalGeneration.from_pretrained(MODEL_NAME)
tokenizer = T5Tokenizer.from_pretrained(MODEL_NAME)

def summarize_text(text, prompt, max_length=150):
    """
    Summarizes the given text with a pre-trained T5 model, using the prompt as a task prefix.

    Args:
        text (str): The input text to summarize.
        prompt (str): The prefix prepended to the text to guide the summarization.
        max_length (int): The maximum length of the generated summary, in tokens.

    Returns:
        str: The generated summary.
    """
    # Prepend the prompt to the text, separated by ": "
    input_text = prompt + ": " + text
    # Tokenize, truncating the input to at most 512 tokens
    inputs = tokenizer(input_text, return_tensors="pt", max_length=512, truncation=True)

    # Generate the summary with beam search (5 beams)
    outputs = model.generate(**inputs, max_length=max_length, num_beams=5, early_stopping=True)

    # Decode the generated tokens to text
    summary = tokenizer.decode(outputs[0], skip_special_tokens=True)

    return summary

# Example usage with different prompts
text = """
Artificial intelligence (AI) is intelligence demonstrated by machines, in contrast to the natural intelligence displayed by humans and animals. Leading AI textbooks define the field as the study of intelligent agents, any system that perceives its environment and takes actions that maximize its chance of achieving its goals.
"""

# Experiment with different prompts
prompt_1 = "summarize"
prompt_2 = "briefly summarize in one sentence"
prompt_3 = "generate a concise summary"

# Generate a summary with each prompt
summary_1 = summarize_text(text, prompt_1)
summary_2 = summarize_text(text, prompt_2)
summary_3 = summarize_text(text, prompt_3)

print("Summary 1:", summary_1)
print("Summary 2:", summary_2)
print("Summary 3:", summary_3)


Summary 1: leading AI textbooks define the field as the study of intelligent agents, any system that perceives its environment and takes actions that maximize its chance of achieving its goals.
Summary 2: (AI) is intelligence demonstrated by machines, in contrast to natural intelligence displayed by humans and animals. leading AI textbooks define the field as the study of intelligent agents, any system that perceives its environment.
Summary 3: : Artificial intelligence (AI) is intelligence demonstrated by machines, in contrast to the natural intelligence displayed by humans and animals. Leading AI textbooks define the field as the study of intelligent agents, any system that perceives its environment and takes actions that maximize its chance of achieving its goals.
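
The three outputs above differ mostly in how much of the input they keep, so one rough way to compare prompts is to measure how much each summary shortens the source. The cell below is a minimal sketch of that idea. The helper names `compression_ratio` and `report` are hypothetical (they are not part of the notebook or of `transformers`), and a word-count ratio is only a crude proxy for summary quality, not an established metric.

```python
def compression_ratio(source: str, summary: str) -> float:
    """Return the summary's word count divided by the source's word count.

    A value near 1.0 suggests the model mostly copied the input;
    smaller values mean a shorter summary.
    """
    source_words = source.split()
    if not source_words:
        raise ValueError("source text is empty")
    return len(summary.split()) / len(source_words)


def report(source: str, summaries: dict) -> list:
    """Build (prompt, summary word count, rounded ratio) rows, one per prompt."""
    rows = []
    for prompt, summary in summaries.items():
        ratio = round(compression_ratio(source, summary), 2)
        rows.append((prompt, len(summary.split()), ratio))
    return rows


if __name__ == "__main__":
    # Toy strings for illustration only; these are not model outputs.
    toy_source = "one two three four five six seven eight"
    toy_summaries = {"short prompt": "one two", "long prompt": "one two three four"}
    for row in report(toy_source, toy_summaries):
        print(row)
```

In the notebook, this could be called as `report(text, {prompt_1: summary_1, prompt_2: summary_2, prompt_3: summary_3})` to put the three prompts side by side.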
