# Large Language Models (LLMs)

This example uses a pre-trained sequence-to-sequence model for text generation, specifically translation, with the Hugging Face Transformers library in Python. It loads Flan-T5, an instruction-tuned encoder-decoder language model, and generates output from an input prompt.

This snippet shows how LLMs can be deployed for practical tasks. It leverages transfer learning: a model pre-trained on massive datasets (and, in Flan-T5's case, already instruction-tuned) is used as-is with minimal code, with no further fine-tuning in this snippet. For production, one might enable sampling and set parameters such as temperature to control randomness in generation (temperature only takes effect when sampling is enabled), or batch inputs for efficiency.

In [2]:
import torch
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

# Loading tokenizer and model (PyTorch backend).
model_name = "google/flan-t5-small"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

# Preparing input.
input_text = "Translate to German: I am hungry!"
inputs = tokenizer(input_text, return_tensors="pt")

# Generating output.
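# Passing **inputs forwards the attention mask along with the input IDs;
# max_new_tokens caps the number of generated tokens.
outputs = model.generate(**inputs, max_new_tokens=50)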
generated_text = tokenizer.decode(outputs[0], skip_special_tokens=True)

print("Generated Output:", generated_text)

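Generated Output: Ich bin friedlich!

The output is fluent German but not a correct translation: "Ich bin friedlich!" means "I am peaceful!", while "I am hungry!" would be "Ich bin hungrig!" or "Ich habe Hunger!". Errors like this are common with the smallest checkpoint; the larger Flan-T5 variants (for example `google/flan-t5-base`) may translate more reliably.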
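
The production options mentioned earlier (temperature and batched inputs) could look roughly like the sketch below. The helper names `generate_batch` and `demo`, the second prompt, and the default values (`temperature=0.7`, `max_new_tokens=50`) are illustrative assumptions, not part of the original notebook. The sketch assumes the same `google/flan-t5-small` checkpoint and the PyTorch backend.

```python
from typing import Any, List, Sequence


def generate_batch(
    prompts: Sequence[str],
    tokenizer: Any,
    model: Any,
    *,
    sample: bool = False,
    temperature: float = 0.7,
    max_new_tokens: int = 50,
) -> List[str]:
    """Generate one output per prompt in a single batched generate() call.

    Hypothetical helper for illustration; defaults are assumptions.
    """
    # padding=True pads shorter prompts so the batch forms one rectangular
    # tensor; the returned attention mask marks which positions are padding.
    inputs = tokenizer(list(prompts), return_tensors="pt", padding=True)

    gen_kwargs = {"max_new_tokens": max_new_tokens}
    if sample:
        # temperature only influences the result when sampling is enabled;
        # without do_sample=True, decoding is deterministic.
        gen_kwargs.update(do_sample=True, temperature=temperature)

    outputs = model.generate(**inputs, **gen_kwargs)
    return tokenizer.batch_decode(outputs, skip_special_tokens=True)


def demo() -> None:
    """Download the checkpoint and run greedy and sampled generation."""
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    model_name = "google/flan-t5-small"
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

    prompts = [
        "Translate to German: I am hungry!",
        "Translate to German: Good morning.",  # assumed extra prompt
    ]

    # Deterministic (greedy) decoding for the whole batch.
    for prompt, text in zip(prompts, generate_batch(prompts, tokenizer, model)):
        print(prompt, "->", text)

    # Sampled decoding; results vary from run to run.
    sampled = generate_batch(prompts, tokenizer, model, sample=True, temperature=0.7)
    for prompt, text in zip(prompts, sampled):
        print(prompt, "->", text)
```

Calling `demo()` downloads the checkpoint on first use and prints one line per prompt for each decoding mode. Keeping the tokenizer and model as parameters of `generate_batch` means they are loaded once and reused across batches, which is usually what matters most for throughput.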
