# Using Azure Endpoints within LlamaIndex

Install llama-index and the Azure OpenAI integration packages if you haven't already, or if you are running this notebook in Google Colab.

In [None]:
%pip install -q llama-index
%pip install -q llama-index-embeddings-azure-openai
%pip install -q llama-index-llms-azure-openai

In [1]:
import os

from dotenv import load_dotenv
from llama_index.core import Settings
from llama_index.core.llms import ChatMessage
from llama_index.llms.azure_openai import AzureOpenAI

In [2]:
# Load environment variables from .env file
load_dotenv()

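# Underlying model name and the name of its Azure deployment
llm_model_name = "gpt-5-nano"
llm_deploy_name = "gpt-5-nano"

# The API key is read from the environment (populated from .env above);
# the endpoint and API version are specific to this Azure resource
aoai_api_key = os.getenv("OPENAI_API_KEY")
aoai_endpoint = "https://aa-dsa-training-msca.openai.azure.com/"
aoai_api_version = "2024-12-01-preview"

# `engine` is the Azure deployment name; `model` is the underlying model name
llm = AzureOpenAI(
    engine=llm_deploy_name,
    model=llm_model_name,
    api_key=aoai_api_key,
    azure_endpoint=aoai_endpoint,
    api_version=aoai_api_version,
)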

Settings.llm = llm
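
`os.getenv` returns `None` when the variable is missing, so a forgotten `.env` entry tends to surface only later as an authentication error. A minimal sketch of an early check is shown below. `load_azure_config` is a hypothetical helper, not part of LlamaIndex; it simply gathers the same values used in the cell above and raises if `OPENAI_API_KEY` is not set.

```python
import os


def load_azure_config(env=os.environ):
    """Hypothetical helper: gather the Azure settings and fail early on a missing key."""
    api_key = env.get("OPENAI_API_KEY")
    if not api_key:
        raise RuntimeError(
            "OPENAI_API_KEY is not set; add it to your .env file or environment."
        )
    # Same values as in the cell above, keyed by the AzureOpenAI argument names
    return {
        "engine": "gpt-5-nano",
        "model": "gpt-5-nano",
        "api_key": api_key,
        "azure_endpoint": "https://aa-dsa-training-msca.openai.azure.com/",
        "api_version": "2024-12-01-preview",
    }
```

Under these assumptions, the client could then be built with `AzureOpenAI(**load_azure_config())` after `load_dotenv()` has run.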

In [3]:
resp = llm.complete("What is the age of the universe?")
print(resp)

About 13.8 billion years.

This comes from the standard cosmological model (ΛCDM) using Planck satellite data on the cosmic microwave background and the universe’s expansion rate. The Planck-based age is roughly 13.8 billion years with an uncertainty of a few tens of millions of years (about 0.02–0.03 billion years). Different measurement methods can shift the central value slightly, but they all point to roughly 13.8 billion years.


In [4]:
messages = [
    ChatMessage(role="system", content="You are a pirate with a colorful personality"),
    ChatMessage(role="user", content="What is your name"),
]
resp = llm.chat(messages)
print(resp)

assistant: Arrr, ye little scallywag! I be Cap'n Redhook, scourge of the seven seas and master of me own mischief. Cap’n o’ this fine vessel, at yer service. What be yer name, matey?
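
The `assistant:` prefix above appears because `print(resp)` renders the whole chat response, role included. A small sketch of pulling out only the reply text follows, assuming the response exposes `message.content` as LlamaIndex's `ChatResponse` does. `reply_text` is a hypothetical helper, and the `SimpleNamespace` object is only a stand-in with the same shape so the sketch runs without an Azure call.

```python
from types import SimpleNamespace


def reply_text(chat_response):
    """Hypothetical helper: return only the assistant's text, without the role prefix."""
    return chat_response.message.content


# Stand-in object mimicking the shape of a chat response (assumption for illustration)
fake = SimpleNamespace(
    message=SimpleNamespace(role="assistant", content="Arrr, I be Cap'n Redhook.")
)
print(reply_text(fake))
```

With the real client, `reply_text(llm.chat(messages))` would be the equivalent call.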
