<a href="https://colab.research.google.com/github/run-llama/llama_index/blob/main/docs/examples/output_parsing/guidance_pydantic_program.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

# Guidance Pydantic Program

Generate structured data with [**guidance**](https://github.com/microsoft/guidance) via LlamaIndex.  


With guidance, you can guarantee the output structure is correct by *forcing* the LLM to output desired tokens.  
This is especially helpful when you are using a lower-capacity model (e.g. the current open source models), which would otherwise struggle to generate valid output that fits the desired output schema.
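The idea of forcing structure can be illustrated with a toy sketch. This is not how guidance is implemented; it only shows the principle that the program writes every structural token (braces, keys, quotes, commas) and the model is asked for values alone. The function `toy_model` is a hypothetical stand-in for an LLM call, and `constrained_generate` is a name made up for this example.

```python
import json


def toy_model(field: str) -> str:
    """Hypothetical stand-in for an LLM: returns free text for one field."""
    canned = {"name": "The Shining", "artist": 'Jack "Johnny" Torrance'}
    return canned[field]


def constrained_generate(fields):
    """Build a JSON object where only the values come from the model."""
    parts = []
    for field in fields:
        value = toy_model(field)
        # json.dumps escapes the value, so the model's text cannot break
        # the surrounding structure even if it contains quotes.
        parts.append(json.dumps(field) + ": " + json.dumps(value))
    # Braces and separators are emitted by the program, never by the model.
    return "{" + ", ".join(parts) + "}"


text = constrained_generate(["name", "artist"])
parsed = json.loads(text)  # always parses, whatever the model returned
print(parsed["artist"])
```

Because the structure never depends on the model, the result parses even when a value contains characters such as quotes that would otherwise corrupt hand-written JSON.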

If you're opening this notebook on Colab, you will probably need to install LlamaIndex 🦙.

In [None]:
%pip install llama-index-program-guidance

In [None]:
!pip install llama-index

In [None]:
from pydantic import BaseModel
from typing import List
from guidance.llms import OpenAI

from llama_index.program.guidance import GuidancePydanticProgram

Define output schema

In [None]:
class Song(BaseModel):
    title: str
    length_seconds: int


class Album(BaseModel):
    name: str
    artist: str
    songs: List[Song]
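To see what this schema enforces, the following sketch (not part of the original notebook) lists the fields Pydantic derives from `Album` and checks that data with a wrong type is rejected. It assumes only that `pydantic` is installed. The `getattr` fallback is there because the schema method is named `model_json_schema` in Pydantic v2 and `schema` in v1; the sample values are made up.

```python
from typing import List

from pydantic import BaseModel, ValidationError


class Song(BaseModel):
    title: str
    length_seconds: int


class Album(BaseModel):
    name: str
    artist: str
    songs: List[Song]


# Pydantic v2 exposes model_json_schema(); v1 exposes schema().
schema_fn = getattr(Album, "model_json_schema", None) or Album.schema
schema = schema_fn()
field_names = sorted(schema["properties"])
print(field_names)

# A non-numeric song length does not fit the schema and is rejected.
try:
    Album(
        name="Demo",
        artist="Nobody",
        songs=[{"title": "Intro", "length_seconds": "not a number"}],
    )
    rejected = False
except ValidationError:
    rejected = True
print(rejected)
```

A model that emits free-form text can easily produce this kind of mismatch, which is the failure mode that constrained generation is meant to avoid.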

Define guidance pydantic program

In [None]:
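program = GuidancePydanticProgram(
    # Pydantic class that the generated output is parsed into
    output_cls=Album,
    # {{movie_name}} is a template variable, filled in from the keyword
    # argument passed when the program is called
    prompt_template_str=(
        "Generate an example album, with an artist and a list of songs. Using"
        " the movie {{movie_name}} as inspiration"
    ),
    guidance_llm=OpenAI("text-davinci-003"),
    verbose=True,
)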

Run program to get structured output.  
Text highlighted in blue marks the variables we specified; text highlighted in green is generated by the LLM.

In [None]:
output = program(movie_name="The Shining")

The output is a valid Pydantic object that we can then use to call functions/APIs. 

In [None]:
output

Album(name='The Shining', artist='Jack Torrance', songs=[Song(title='All Work and No Play', length_seconds=180), Song(title='The Overlook Hotel', length_seconds=240), Song(title='The Shining', length_seconds=210)])
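As a hedged illustration of using the object downstream, the sketch below rebuilds the `Album` shown above by hand, so it runs without an LLM call, and passes it to ordinary Python code. The helper `total_length_seconds` and the shape of the JSON payload are hypothetical, chosen only for this example.

```python
import json
from typing import List

from pydantic import BaseModel


class Song(BaseModel):
    title: str
    length_seconds: int


class Album(BaseModel):
    name: str
    artist: str
    songs: List[Song]


# Same values as the output printed above, constructed manually.
output = Album(
    name="The Shining",
    artist="Jack Torrance",
    songs=[
        Song(title="All Work and No Play", length_seconds=180),
        Song(title="The Overlook Hotel", length_seconds=240),
        Song(title="The Shining", length_seconds=210),
    ],
)


def total_length_seconds(album: Album) -> int:
    """Sum the lengths of all songs on the album."""
    return sum(song.length_seconds for song in album.songs)


total = total_length_seconds(output)

# Typed attribute access makes it easy to build a request body for an API.
payload = json.dumps(
    {
        "album": output.name,
        "artist": output.artist,
        "track_titles": [song.title for song in output.songs],
        "total_seconds": total,
    }
)
print(payload)
```

Since every field has already been validated against the schema, downstream code can rely on `length_seconds` being an integer without extra checks.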