<h1>Generate and Improve Image Descriptions</h1>

In this notebook we prompt a model to describe our images, and then ask the model to improve the descriptions (a few-shot prompt).

This involves using images as scenarios for survey questions, and piping the answer to one question into another question.
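The piping idea can be illustrated without any library: the second question's text contains a placeholder that is filled with the first question's answer before the prompt is sent. The sketch below is a plain-Python stand-in, not EDSL's actual template renderer; the `render` helper and the sample answer string are hypothetical and exist only for illustration.

```python
import re

def render(template: str, values: dict) -> str:
    """Fill `{{ name }}` placeholders from `values` (illustrative only, not EDSL's renderer)."""
    def lookup(match):
        key = match.group(1)
        # Leave unknown placeholders untouched so missing inputs are easy to spot.
        return str(values.get(key, match.group(0)))
    return re.sub(r"\{\{\s*([\w.]+)\s*\}\}", lookup, template)

# Step 1: the first question's answer (a stand-in string, not a real model reply).
first_answer = "A silver hard-shell suitcase with vertical ridges."

# Step 2: the follow-up question refers to that answer by name.
followup = render(
    "Description: {{ describe.answer }}",
    {"describe.answer": first_answer},
)
print(followup)
```

In the notebook itself this substitution is handled by the survey: the placeholder `{{ describe.answer }}` in the second question refers to the question named `describe`.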

In [12]:
from edsl import QuestionFreeText

q1 = QuestionFreeText(
    question_name = "describe",
    question_text = """
    I have an image of an object that I would like you to describe in detail. 
    Based on the image provided, please write a description that includes:

    1. The object's appearance, including color, shape, and material.
    2. Any notable design features or patterns.
    3. The object's functionality or potential use.
    4. The overall impression or aesthetic of the object.
    
    Here is the image: {{ image }}. Please make your description as precise and vivid as possible.
    """
)

In [13]:
from edsl import QuestionFreeText

q2 = QuestionFreeText(
    question_name = "improvements",
    question_text = """
    Evaluate the description of this image on a scale from 1 to 10.
    Explain the reasoning behind your evaluation.
    Image: {{ image }}
    Description: {{ describe.answer }}
    """
)

In [14]:
# All images are stored/cached on the Expected Parrot service.

IMAGE_UUIDS = {
    'cabin_luggage': ['4f41ad44-6472-4403-8e59-0fb0e5eca575', 'e30bbefa-b135-4a46-bf32-ed9437be8733', 'a56079a4-1f50-4019-9344-947101bd3e4b'],
    'packing_cubes': ['7c046f64-1209-447d-a68a-b984f3508e0b', '82ffd7a6-87dc-4365-9c9f-2e76263908be', '851dedbe-3b32-45ca-8c32-4ddc510f7327'],
    'water_bottle': ['2a81aec9-c164-4d29-9281-360547f04ac4', 'd34d3ec1-073d-45b9-baca-a0d3411e9e0c', 'ff0ab897-50b8-458e-bfa5-7550b8a48648']
}
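Because each pull is a network call, it can be worth checking that every identifier is well formed before fetching anything. The following is a minimal sketch using only the standard library; `find_malformed` is a hypothetical helper, and `sample_uuids` copies two of the identifiers above verbatim.

```python
import uuid

# A subset of the identifiers listed above, copied verbatim.
sample_uuids = {
    "cabin_luggage": ["4f41ad44-6472-4403-8e59-0fb0e5eca575"],
    "water_bottle": ["2a81aec9-c164-4d29-9281-360547f04ac4"],
}

def find_malformed(image_uuids):
    """Return (product, value) pairs whose value is not a canonical UUID string."""
    bad = []
    for product, values in image_uuids.items():
        for value in values:
            try:
                # Round-trip through uuid.UUID to confirm the canonical hyphenated form.
                ok = str(uuid.UUID(value)) == value.lower()
            except ValueError:
                ok = False
            if not ok:
                bad.append((product, value))
    return bad

malformed = find_malformed(sample_uuids)
print(malformed)
```

An empty list means every string parsed as a UUID; it does not confirm that the corresponding file exists on the service.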

In [15]:
%%time

from edsl import Scenario, ScenarioList, FileStore

def prefetch_images(image_uuids):
    """Pull every image once and return {product: {index: FileStore}}."""
    return {
        product: {
            index: FileStore.pull(uuid) for index, uuid in enumerate(uuids)
        }
        for product, uuids in image_uuids.items()
    }

pre_fetched_images = prefetch_images(IMAGE_UUIDS)

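# One scenario per image; labels are 1-based, e.g. "cabin_luggage_1".
# Iterating over the fetched images avoids hard-coding three images per product.
sl = ScenarioList([
    Scenario({
        "question_name": f"{product}_{i + 1}",
        "image": image
    })
    for product, images in pre_fetched_images.items()
    for i, image in images.items()
])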

sl

CPU times: user 256 ms, sys: 228 ms, total: 484 ms
Wall time: 12.3 s


Unnamed: 0,question_name,image
0,cabin_luggage_1,FileStore: self.path
1,cabin_luggage_2,FileStore: self.path
2,cabin_luggage_3,FileStore: self.path
3,packing_cubes_1,FileStore: self.path
4,packing_cubes_2,FileStore: self.path
5,packing_cubes_3,FileStore: self.path
6,water_bottle_1,FileStore: self.path
7,water_bottle_2,FileStore: self.path
8,water_bottle_3,FileStore: self.path
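The labels in the table follow a `product_index` pattern with 1-based indices, ordered product by product. As a small sketch of that naming scheme in plain Python (the `scenario_labels` helper is hypothetical; the counts mirror the three images per product in `IMAGE_UUIDS`):

```python
# Number of images per product, mirroring IMAGE_UUIDS above.
images_per_product = {"cabin_luggage": 3, "packing_cubes": 3, "water_bottle": 3}

def scenario_labels(counts):
    """Build 1-based `product_index` labels in dictionary order."""
    return [
        f"{product}_{position}"
        for product, count in counts.items()
        for position in range(1, count + 1)
    ]

labels = scenario_labels(images_per_product)
print(labels)
```

Comparing such a list with the `question_name` column is a quick way to confirm that no scenario was dropped or duplicated.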


In [16]:
%%time

from edsl import Survey

survey = Survey(questions = [q1, q2])

CPU times: user 110 μs, sys: 1e+03 ns, total: 111 μs
Wall time: 112 μs


In [17]:
from edsl import Model

Model.services()

Unnamed: 0,service
0,anthropic
1,azure
2,bedrock
3,deep_infra
4,deepseek
5,google
6,groq
7,mistral
8,ollama
9,open_router


In [18]:
from edsl import Model, ModelList

ml = ModelList([
#   Model("gemini-2.5-flash-lite", service_name = "google", temperature = 1, maxOutputTokens = 5000),
#   Model("claude-sonnet-4-20250514", service_name="anthropic", temperature = 1, max_tokens = 5000),
#   Model("claude-opus-4-1-20250805", service_name="anthropic", temperature = 1, max_tokens = 5000),
  Model("gpt-5-chat-latest", service_name = "openai", temperature = 1, max_tokens = 5000),
  Model("gpt-5", service_name = "openai", temperature = 1, max_tokens = 5000),
#   Model("gemini-2.0-flash", service_name = "google", temperature = 1, maxOutputTokens = 5000),
#   Model("gemini-2.5-flash", service_name = "google", temperature = 1, maxOutputTokens = 5000),
#   Model("gemini-1.5-flash", service_name = "google", temperature = 1, maxOutputTokens = 5000),
#   Model("gemini-2.5-pro", service_name = "google", temperature = 1, maxOutputTokens = 5000),
#   Model("gpt-4o", service_name = "openai", temperature = 0, max_tokens = 5000),
#   Model("anthropic.claude-3-5-sonnet-20240620-v1:0", service_name = "bedrock", temperature = 1, maxOutputTokens = 5000),
#   Model("anthropic.claude-3-opus-20240229-v1:0", service_name = "bedrock", temperature = 1, maxOutputTokens = 5000),
])

ml

# aws_bedrock_models = Model.available(service="bedrock")
# aws_bedrock_models


Unnamed: 0,model_name,service_name,temperature,max_tokens,top_p,frequency_penalty,presence_penalty,logprobs,top_logprobs
0,gpt-5-chat-latest,openai,1,5000,1,0,0,False,3
1,gpt-5,openai,1,5000,1,0,0,False,3
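In the run recorded in this notebook, every `gpt-5` row in the results table comes back with a 400 error saying that `max_tokens` is not supported with that model and that `max_completion_tokens` should be used instead. One way to keep the model list consistent is to choose the token-limit keyword per model. The sketch below is only that: `token_limit_kwargs` and `USES_MAX_COMPLETION_TOKENS` are hypothetical names, the set contains only the model named in this run's error message, and whether EDSL's `Model` forwards a `max_completion_tokens` keyword unchanged is an assumption to verify.

```python
# Assumption: only the model named in this run's error message is listed here.
USES_MAX_COMPLETION_TOKENS = {"gpt-5"}

def token_limit_kwargs(model_name: str, limit: int) -> dict:
    """Return the token-limit keyword argument for a model (hypothetical helper)."""
    if model_name in USES_MAX_COMPLETION_TOKENS:
        return {"max_completion_tokens": limit}
    return {"max_tokens": limit}

for name in ("gpt-5-chat-latest", "gpt-5"):
    print(name, token_limit_kwargs(name, 5000))
```

The resulting dictionary could then be unpacked into the model constructor, for example `Model(name, service_name = "openai", temperature = 1, **token_limit_kwargs(name, 5000))`, provided the service accepts that keyword.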


In [19]:
# import os
from dotenv import load_dotenv

load_dotenv()  # By default, loads from .env in the current directory

# Test if the API keys are loaded
# print(os.getenv("OPENAI_API_KEY"))
# print(os.getenv("GOOGLE_API_KEY"))
# print(os.getenv("EXPECTED_PARROT_API_KEY"))


True

In [20]:
%%time

results = survey.by(sl).by(ml).run()

CPU times: user 4.97 s, sys: 582 ms, total: 5.55 s
Wall time: 5.82 s
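The answers come back as free text, and in this run a failed call appears in the answer column as a string beginning with `[ERROR:`. A minimal sketch of separating failures from usable answers and pulling out the numeric rating is shown here in plain Python. `parse_answer` is a hypothetical helper, the pattern only covers the "9/10" and "9 out of 10" phrasings seen in this run, and it returns the first such rating in the text, which may not be the final score in every reply.

```python
import re

ERROR_PREFIX = "[ERROR:"

# Matches ratings written as "9/10" or "9 out of 10".
SCORE_PATTERN = re.compile(r"\b(10|[1-9])\s*(?:/|out of)\s*10\b")

def parse_answer(answer: str) -> dict:
    """Classify an answer string and extract the first 1-10 rating, if any."""
    if answer.startswith(ERROR_PREFIX):
        return {"ok": False, "score": None}
    match = SCORE_PATTERN.search(answer)
    return {"ok": True, "score": int(match.group(1)) if match else None}

samples = [
    "I would rate this description a **9/10**.",
    "I would rate this description a **9 out of 10**.",
    "[ERROR: Error code: 400 - unsupported parameter]",
]
parsed = [parse_answer(text) for text in samples]
print(parsed)
```

Applied to the `answer.improvements` column, this gives one numeric score per successful row and flags the rows that need to be rerun.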


In [21]:
results.select('model', 'scenario.question_name', 'answer.*')

Unnamed: 0,model.model,scenario.question_name,answer.describe,answer.improvements
0,gpt-5-chat-latest,cabin_luggage_1,"The object in the image is a sleek and modern suitcase, designed with a minimalist yet highly functional aesthetic. 1. **Appearance (color, shape, material):** The suitcase is rectangular with slightly rounded edges, giving it a clean and polished silhouette. Its color is a metallic silver, which enhances its modern and premium look. The body appears to be made of durable aluminum or polycarbonate, giving it both strength and a reflective sheen. The smooth surface is contrasted by evenly spaced vertical ridges running from top to bottom, adding dimension and structure to the design. 2. **Notable design features or patterns:** The vertical grooves are the most distinctive design feature, serving both as a stylistic element and potentially reinforcing the strength of the body. The corners are reinforced with silver-toned protectors, secured by visible rivets, suggesting durability and long-lasting use. At the top, a built-in handle is seamlessly integrated into the structure, while the bottom corners feature smooth-rolling wheels designed with a circular pattern, enhancing mobility. The top front also displays a small metallic plate with engraved branding. 3. **Functionality or potential use:** This is a high-end suitcase designed for travel, offering robust protection for belongings while maintaining a stylish exterior. The strong material ensures resilience against impacts, while the multidirectional wheels allow for easy maneuvering through airports or city streets. The reinforced corners and handles suggest it is built to withstand frequent and demanding use. The vertical ridges likely help reduce visible wear and scratches. 4. **Overall impression or aesthetic:** The suitcase conveys an image of sophistication, strength, and modern luxury. Its design merges practicality with elegance, giving it a professional and cosmopolitan feel. 
The metallic silver color paired with the streamlined ridged surface creates a timeless and iconic aesthetic, making it stand out as both a functional travel tool and a fashion statement. Overall, this object exudes **premium craftsmanship, durability, and minimalist elegance**, appealing to travelers who value both utility and style.","I would evaluate the provided description of the image as a **9/10**. ### Reasoning: 1. **Accuracy of Physical Description (Color, Shape, Material):** - The suitcase is indeed rectangular with rounded corners. - It has a metallic silver color with vertical ridges—a central design feature the description captures correctly. - The material mentioned (aluminum or polycarbonate) is accurate; this type of suitcase is typically built with aluminum. ✅ Very precise. 2. **Design Features Recognition:** - The vertical grooves, reinforced corners, rivets, integrated handle, and multi-directional wheels with circular accents are all correctly noted. - The metallic branding plate at the top is also visible in the image and well described. ✅ Excellent level of detail—shows good observational accuracy. 3. **Functionality and Use:** - The description highlights durability, mobility, and practical travel use, which aligns with the visible features (roll wheels, hard-shell body, reinforced design). - The mention of ridges reducing scratches is insightful, though more speculative than explicitly visible. ✅ Strong reasoning, minor inference but plausible. 4. **Overall Aesthetic Impression:** - Words like ""sophistication,"" ""minimalist elegance,"" and ""premium craftsmanship"" align with both the real-world branding and the visual appearance of the suitcase. ✅ Well written and consistent with visual evidence. ### Why not a perfect 10? 
- The description leans slightly into subjective interpretation (""timeless and iconic aesthetic,"" ""cosmopolitan feel"") which, while stylish and likely true for this product, adds a bit of marketing-style flourish rather than strict visual analysis. - A small improvement would be to stick more to direct verifiable observations or balance subjective adjectives with more objective details (e.g., specific wheel count, latch system detail). ### Final Score: **9/10** A very strong, accurate, and detailed description—professional and thorough with just a touch of subjective bias. Would you like me to **rewrite the description in a purely objective style** (stripping away the marketing-like phrasing) so you could compare a 10/10 ""objective version""?"
1,gpt-5,cabin_luggage_1,"[ERROR: Error code: 400 - {'error': {'message': ""Unsupported parameter: 'max_tokens' is not supported with this model. Use 'max_completion_tokens' instead."", 'type': 'invalid_request_error', 'param': 'max_tokens', 'code': 'unsupported_parameter'}}]","[ERROR: Error code: 400 - {'error': {'message': ""Unsupported parameter: 'max_tokens' is not supported with this model. Use 'max_completion_tokens' instead."", 'type': 'invalid_request_error', 'param': 'max_tokens', 'code': 'unsupported_parameter'}}]"
2,gpt-5-chat-latest,cabin_luggage_2,"The object in the image is a sleek and modern travel suitcase with a highly polished and professional appearance. **1. Appearance (Color, Shape, Material):** The suitcase has a hard-shell exterior made of durable metallic silver material, giving it a glossy, reflective finish that suggests it is constructed from lightweight yet sturdy aluminum or polycarbonate. It is rectangular with slightly rounded edges, contributing to a streamlined and robust look. **2. Notable Design Features or Patterns:** Vertical ridges run uniformly across the body of the suitcase, adding not only a distinctive visual texture but also extra reinforcement to the structure. The suitcase features a telescopic handle with an ergonomic grip on top, extendable to accommodate comfortable pulling. It also has an additional sturdy side handle for easy lifting when needed. Two combination locks are integrated into the side, providing security and a touch of sophistication. The case is mounted on four smooth-rolling wheels, each capable of rotating in multiple directions, suggesting a high level of maneuverability. Subtle corner reinforcements at the edges enhance durability. **3. Functionality or Potential Use:** This suitcase is clearly designed for modern travel, ideal for use as a carry-on due to its compact size and efficient shape. Its hard shell ensures protection for belongings, while the locks add a layer of security. The 360-degree rotating wheels allow for effortless gliding through airports, and its lightweight yet durable construction ensures it can withstand frequent travel. **4. Overall Impression or Aesthetic:** The overall aesthetic is one of elegance and functionality combined. Its metallic silver tone conveys a sense of minimalism, modernity, and sophistication. The ridged pattern makes it iconic and recognizable, while also giving it a futuristic, almost architectural appearance. 
It exudes luxury, durability, and efficiency, appealing to travelers who value both style and practicality. Would you like me to take this description further by writing it in the style of a **product listing** (professional marketing tone) or in a more **creative, narrative style** (like describing how it feels to travel with it)?","I’d rate this description a **9 out of 10**. ### Reasoning: 1. **Accuracy (Strong Point):** The description is highly accurate to the image. The color (metallic silver), material (hard-shell polycarbonate/aluminum look), ridged pattern, telescopic handle, side handle, TSA combination locks, and 360° spinner wheels are all clearly visible in the photo and correctly described. These details demonstrate careful observation and precision. 2. **Level of Detail (Strong Point):** The breakdown into categories (appearance, features, functionality, overall impression) is thorough and organized. It reflects not just the physical characteristics but also the usability and style. The inclusion of functionality (carry-on suitability, maneuverability, locks for security, durability for frequent travel) makes this very close to professional product copy. 3. **Tone (Strong Point):** The tone strikes a balance between technical accuracy and engaging description. Phrases like *“futuristic, almost architectural appearance”* and *“exudes luxury, durability, and efficiency”* add sophistication. 4. **Minor Areas for Improvement (Why not a 10/10):** - The description assumes the suitcase is made of aluminum or polycarbonate without confirmation—this could be misleading if the material is specifically one or the other. - The description suggests it’s “clearly designed for carry-on” based on compactness, but actual size isn’t evident from the single image without context (dimensions would be better for certainty). - Some of the wording (e.g., “iconic”) leans toward marketing language, which might not fit if the goal is a purely objective description. 
### Final Evaluation: **9/10** It’s a well-written, detailed, and accurate description, almost product-listing ready, with only very minor assumptions that slightly reduce objectivity. --- Would you like me to refine this description to be **100% objective (factual and image-based only)**, or keep the **slightly marketing-oriented style** you already have?"
3,gpt-5,cabin_luggage_2,"[ERROR: Error code: 400 - {'error': {'message': ""Unsupported parameter: 'max_tokens' is not supported with this model. Use 'max_completion_tokens' instead."", 'type': 'invalid_request_error', 'param': 'max_tokens', 'code': 'unsupported_parameter'}}]","[ERROR: Error code: 400 - {'error': {'message': ""Unsupported parameter: 'max_tokens' is not supported with this model. Use 'max_completion_tokens' instead."", 'type': 'invalid_request_error', 'param': 'max_tokens', 'code': 'unsupported_parameter'}}]"
4,gpt-5-chat-latest,cabin_luggage_3,"The object in the image is a hard-shell suitcase, designed with a sleek, modern, and highly functional aesthetic. 1. **Appearance (Color, Shape, Material):** - The suitcase is metallic silver in color, giving it a polished, industrial yet elegant appearance. - Its body is rectangular with slightly rounded edges and reinforced corners for durability. - The material appears to be metal, most likely aluminum, which provides strength and a premium feel while protecting belongings inside. 2. **Notable Design Features or Patterns:** - Vertical ridges run along the suitcase’s surface, adding both structural reinforcement and a distinct design motif. - It has two integrated TSA-approved combination locks, embedded on either end of the central latch system for secure closure. - The handle is flush with the body, designed for ease of carrying with a sturdy, angular grip. - Four multidirectional wheels are positioned at the bottom, allowing for easy maneuverability. - The suitcase also has visible rivets and metal brackets, reinforcing its industrial, utilitarian style. 3. **Functionality or Potential Use:** - This is a high-quality travel suitcase intended for air, train, or road travel. - The hard-shell metal body provides excellent protection for fragile or valuable belongings. - The TSA locks allow travelers to secure their luggage while still complying with security regulations. - The wheels and side handle make it convenient to pull or carry in tight spaces. 4. **Overall Impression / Aesthetic:** - The overall aesthetic is minimalistic yet robust — a perfect blend of luxury and functionality. - It conveys a modern, professional, and premium feel, appealing to frequent travelers who value both durability and sophisticated design. - The metallic finish, coupled with mechanical details, gives it a futuristic, almost industrial-chic appearance. 
This suitcase stands out as both a stylish and practical travel companion, projecting reliability, security, and status. Would you like me to write this description in a **more lifestyle-oriented tone** (as though for an advertisement) or in a **technical/product catalog tone**?","I would rate this description a **9/10**. ### Reasoning: **Strengths:** 1. **Detail & Accuracy:** - The description does a great job of identifying key features seen in the image, such as the metallic silver color, vertical ridges, TSA locks, side handle, and multidirectional wheels. - Details such as rivets, brackets, and reinforced corners are also mentioned, which reflect what is visible in the image. 2. **Structured Organization:** - Breaking it into **Appearance, Design Features, Functionality, and Overall Impression** makes it easy to follow and comprehensive. 3. **Well-Balanced Tone:** - The description sits between technical and lifestyle, giving both functional details and an appealing aesthetic impression. - Words like *industrial-chic*, *premium feel*, and *sophisticated design* make it sound polished and engaging. **Minor Weaknesses:** 1. The description goes slightly beyond what can actually be confirmed in the photo. For example: - Claiming the material is **metal (aluminum)** — plausible but not directly verifiable from the image alone. - The claim that the locks are **TSA-approved** is very likely correct (based on design), but it's an assumption without explicit labeling visible in the photo. - Aesthetic terms like ""luxury,"" ""status,"" and ""professional"" are subjective and not directly derived from the visual evidence. 2. Since the provided image only shows a **side view**, the description assumes the overall **rectangular body** and **four wheels** — consistent with luggage design, yes, but not fully evident from this narrow angle. 
### Final Verdict: - **Score: 9/10** — Very strong description, accurate and well-written, with just a bit of extrapolation beyond the visual evidence. - To push it to a **10/10**, I’d recommend tightening the description to only what’s strictly visible in the photo, and then clarifying which details are *assumed* based on common product knowledge. Would you like me to **revise this description into a 100% image-verified version only** (without assumptions), or keep the more interpretive/product-review style?"
5,gpt-5,cabin_luggage_3,"[ERROR: Error code: 400 - {'error': {'message': ""Unsupported parameter: 'max_tokens' is not supported with this model. Use 'max_completion_tokens' instead."", 'type': 'invalid_request_error', 'param': 'max_tokens', 'code': 'unsupported_parameter'}}]","[ERROR: Error code: 400 - {'error': {'message': ""Unsupported parameter: 'max_tokens' is not supported with this model. Use 'max_completion_tokens' instead."", 'type': 'invalid_request_error', 'param': 'max_tokens', 'code': 'unsupported_parameter'}}]"
6,gpt-5-chat-latest,packing_cubes_1,"The image shows a set of neatly designed packing cubes, which are likely intended for travel or organized storage within luggage. 1. **Appearance (color, shape, material):** The objects are rectangular, soft-sided fabric containers in a sleek, matte black color. The material looks like a durable synthetic fabric, possibly nylon or polyester, with a smooth, slightly textured surface that suggests it is lightweight and water-resistant. Each pouch features black zippers with dual zipper pulls for easy access. 2. **Design features or patterns:** The cubes have a minimalist design, free from excessive decoration or branding, giving them a clean and modern look. One packing cube is open, revealing a spacious interior that is lined with light gray fabric, contrasting nicely with the exterior. Inside, rolled clothing items (in white, light blue, and gray) are shown to illustrate their storage capacity. The open cube also appears to have a mesh or thinner lining on the inside of the lid, which could help with breathability and visibility of contents. 3. **Functionality or potential use:** These are **packing cubes**, an organizational tool used by travelers to keep clothing, accessories, or other belongings neatly separated within a suitcase or travel bag. Their rectangular, box-like structure makes them perfect for maximizing luggage space, preventing clothes from shifting, and making packing/unpacking more efficient. The zippers with dual pulls enhance convenience, making it possible to open them from either side. 4. **Overall impression or aesthetic:** The overall impression is one of **practical elegance and efficiency**. The black exterior gives the cubes a professional, sleek, and versatile appearance suitable for both leisure and business travelers. The clean lines, soft structure, and functional details suggest an object designed for people who value organization, tidiness, and simplicity while traveling. 
✨ In short: these packing cubes are modern, durable, and understated, combining functionality with a streamlined aesthetic. Would you like me to also suggest possible **sizes these cubes might come in (small, medium, large)** and what types of items they’d be most useful for?","I would rate the description of this image a **9 out of 10**. ### Reasoning: 1. **Accuracy (High):** - The description correctly identifies the objects as **packing cubes** and accurately conveys their **shape (rectangular, box-like)**, **material (synthetic, durable fabric)**, **color (black with inner gray lining)**, and **features (zippers with dual pulls)**. - The observation of one cube being open with **rolled clothing in white, light blue, and gray** is precise. - It also notes the possible **mesh/thinner lining inside the lid**, which is indeed present. 2. **Detail Level (Very Strong):** - The breakdown into **appearance, design, functionality, and aesthetic** provides a comprehensive overview. - The description does more than just say what’s in the image—it interprets **intended use** (organization in luggage), **design philosophy** (minimal, modern), and **overall impression** (professional and tidy). 3. **Clarity & Flow (Excellent):** - Written in a structured, professional tone that’s easy to follow. - Each section logically supports the conclusion. 4. **Minor Opportunities for Improvement:** - While very complete, the description **could briefly mention the presence of multiple cube sizes in the set** (small, medium, large) since the scale difference between the three cubes is visible in the image. - The mention of the material being “possibly nylon or polyester” is a good educated guess but could be hedged slightly more neutrally (""appears to be a synthetic fabric"") since the exact fabric type isn’t fully verifiable from just the image. ✅ Overall: The description is **highly accurate, thoughtful, and well-written**, with just small refinements possible. 
Would you like me to **revise the description into an even shorter, polished version** (like a product listing-style blurb) that gets to the essentials more directly?"
7,gpt-5,packing_cubes_1,"[ERROR: Error code: 400 - {'error': {'message': ""Unsupported parameter: 'max_tokens' is not supported with this model. Use 'max_completion_tokens' instead."", 'type': 'invalid_request_error', 'param': 'max_tokens', 'code': 'unsupported_parameter'}}]","[ERROR: Error code: 400 - {'error': {'message': ""Unsupported parameter: 'max_tokens' is not supported with this model. Use 'max_completion_tokens' instead."", 'type': 'invalid_request_error', 'param': 'max_tokens', 'code': 'unsupported_parameter'}}]"
8,gpt-5-chat-latest,packing_cubes_2,"The object in the image is a **packing cube** designed for organizing clothing and travel essentials. Here’s a detailed description based on its appearance and attributes: 1. **Appearance (Color, Shape, and Material):** The packing cube is rectangular with rounded corners and features a sleek, minimalist design. The exterior and base appear to be made of durable black fabric, likely nylon or polyester, while the lid interior is a light gray mesh or mesh-like fabric that provides breathability and visibility of the contents. The zipper encasing the cube is black, matching the exterior for a clean, uniform look. 2. **Notable Design Features or Patterns:** The cube opens clamshell-style with a zipper running along three sides, allowing the lid to lay flat when unzipped. Inside, the clothing is neatly folded and held in place by the cube’s structured walls. The gray mesh interior adds a subtle, modern texture while also serving a practical function—allowing air circulation and keeping garments fresh. There are no bold patterns or excessive detailing, emphasizing simplicity and efficiency. 3. **Functionality or Potential Use:** The primary purpose of the packing cube is for **travel organization**. It is designed to neatly store and compress clothing items, making it easier to pack efficiently within a suitcase or travel bag. In this case, it is shown holding a folded white collared shirt and a pair of blue denim jeans, keeping them tidy and wrinkle-reduced while maximizing packing space. The cube helps travelers separate outfits, keep clothing accessible, and enhance space management. 4. **Overall Impression or Aesthetic:** The object conveys a **modern, practical, and orderly aesthetic**. Its streamlined design, neutral colors, and smooth fabric emphasize functionality with an understated sophistication. 
It gives the impression of a well-thought-out travel accessory for someone who values efficiency, neatness, and style while packing. Would you like me to write this description in the form of **a product listing (persuasive and appealing for shoppers)** or more like **an artistic/visual piece (emphasizing imagery and style)?**","I would rate this description a **9 out of 10**. ### Reasoning: ✅ **Accuracy (Strong):** - The description correctly identifies the object as a **packing cube** and explains its purpose—organizing clothing for travel. - It accurately notes the **rectangular shape with rounded corners**, the **black exterior material**, the **gray mesh lid**, and the **zipper along three sides**. - The contents inside (a folded white collared shirt and blue denim jeans) were also described correctly. ✅ **Attention to Design Details (Strong):** - The analysis captures both functional and aesthetic aspects, such as the minimalist design, practical mesh feature, and compression/organization benefits. - It explains how the cube helps with wrinkle reduction and tidy packing—true to the product’s function. ✅ **Descriptive Depth (Strong):** - The description goes beyond mere identification and highlights **practicality, visual impression, and potential customer appeal**. This gives a well-rounded picture of the object. ⚠️ **Minor Point of Improvement:** - The description could be slightly more precise about the mesh. From the image, the **entire inside of the lid is covered with a gray mesh panel**, but it might be easy for a reader to assume only a small section is mesh. - Although thorough, the writing leans a bit verbose—it could be made more concise while retaining clarity. ### Conclusion: The description is **highly accurate, detailed, and practical**, making it very effective. A few tweaks for conciseness and sharper emphasis on the **lid construction** would make it a perfect **10/10**. 
Would you like me to **refine this description into a concise product-listing style version** (for marketing) or keep it as a **technical description** (more functional)?"
9,gpt-5,packing_cubes_2,"[ERROR: Error code: 400 - {'error': {'message': ""Unsupported parameter: 'max_tokens' is not supported with this model. Use 'max_completion_tokens' instead."", 'type': 'invalid_request_error', 'param': 'max_tokens', 'code': 'unsupported_parameter'}}]","[ERROR: Error code: 400 - {'error': {'message': ""Unsupported parameter: 'max_tokens' is not supported with this model. Use 'max_completion_tokens' instead."", 'type': 'invalid_request_error', 'param': 'max_tokens', 'code': 'unsupported_parameter'}}]"
