## Welcome to the Second Lab - Week 1, Day 3

Today we will work with lots of models! This is a way to get comfortable with APIs.

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/stop.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Important point - please read</h2>
            <span style="color:#ff7800;">The way I collaborate with you may be different to other courses you've taken. I prefer not to type code while you watch. Rather, I execute Jupyter Labs, like this, and give you an intuition for what's going on. My suggestion is that you carefully execute this yourself, <b>after</b> watching the lecture. Add print statements to understand what's going on, and then come up with your own variations.<br/><br/>If you have time, I'd love it if you submit a PR for changes in the community_contributions folder - instructions in the resources. Also, if you have a GitHub account, use it to showcase your variations. Not only is this essential practice, but it demonstrates your skills to others, including perhaps future clients or employers...
            </span>
        </td>
    </tr>
</table>

In [1]:
# Start with imports - ask ChatGPT to explain any package that you don't know

import os
import json
from dotenv import load_dotenv
from openai import OpenAI
from anthropic import Anthropic
from IPython.display import Markdown, display

In [None]:
# Always remember to do this!
load_dotenv(override=True)

In [None]:
# Print the key prefixes to help with any debugging

openai_api_key = os.getenv('OPENAI_API_KEY')
anthropic_api_key = os.getenv('ANTHROPIC_API_KEY')
google_api_key = os.getenv('GOOGLE_API_KEY')
deepseek_api_key = os.getenv('DEEPSEEK_API_KEY')
groq_api_key = os.getenv('GROQ_API_KEY')

if openai_api_key:
    print(f"OpenAI API Key exists and begins {openai_api_key[:8]}")
else:
    print("OpenAI API Key not set")
    
if anthropic_api_key:
    print(f"Anthropic API Key exists and begins {anthropic_api_key[:7]}")
else:
    print("Anthropic API Key not set (and this is optional)")

if google_api_key:
    print(f"Google API Key exists and begins {google_api_key[:2]}")
else:
    print("Google API Key not set (and this is optional)")

if deepseek_api_key:
    print(f"DeepSeek API Key exists and begins {deepseek_api_key[:3]}")
else:
    print("DeepSeek API Key not set (and this is optional)")

if groq_api_key:
    print(f"Groq API Key exists and begins {groq_api_key[:4]}")
else:
    print("Groq API Key not set (and this is optional)")
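
The five checks above all follow the same pattern, so they could be folded into a loop. This is only a sketch: the `KEY_SPECS` table and the `describe_key` helper are hypothetical names, not part of the lab code, and the prefix lengths are simply the ones used in the cell above.

```python
import os

# Environment variable -> (display name, prefix length to print, optional?)
# Hypothetical table; the lengths are copied from the cell above.
KEY_SPECS = {
    "OPENAI_API_KEY": ("OpenAI", 8, False),
    "ANTHROPIC_API_KEY": ("Anthropic", 7, True),
    "GOOGLE_API_KEY": ("Google", 2, True),
    "DEEPSEEK_API_KEY": ("DeepSeek", 3, True),
    "GROQ_API_KEY": ("Groq", 4, True),
}


def describe_key(label, value, prefix_len, optional=True):
    """Return a status line for one API key without revealing the whole key."""
    if value:
        return f"{label} API Key exists and begins {value[:prefix_len]}"
    suffix = " (and this is optional)" if optional else ""
    return f"{label} API Key not set{suffix}"


for env_name, (label, prefix_len, optional) in KEY_SPECS.items():
    print(describe_key(label, os.getenv(env_name), prefix_len, optional))
```

Adding another provider then means adding one row to the table rather than another `if`/`else` block.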

In [4]:
request = "Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. "
request += "Answer only with the question, no explanation."
messages = [{"role": "user", "content": request}]

In [None]:
messages

In [None]:
openai = OpenAI()
response = openai.chat.completions.create(
    model="gpt-4o-mini",
    messages=messages,
)
question = response.choices[0].message.content
print(question)


In [7]:
competitors = []
answers = []
messages = [{"role": "user", "content": question}]

In [None]:
# The API we know well

model_name = "gpt-4o-mini"

response = openai.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

In [None]:
# Anthropic has a slightly different API, and Max Tokens is required

model_name = "claude-3-7-sonnet-latest"

claude = Anthropic()
response = claude.messages.create(model=model_name, messages=messages, max_tokens=1000)
answer = response.content[0].text

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)
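
The practical difference between the two APIs in this lab is where the reply text lives: `response.choices[0].message.content` for the OpenAI client and `response.content[0].text` for the Anthropic client. As an illustration only, a hypothetical `extract_text` helper could hide that difference. The objects below are stand-ins built with `SimpleNamespace`, not real API responses.

```python
from types import SimpleNamespace


def extract_text(response):
    """Return the reply text from either response shape used in this lab."""
    if hasattr(response, "choices"):
        # OpenAI-style chat completion: choices[0].message.content
        return response.choices[0].message.content
    # Anthropic-style message: content[0].text
    return response.content[0].text


# Stand-in objects that mimic the two shapes (not real API responses)
openai_like = SimpleNamespace(
    choices=[SimpleNamespace(message=SimpleNamespace(content="Hello from OpenAI"))]
)
anthropic_like = SimpleNamespace(content=[SimpleNamespace(text="Hello from Claude")])

print(extract_text(openai_like))
print(extract_text(anthropic_like))
```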

In [None]:
gemini = OpenAI(api_key=google_api_key, base_url="https://generativelanguage.googleapis.com/v1beta/openai/")
model_name = "gemini-2.0-flash"

response = gemini.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

In [None]:
deepseek = OpenAI(api_key=deepseek_api_key, base_url="https://api.deepseek.com/v1")
model_name = "deepseek-chat"

response = deepseek.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

In [None]:
groq = OpenAI(api_key=groq_api_key, base_url="https://api.groq.com/openai/v1")
model_name = "llama-3.3-70b-versatile"

response = groq.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)
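
The Gemini, DeepSeek and Groq cells differ only in base URL, API key and model name, because all three expose an OpenAI-compatible endpoint. One way to make that visible is a small lookup table. This is a sketch under assumptions: `PROVIDERS` and `client_kwargs` are hypothetical names, while the URLs and environment variable names are the ones used earlier in this notebook.

```python
import os

# Base URLs copied from the cells above; the environment variable names match
# the ones loaded at the top of the notebook.
PROVIDERS = {
    "gemini": ("https://generativelanguage.googleapis.com/v1beta/openai/", "GOOGLE_API_KEY"),
    "deepseek": ("https://api.deepseek.com/v1", "DEEPSEEK_API_KEY"),
    "groq": ("https://api.groq.com/openai/v1", "GROQ_API_KEY"),
}


def client_kwargs(provider):
    """Build keyword arguments for OpenAI(...) for a named provider (hypothetical helper)."""
    base_url, env_name = PROVIDERS[provider]
    return {"base_url": base_url, "api_key": os.getenv(env_name)}


print(client_kwargs("groq")["base_url"])
```

With this in place, `groq = OpenAI(**client_kwargs("groq"))` should build the same client as the cell above.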


## For the next cell, we will use Ollama

Ollama runs a local web service that gives an OpenAI-compatible endpoint,  
and runs models locally using high-performance C++ code.

If you don't have Ollama, install it by visiting https://ollama.com, pressing Download and following the instructions.

After it's installed, you should be able to visit here: http://localhost:11434 and see the message "Ollama is running"

You might need to restart Cursor (and maybe reboot). Then open a Terminal (control+\`) and run `ollama serve`

Useful Ollama commands (run these in the terminal, or with an exclamation mark in this notebook):

`ollama pull <model_name>` downloads a model locally  
`ollama ls` lists all the models you've downloaded  
`ollama rm <model_name>` deletes the specified model from your downloads
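
If you would rather check from the notebook than from the browser, something like the following could work. It is a minimal sketch using only the standard library. `ollama_is_running` is a hypothetical helper name, and it assumes the default address and the "Ollama is running" message mentioned above.

```python
import http.client
import urllib.request


def ollama_is_running(url="http://localhost:11434", timeout=2.0):
    """Return True if the URL answers with the "Ollama is running" message."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as reply:
            body = reply.read().decode("utf-8", errors="replace")
    except (OSError, ValueError, http.client.HTTPException):
        # Connection refused, timeout, HTTP error or malformed URL
        return False
    return "Ollama is running" in body


print(ollama_is_running())
```

If this prints `False`, try `ollama serve` in a terminal and run the cell again.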

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/stop.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Super important - ignore me at your peril!</h2>
            <span style="color:#ff7800;">The model called <b>llama3.3</b> is FAR too large for home computers - it's not intended for personal computing and will consume all your resources! Stick with the nicely sized <b>llama3.2</b> or <b>llama3.2:1b</b> and if you want larger, try llama3.1 or smaller variants of Qwen, Gemma, Phi or DeepSeek. See <a href="https://ollama.com/models">the Ollama models page</a> for a full list of models and sizes.
            </span>
        </td>
    </tr>
</table>

In [2]:
from openai import OpenAI
from IPython.display import Markdown, display
import json

In [3]:
request = "Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. "
request += "Answer only with the question, no explanation."
messages = [{"role": "user", "content": request}]

In [4]:
# Generate a question
ollama = OpenAI(base_url='http://localhost:11434/v1', api_key='ollama')
model_name = "llama3.2:1b"

response = ollama.chat.completions.create(model=model_name, messages=messages)
question = response.choices[0].message.content
display(Markdown(question))

Can you explain why the sentence "The quick brown fox jumps over the lazy dog" is an ungrammatical instance of modalized modality while still conveying a valid propositional meaning?

In [5]:
# Prompt for Answers
competitors = []
answers = []
messages = [{"role": "user", "content": question}]

In [6]:
# Ask Llama 3.2
ollama = OpenAI(base_url='http://localhost:11434/v1', api_key='ollama')
model_name = "llama3.2:1b"

response = ollama.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

A classic example from logic and linguistics.

The phrase "the quick brown fox jumps over the lazy dog" is indeed an ungrammatical sentence that conveys a valid propositional meaning. However, it can be considered an instance of modalized modality because it expresses a hypothetical situation or possibility.

Here's a breakdown:

* The word "quick," which denotes the color, has a proposition associated with it: "brown." This part of speech is often referred to as a "predicate."
* Similarly, the words "lazy" and "jumps over" are predicates, which express verbs or actions.
* Now, consider the sentence structure:
"The quick brown fox jumps over the lazy dog"

This sentence can be rephrased in two ways:

1. It doesn't assert an existing state of affairs; instead, it expresses a hypothetical possibility: "May a quick brown fox jump over a lazy dog."
2. Or, it could be argued that it's not actually possible for a quick brown fox to jump over a lazy dog.

In these rephrasings, we are not using modal verbs (e.g., "it is likely" or "it must") like in other sentences involving possibility or obligation. Instead, the focus is on representing the proposition associated with the predicates ("quick," "brown", "lazy"). This representation of a hypothetical situation or possibility allows us to express that a particular action may take place in certain circumstances.

The sentence as a whole can be seen as an expression of the modalized modality:

**Proposition: May (or Must) X Z, where X is a predicate and Z is a concept.**

In this case, **X = "jumps"**, **Z = "over the lazy dog,"** and is associated with predicates in the proposition ("quick" for brown).

So, while the sentence itself is ungrammatical due to its specific syntax, it effectively conveys a valid propositional meaning by representing a hypothetical situation or possibility.

In [7]:
# Ask Qwen 2.5
ollama = OpenAI(base_url='http://localhost:11434/v1', api_key='ollama')
model_name = "qwen2.5:0.5b"

response = ollama.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

Yes, I can answer your question.
In this case, "The quick brown fox jumps over the lazy dog" isn't a grammatically perfect sentence. However, as a linguistic phenomenon, it's often seen as ungrammatical in certain contexts. This is because while it expresses an overall sense of motion or movement ("The quick brown fox"), there are several factors that make it non-grammatical:

1. **Syntax**: Modifiers and other elements within the sentence should not be used to modify parts outside of them, which is known as "syntax error." In this case, "lazy dog" can't function as part of the main clause or modify any other words in a way that would make it legitimate.

2. **Pedants' point**: Some people argue for keeping word order consistent and applying a grammar rule, but there's a debate between these viewpoints since they don't address syntax errors directly; they might suggest "lazy fox jumps over."

3. **Semantic issues**: The sentence could be ungrammatical because the structure is inconsistent with standard English rules for verbs in second-person singular, as indicated by "The." However, it follows English orthographic conventions (usually).

4. **Word usage**: It's possible to interpret this more as a poetic or colloquial usage rather than a proper sentence.

**Grammatically incorrect but semantically valid**

If one considers the linguistic structure and potential misuse within certain contexts but maintains that it is ungrammatical per strict rules, we could consider it grammatically imperfect in a sense:

"It seems to suggest multiple parts being part of an entity without any logical connection outside these parts. It's like trying to say 'The cat chased the dog' when this specific statement doesn't adhere to common rules."

This form can be seen as a semantic alternative that maintains structural integrity while still conveying meaningful meaning about the entities involved, particularly "cat" and "dog."

**Conclusion**: This sentence isn't ungrammatical per standard English rules but rather in a non-standard context where its parts are connected without logical adherence to conventional language structures.

In [8]:
# Ask Phi 4
ollama = OpenAI(base_url='http://localhost:11434/v1', api_key='ollama')
model_name = "phi4-mini:3.8b"

response = ollama.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

Yes, I can help with that. In this case, it's important to remember what we mean by 'ungrammatical.' Grammatically speaking (syntax), sentences must follow rules and structures laid out in linguistic conventions for them be deemed grammatically viable within their language context.

The sentence "The quick brown fox jumps over the lazy dog" follows those conventional structural requirements. It's syntactically correct: noun phrases ('the quick brown fox', 'the lazy dog') are properly constructed, as is it conjunction with a verb phrase ('jumps over'). No parts of speech or rules related to subject-verb agreement seem violated.

However, in terms of modalized modality (which can involve expressions like *can*, must), this sentence appears ungrammatical because there's no explicit expression indicating necessity/reliability/fact/potentiality. These words ("can't", "mustn't") are typically used when expressing certain shades or flavors to a proposition.


Nonetheless - it conveys propositional meaning, as expected from the syntax: it's about an action ('jumps over') being performed by 'the quick brown fox' onto/in regards/to/from/within/later/the time-space of 'the lazy dog'. In this sense, even though modality isn't expressed in explicit terms here (no adverbs), we're clearly aware that a jumping act is taking place between two distinct agents or actors - the subjects.


In sum: there's no violation against syntactic rules. It's just an instance where propositional statements convey inherent modal flavors without requiring overt 'must', 'can' etc., expressions -- thus, in this context deemed ungrammatical for its lack of visible modality indicators despite accurate representation as per usual sentence building criteria.

In [9]:
# So where are we?
print(competitors)
print(answers)

['llama3.2:1b', 'qwen2.5:0.5b', 'phi4-mini:3.8b']
['A classic example from logic and linguistics.\n\nThe phrase "the quick brown fox jumps over the lazy dog" is indeed an ungrammatical sentence that conveys a valid propositional meaning. However, it can be considered an instance of modalized modality because it expresses a hypothetical situation or possibility.\n\nHere\'s a breakdown:\n\n* The word "quick," which denotes the color, has a proposition associated with it: "brown." This part of speech is often referred to as a "predicate."\n* Similarly, the words "lazy" and "jumps over" are predicates, which express verbs or actions.\n* Now, consider the sentence structure:\n"The quick brown fox jumps over the lazy dog"\n\nThis sentence can be rephrased in two ways:\n\n1. It doesn\'t assert an existing state of affairs; instead, it expresses a hypothetical possibility: "May a quick brown fox jump over a lazy dog."\n2. Or, it could be argued that it\'s not actually possible for a quick brow

In [10]:
# It's nice to know how to use "zip"
for competitor, answer in zip(competitors, answers):
    print(f"Competitor: {competitor}\n\n{answer}")


Competitor: llama3.2:1b

A classic example from logic and linguistics.

The phrase "the quick brown fox jumps over the lazy dog" is indeed an ungrammatical sentence that conveys a valid propositional meaning. However, it can be considered an instance of modalized modality because it expresses a hypothetical situation or possibility.

Here's a breakdown:

* The word "quick," which denotes the color, has a proposition associated with it: "brown." This part of speech is often referred to as a "predicate."
* Similarly, the words "lazy" and "jumps over" are predicates, which express verbs or actions.
* Now, consider the sentence structure:
"The quick brown fox jumps over the lazy dog"

This sentence can be rephrased in two ways:

1. It doesn't assert an existing state of affairs; instead, it expresses a hypothetical possibility: "May a quick brown fox jump over a lazy dog."
2. Or, it could be argued that it's not actually possible for a quick brown fox to jump over a lazy dog.

In these rep

In [11]:
# Let's bring this together - note the use of "enumerate"

together = ""
for index, answer in enumerate(answers):
    together += f"# Response from competitor {index+1}\n\n"
    together += answer + "\n\n"
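
The same string can be built in a single expression with `enumerate(..., start=1)` and `str.join`, which avoids the `index+1` arithmetic. A small sketch, with `combine_answers` as a hypothetical helper name and placeholder answers instead of real model output:

```python
def combine_answers(answers):
    """Join answers under numbered headings, one section per competitor."""
    return "".join(
        f"# Response from competitor {number}\n\n{answer}\n\n"
        for number, answer in enumerate(answers, start=1)
    )


print(combine_answers(["First answer", "Second answer"]))
```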

In [12]:
print(together)

# Response from competitor 1

A classic example from logic and linguistics.

The phrase "the quick brown fox jumps over the lazy dog" is indeed an ungrammatical sentence that conveys a valid propositional meaning. However, it can be considered an instance of modalized modality because it expresses a hypothetical situation or possibility.

Here's a breakdown:

* The word "quick," which denotes the color, has a proposition associated with it: "brown." This part of speech is often referred to as a "predicate."
* Similarly, the words "lazy" and "jumps over" are predicates, which express verbs or actions.
* Now, consider the sentence structure:
"The quick brown fox jumps over the lazy dog"

This sentence can be rephrased in two ways:

1. It doesn't assert an existing state of affairs; instead, it expresses a hypothetical possibility: "May a quick brown fox jump over a lazy dog."
2. Or, it could be argued that it's not actually possible for a quick brown fox to jump over a lazy dog.

In thes

In [13]:
judge = f"""You are judging a competition between {len(competitors)} competitors.
Each model has been given this question:

{question}

Your job is to evaluate each response for clarity and strength of argument, and rank them in order of best to worst.
Respond with JSON, and only JSON, with the following format:
{{"results": ["best competitor number", "second best competitor number", "third best competitor number", ...]}}

Here are the responses from each competitor:

{together}

Now respond with the JSON with the ranked order of the competitors, nothing else. Do not include markdown formatting or code blocks."""


In [14]:
print(judge)

You are judging a competition between 3 competitors.
Each model has been given this question:

Can you explain why the sentence "The quick brown fox jumps over the lazy dog" is an ungrammatical instance of modalized modality while still conveying a valid propositional meaning?

Your job is to evaluate each response for clarity and strength of argument, and rank them in order of best to worst.
Respond with JSON, and only JSON, with the following format:
{"results": ["best competitor number", "second best competitor number", "third best competitor number", ...]}

Here are the responses from each competitor:

# Response from competitor 1

A classic example from logic and linguistics.

The phrase "the quick brown fox jumps over the lazy dog" is indeed an ungrammatical sentence that conveys a valid propositional meaning. However, it can be considered an instance of modalized modality because it expresses a hypothetical situation or possibility.

Here's a breakdown:

* The word "quick," which denotes the color,

In [15]:
judge_messages = [{"role": "user", "content": judge}]

In [20]:
# Judgement time!

ollama = OpenAI(base_url='http://localhost:11434/v1', api_key='ollama')
response = ollama.chat.completions.create(
    model="llama3.2:1b",
    messages=judge_messages,
)
results = response.choices[0].message.content
print(results)


{"results": ["3", "1", "2"]}


In [21]:
# OK let's turn this into results!

results_dict = json.loads(results)
ranks = results_dict["results"]
for index, result in enumerate(ranks):
    competitor = competitors[int(result)-1]
    print(f"Rank {index+1}: {competitor}")

Rank 1: phi4-mini:3.8b
Rank 2: llama3.2:1b
Rank 3: qwen2.5:0.5b
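
Small local models do not always return clean JSON: the reply may be wrapped in prose or code fences, or rank a competitor twice, and then `json.loads` or the index lookup above will fail. A slightly more defensive parser might look like this. It is a sketch, and `parse_ranking` is a hypothetical helper rather than part of the lab.

```python
import json


def parse_ranking(raw, competitor_count):
    """Parse the judge's reply into a list of 0-based competitor indexes."""
    # Keep only the outermost {...}; this drops code fences or stray prose
    start, end = raw.find("{"), raw.rfind("}")
    if start == -1 or end <= start:
        raise ValueError("no JSON object found in the judge's reply")
    ranks = [int(item) - 1 for item in json.loads(raw[start:end + 1])["results"]]
    # Every competitor must appear exactly once
    if sorted(ranks) != list(range(competitor_count)):
        raise ValueError(f"ranking is not a permutation of 1..{competitor_count}")
    return ranks


competitors = ["llama3.2:1b", "qwen2.5:0.5b", "phi4-mini:3.8b"]
reply = 'Here is my ranking: {"results": ["3", "1", "2"]} Hope that helps!'
for position, index in enumerate(parse_ranking(reply, len(competitors)), start=1):
    print(f"Rank {position}: {competitors[index]}")
```

Raising `ValueError` on a malformed ranking makes it easy to catch the failure and ask the judge again.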


<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/exercise.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Exercise</h2>
            <span style="color:#ff7800;">Which pattern(s) did this use? Try updating this to add another Agentic design pattern.
            </span>
        </td>
    </tr>
</table>

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/business.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#00bfff;">Commercial implications</h2>
            <span style="color:#00bfff;">These kinds of patterns, where you send a task to multiple models and evaluate the results,
            are common where you need to improve the quality of your LLM response. This approach can be applied
            to most business projects where accuracy is critical.
            </span>
        </td>
    </tr>
</table>