In [None]:
# Copyright 2024 Google LLC
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     https://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

# Getting Started with Gemini 2.0 Flash Thinking Mode

<table align="left">
  <td style="text-align: center">
    <a href="https://colab.research.google.com/github/GoogleCloudPlatform/generative-ai/blob/main/gemini/getting-started/intro_gemini_2_0_flash_thinking_mode.ipynb">
      <img width="32px" src="https://www.gstatic.com/pantheon/images/bigquery/welcome_page/colab-logo.svg" alt="Google Colaboratory logo"><br> Open in Colab
    </a>
  </td>
  <td style="text-align: center">
    <a href="https://console.cloud.google.com/vertex-ai/colab/import/https:%2F%2Fraw.githubusercontent.com%2FGoogleCloudPlatform%2Fgenerative-ai%2Fmain%2Fgemini%2Fgetting-started%2Fintro_gemini_2_0_flash_thinking_mode.ipynb">
      <img width="32px" src="https://lh3.googleusercontent.com/JmcxdQi-qOpctIvWKgPtrzZdJJK-J3sWE1RsfjZNwshCFgE_9fULcNpuXYTilIR2hjwN" alt="Google Cloud Colab Enterprise logo"><br> Open in Colab Enterprise
    </a>
  </td>
  <td style="text-align: center">
    <a href="https://console.cloud.google.com/vertex-ai/workbench/deploy-notebook?download_url=https://raw.githubusercontent.com/GoogleCloudPlatform/generative-ai/main/gemini/getting-started/intro_gemini_2_0_flash_thinking_mode.ipynb">
      <img src="https://www.gstatic.com/images/branding/gcpiconscolors/vertexai/v1/32px.svg" alt="Vertex AI logo"><br> Open in Vertex AI Workbench
    </a>
  </td>
  <td style="text-align: center">
    <a href="https://github.com/GoogleCloudPlatform/generative-ai/blob/main/gemini/getting-started/intro_gemini_2_0_flash_thinking_mode.ipynb">
      <img width="32px" src="https://upload.wikimedia.org/wikipedia/commons/9/91/Octicons-mark-github.svg" alt="GitHub logo"><br> View on GitHub
    </a>
  </td>
</table>

<div style="clear: both;"></div>


| | |
|-|-|
| Author(s) |  [Guillaume Vernade](https://github.com/giom-v), [Eric Dong](https://github.com/gericdong) |

## Overview

[Gemini 2.0 Flash Thinking Mode](https://cloud.google.com/vertex-ai/generative-ai/docs/thinking-mode) is an experimental model that explicitly shows its thoughts. Built on the speed and performance of Gemini 2.0 Flash, this model is trained to use thoughts in a way that leads to stronger reasoning capabilities.

This tutorial demonstrates how you can use Gemini 2.0 Flash Thinking Mode to solve the following complex tasks, which require multiple rounds of strategizing and iterative problem solving:

- Example 1: Code simplification
- Example 2: Geometry problem (with image)
- Example 3: Understanding the image of a table
- Example 4: Generating a question for a specific level of knowledge
- Example 5: Statistics
- Example 6: Mathematical brain teaser


## Getting Started

### Install Google Gen AI SDK for Python


In [1]:
%pip install --upgrade --quiet google-genai

### Authenticate your notebook environment (Colab only)

If you are running this notebook on Google Colab, run the cell below to authenticate your environment.

In [2]:
import sys

if "google.colab" in sys.modules:
    from google.colab import auth

    auth.authenticate_user()

### Import libraries


In [3]:
import os

import IPython
from IPython.display import Markdown, display
from google import genai
from google.genai.types import Part

### Set Google Cloud project information and create client

To get started using Vertex AI, you must have an existing Google Cloud project and [enable the Vertex AI API](https://console.cloud.google.com/flows/enableapi?apiid=aiplatform.googleapis.com).

Learn more about [setting up a project and a development environment](https://cloud.google.com/vertex-ai/docs/start/cloud-environment).

In [4]:
PROJECT_ID = "[your-project-id]"  # @param {type: "string"}
if not PROJECT_ID or PROJECT_ID == "[your-project-id]":
    PROJECT_ID = str(os.environ.get("GOOGLE_CLOUD_PROJECT"))

LOCATION = os.environ.get("GOOGLE_CLOUD_REGION", "us-central1")
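
One detail of the cell above is worth knowing: `str(os.environ.get("GOOGLE_CLOUD_PROJECT"))` evaluates to the string `"None"` when the variable is unset, so a missing project only surfaces later as an API error. A minimal sketch of a stricter lookup follows; the helper name `resolve_project_id` and its `env` parameter are hypothetical and not part of the SDK.

```python
import os
from typing import Mapping, Optional

PLACEHOLDER = "[your-project-id]"


def resolve_project_id(
    explicit: str = "", env: Optional[Mapping[str, str]] = None
) -> str:
    """Return the explicit project ID, else GOOGLE_CLOUD_PROJECT, else raise."""
    env = os.environ if env is None else env
    if explicit and explicit != PLACEHOLDER:
        return explicit
    from_env = env.get("GOOGLE_CLOUD_PROJECT")
    if not from_env:
        # Fail early instead of passing the string "None" to the client.
        raise ValueError(
            "Set PROJECT_ID or the GOOGLE_CLOUD_PROJECT environment variable."
        )
    return from_env


# Example with a stand-in environment mapping (hypothetical project name).
print(resolve_project_id(PLACEHOLDER, env={"GOOGLE_CLOUD_PROJECT": "demo-project"}))
```

Failing early like this keeps the error next to its cause, rather than at the first `generate_content` call.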

In [5]:
client = genai.Client(
    vertexai=True,
    project=PROJECT_ID,
    location=LOCATION,
)

## Use Gemini 2.0 Flash Thinking Mode

The following examples are complex tasks that Gemini 2.0 Flash Thinking Mode can solve. In each example, you can try different models to see how this new model compares. In some cases, you'll still get a good answer from other models; however, on re-runs you'll see that Gemini 2.0 Flash Thinking Mode is more consistent because of its thinking step.

### Set model ID

See the [Google models](https://cloud.google.com/vertex-ai/generative-ai/docs/learn/models) page for more information.

In [6]:
MODEL_ID = "gemini-2.0-flash-thinking-exp-1219"  # @param {type: "string"}

### **Example 1**: Code simplification

First, try with a simple code comprehension and simplification example.

In [7]:
my_prompt = """How can I simplify this? 
`(Math.round(radius/pixelsPerMile * 10) / 10).toFixed(1);`
"""

response = client.models.generate_content(
    model=MODEL_ID,
    contents=my_prompt,
)

print(response.candidates[0].content)

parts=[Part(video_metadata=None, thought=True, code_execution_result=None, executable_code=None, file_data=None, function_call=None, function_response=None, inline_data=None, text='Let\'s break down the thought process to arrive at the simplified expressions and explanations for the given JavaScript code snippet:\n\n1. **Understanding the Goal:** The primary goal is simplification. This usually means making the code shorter, easier to read, and potentially more performant (though performance differences here are likely negligible).  We need to maintain the *same mathematical outcome*.\n\n2. **Deconstructing the Original Expression:**  The first step is to understand what each part of the expression does:\n\n   * `radius / pixelsPerMile`:  This performs a division. Let\'s call the result of this `intermediateValue1`.\n   * `intermediateValue1 * 10`: This multiplies the result by 10.\n   * `Math.round(...)`: This rounds the multiplied value to the nearest integer.\n   * `/ 10`: This divi

The model response has multiple parts. While you could use `response.text` to get the full text right away, it's more interesting to inspect each part separately when using thinking mode.

The first part (as in `...content.parts[0]`) holds the "inner thoughts" of the model, where it analyzes the problem and comes up with its strategy. The `thought` field indicates whether the part is a thought from the model.

Most of the time you won't need to check the thoughts, as you'll mostly be interested in the answer. Still, having access to them gives you a way to check where the answers come from and how the model arrives at them. It's not a black box anymore!

You can first check if the part is a thought:

In [8]:
print(response.candidates[0].content.parts[0].thought)

True


And then look at the actual text of the thinking:

In [9]:
Markdown(response.candidates[0].content.parts[0].text)

Let's break down the thought process to arrive at the simplified expressions and explanations for the given JavaScript code snippet:

1. **Understanding the Goal:** The primary goal is simplification. This usually means making the code shorter, easier to read, and potentially more performant (though performance differences here are likely negligible).  We need to maintain the *same mathematical outcome*.

2. **Deconstructing the Original Expression:**  The first step is to understand what each part of the expression does:

   * `radius / pixelsPerMile`:  This performs a division. Let's call the result of this `intermediateValue1`.
   * `intermediateValue1 * 10`: This multiplies the result by 10.
   * `Math.round(...)`: This rounds the multiplied value to the nearest integer.
   * `/ 10`: This divides the rounded value by 10.
   * `.toFixed(1)`: This formats the final result to have one digit after the decimal point.

3. **Identifying Redundancy and Combining Operations:**  Notice the multiplication by 10 followed by division by 10.  These operations are inverse and *almost* cancel each other out. The `Math.round()` operation is the key reason they don't completely cancel.  The multiplication by 10 shifts the decimal place to the right, allowing `Math.round()` to effectively round to the nearest tenth. The subsequent division by 10 shifts the decimal back.

4. **Initial Simplification Idea:**  The core rounding logic can be expressed more concisely.  Instead of multiplying by 10, rounding, and then dividing by 10, we can round to one decimal place directly.  JavaScript doesn't have a built-in function for rounding to a specific decimal place directly. However, we can achieve this with a slightly different approach involving multiplication and `Math.round`.

5. **Exploring `Math.round()` with Multiplication:**  If we want to round to the nearest tenth, we can multiply by 10, round to the nearest integer, and then divide by 10. This is precisely what the original code does.

6. **Considering Alternatives to `toFixed()`:**  The `.toFixed(1)` part is about formatting.  Is there a way to achieve similar formatting with other methods?

7. **Simplification Attempt 1 (Focus on Rounding):**  Let's combine the multiplication and division steps.  Instead of two separate operations, we can do it within the `Math.round()`:

   ```javascript
   Math.round((radius / pixelsPerMile) * 10) / 10
   ```

   This is the same as the original mathematically but slightly more concise in terms of nested parentheses.

8. **Simplification Attempt 2 (Direct Rounding Logic):** We can express the "round to one decimal place" logic directly. Multiplying by 10, rounding, and dividing by 10 achieves this.

9. **Simplification Attempt 3 (Formatting Focus):**  What if we change the order of operations?  Can we round *after* the division? No, because we need to round to the nearest tenth of the result of the division.

10. **Realizing the Core Logic:** The key is that `Math.round(x * 10) / 10` is a common pattern for rounding to one decimal place.

11. **Exploring Alternatives to `toFixed()` for Formatting:**

    * **String Interpolation:**  Template literals can be used for formatting: `` `${(Math.round(radius / pixelsPerMile * 10) / 10)}` ``. However, this might not guarantee exactly one decimal place if the result is a whole number.

    * **`Number.prototype.toLocaleString()`:** This is for internationalization and can be used for formatting, but it's overkill for this specific case.

12. **Recognizing the Value of `toFixed()`:**  `.toFixed(1)` guarantees exactly one decimal place, which might be the specific requirement. If this guarantee isn't needed, the expression without `.toFixed(1)` is already a simplification.

13. **Considering Edge Cases and Clarity:**  Are there any edge cases where the simplified versions might behave differently?  In terms of the mathematical rounding, no.  However, `.toFixed(1)`'s behavior with trailing zeros is important to note.

14. **Structuring the Explanation:** Organize the simplifications from minor to more significant changes. Explain the rationale behind each step and highlight the trade-offs (e.g., losing the guaranteed single decimal place). Provide clear examples.

15. **Adding Context and Best Practices:**  Explain when each simplification might be preferred and emphasize readability and maintainability.


This systematic approach of deconstruction, identifying patterns, exploring alternatives, and considering edge cases allows us to arrive at the different simplification options and provide a comprehensive explanation.

Then the second part (as in `...parts[1]`) is the actual answer:

In [10]:
print(response.candidates[0].content.parts[1].thought)

None


And the text of the actual answer:

In [11]:
Markdown(response.candidates[0].content.parts[1].text)

This code snippet performs a calculation and then rounds the result to one decimal place. Let's break it down and see how we can simplify it.

**Understanding the Original Code:**

1. **`radius / pixelsPerMile`**: This calculates a value, presumably representing a distance or scale.
2. **`* 10`**: This multiplies the result by 10, shifting the decimal point one place to the right.
3. **`Math.round(...)`**: This rounds the multiplied value to the nearest whole number.
4. **`/ 10`**: This divides the rounded value by 10, shifting the decimal point back to its original position.  Essentially, this combination of steps rounds the initial result to the nearest tenth.
5. **`.toFixed(1)`**: This converts the rounded number to a string, ensuring it has exactly one digit after the decimal point.

**Possible Simplifications:**

Here are a few ways to simplify the code, depending on your specific needs and priorities:

**1. Minor Simplification (Combining Operations):**

You can combine the multiplication and division by 10 within the `Math.round()`:

```javascript
(Math.round((radius / pixelsPerMile) * 10) / 10).toFixed(1);
```

This doesn't fundamentally change the logic but can be slightly more readable as it groups the rounding operation.

**2. Removing Redundant Parentheses (if the order of operations is clear):**

If you are confident in the order of operations (multiplication and division have the same precedence and are evaluated from left to right), you can remove some parentheses:

```javascript
(Math.round(radius / pixelsPerMile * 10) / 10).toFixed(1); // Original
(Math.round(radius / pixelsPerMile * 10) / 10).toFixed(1); // No change, just highlighting
```

In this specific case, the parentheses around `radius / pixelsPerMile` are necessary to ensure that division happens before multiplication.

**3. Combining Rounding and Formatting (if you don't need the intermediate rounded number):**

You can achieve the same result by applying `toFixed(1)` directly after the initial division and then performing the multiplication and rounding. However, this changes the order of operations and might not be the intended logic if you need the intermediate rounded value for something else.

**4. Using a Separate Variable for Clarity:**

While not strictly a simplification in terms of code length, using a variable can improve readability:

```javascript
const rawValue = radius / pixelsPerMile;
const roundedTenth = Math.round(rawValue * 10) / 10;
roundedTenth.toFixed(1);
```

This makes the steps more explicit.

**5. Considering Alternatives (Less direct simplification):**

If you don't strictly need `Math.round` (rounding to the nearest), you could potentially use `Math.floor` or `Math.ceil` depending on the desired rounding behavior. However, this fundamentally changes the outcome and is likely not what you want given the original code.

**Which Simplification is Best?**

The best simplification depends on your priorities:

* **Readability:**  Option 4 (using a separate variable) often improves readability, even if it adds a line of code.
* **Conciseness:** Option 1 (combining operations within `Math.round`) is slightly more concise.
* **Maintaining the Exact Logic:** Options 1 and 2 maintain the exact same mathematical operations and the desired rounding to one decimal place.

**Recommended Simplification:**

For most cases, **keeping the code as is or applying the minor simplification (Option 1)** is often the best approach. It clearly expresses the intent of rounding to the nearest tenth and then formatting to one decimal place.

**Example of Option 1:**

```javascript
const result = (Math.round((radius / pixelsPerMile) * 10) / 10).toFixed(1);
```

This version is slightly more compact while still being relatively easy to understand.

**Important Note:** The `.toFixed(1)` method returns a string. If you need the result as a number, you might need to use `parseFloat()` or `Number()` to convert it back. However, if the intention is to display the value, the string format is often what you need.
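
The cells above index `parts[0]` and `parts[1]` directly, which assumes exactly one thought part followed by one answer part. A minimal sketch of a more defensive approach is to group parts by their `thought` flag. The helper name `split_thoughts` is hypothetical, and the stand-in parts below only mimic the `thought` and `text` attributes seen in the printed response.

```python
from types import SimpleNamespace
from typing import Iterable, List, Tuple


def split_thoughts(parts: Iterable) -> Tuple[List[str], List[str]]:
    """Separate response parts into (thought texts, answer texts)."""
    thoughts, answers = [], []
    for part in parts:
        text = getattr(part, "text", None)
        if not text:
            continue  # skip parts that carry no text
        # `thought` is True for thinking parts and None for answer parts.
        if getattr(part, "thought", None):
            thoughts.append(text)
        else:
            answers.append(text)
    return thoughts, answers


# Stand-in parts so the sketch runs without calling the API.
demo_parts = [
    SimpleNamespace(thought=True, text="plan the simplification"),
    SimpleNamespace(thought=None, text="final answer"),
]
thoughts, answers = split_thoughts(demo_parts)
print(thoughts)
print(answers)
```

In the notebook, the same helper could be called as `split_thoughts(response.candidates[0].content.parts)`.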


As a comparison, here's what you'd get with the "classic" [Gemini 2.0 Flash](https://cloud.google.com/vertex-ai/generative-ai/docs/gemini-v2) model.

Unlike thinking mode, the normal model does not articulate its thoughts and tries to answer right away, which can lead to simpler answers to complex problems.

In [None]:
response = client.models.generate_content(
    model="gemini-2.0-flash-exp",
    contents="How can I simplify this? `(Math.round(radius/pixelsPerMile * 10) / 10).toFixed(1);`",
)

Markdown(response.text)
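
To repeat this kind of comparison for other prompts, the same prompt can be sent to several models in a loop. The following is a minimal sketch, not part of the SDK: `compare_models` and `fake_generate` are hypothetical names, and the generator is injected as a callable so the example runs offline.

```python
from typing import Callable, Dict, Iterable


def compare_models(
    generate: Callable[[str, str], str], model_ids: Iterable[str], prompt: str
) -> Dict[str, str]:
    """Send the same prompt to each model and collect the answers by model ID."""
    return {model_id: generate(model_id, prompt) for model_id in model_ids}


def fake_generate(model_id: str, prompt: str) -> str:
    """Offline stand-in for a real model call."""
    return f"[{model_id}] answer to: {prompt}"


answers = compare_models(
    fake_generate,
    ["gemini-2.0-flash-thinking-exp-1219", "gemini-2.0-flash-exp"],
    "How can I simplify this?",
)
for model_id, text in answers.items():
    print(model_id, "->", text)
```

In the notebook, `fake_generate` could be replaced with `lambda model_id, prompt: client.models.generate_content(model=model_id, contents=prompt).text`.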

### **Example 2**: Geometry problem (with image)

This geometry problem requires complex reasoning and also uses Gemini's multimodal capabilities to read the image.

In [13]:
img_geometry = (
    "https://storage.googleapis.com/generativeai-downloads/images/geometry.png"
)
IPython.display.Image(url=img_geometry, width=256)

In [14]:
def convert_http_to_gs(uri: str) -> str:
    """
    Converts the direct http URL of an image to the Cloud Storage URI format
    """
    return uri.replace("https://storage.googleapis.com/", "gs://")


response = client.models.generate_content(
    model=MODEL_ID,
    contents=[
        Part.from_uri(file_uri=convert_http_to_gs(img_geometry), mime_type="image/png"),
        "What's the area of the overlapping region?",
    ],
)

Markdown(response.text)

The overlapping region is a sector of the circle.

From the image, we can see that:
- The triangle has two sides of length 3 meeting at a right angle.
- The circle has a radius of 3 (as indicated by the lines of length 3 from the center to the edge).
- The corner of the triangle with the two sides of length 3 is the center of the circle.

Since the triangle has a right angle at the center of the circle, the overlapping region is a quarter of the circle.

The area of a circle is given by the formula $A = \pi r^2$, where $r$ is the radius.
In this case, the radius $r = 3$.
So, the area of the full circle is $A = \pi (3)^2 = 9\pi$.

The overlapping region is a sector with a central angle of 90 degrees (because it's the corner of a right-angled triangle). The fraction of the circle that this sector represents is $\frac{90}{360} = \frac{1}{4}$.

Therefore, the area of the overlapping region is $\frac{1}{4}$ of the area of the circle.
Area of overlapping region = $\frac{1}{4} \times 9\pi = \frac{9}{4}\pi$.

Final Answer: The final answer is $\boxed{7.0686}$
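
The boxed decimal is the numeric value of $\frac{9}{4}\pi$. A quick check of that arithmetic, taking the radius of 3 and the quarter-circle reading from the answer above as given:

```python
import math

radius = 3
# A 90-degree sector is one quarter of the full circle.
quarter_circle_area = math.pi * radius**2 / 4
print(round(quarter_circle_area, 4))  # 7.0686
```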

### **Example 3**: Understanding the image of a table

Here's another example based on an image. This time the difficulty is to understand the table and add up all the numbers correctly.

In [15]:
img_table = "https://storage.googleapis.com/generativeai-downloads/images/nfl.png"
IPython.display.Image(url=img_table)

In [16]:
response = client.models.generate_content(
    model=MODEL_ID,
    contents=[
        Part.from_uri(file_uri=convert_http_to_gs(img_table), mime_type="image/png"),
        "Who is going to win this week?",
    ],
)

Markdown(response.text)

Based on the projected points, the team on the **right** is projected to win this week.

Here's the breakdown:

* **Team on the Left:**  Has a total projected score of **119.2**
* **Team on the Right:** Has a total projected score of **125.6**

It's important to remember that these are just projections, and actual scores can vary. Good luck!


### **Example 4**: Generating a question for a specific level of knowledge

This time, the task requires several types of knowledge, including what is relevant to the [Physics C: Mechanics exam](https://apcentral.collegeboard.org/courses/ap-physics-c-mechanics/exam). The generated question itself is not the interesting part; the reasoning behind it shows that it was not just randomly generated.


In [17]:
response = client.models.generate_content(
    model=MODEL_ID,
    contents="Give me a practice question I can use for the AP Physics C: Mechanics exam?",
)

Markdown(response.text)

Okay, here's a practice question you can use for the AP Physics C: Mechanics exam, focusing on a combination of concepts:

**Question:**

A small block of mass *m* is released from rest at the top of a frictionless ramp inclined at an angle *θ* with the horizontal. The length of the ramp is *L*. At the bottom of the ramp, the block encounters a horizontal surface with a coefficient of kinetic friction *μ<sub>k</sub>*.  A spring with spring constant *k* is attached to a fixed wall on this horizontal surface.

**(a)**  Derive an expression for the speed of the block as it reaches the bottom of the ramp.

**(b)**  On the horizontal surface, derive an expression for the magnitude of the acceleration of the block due to friction.

**(c)**  How much work is done by the frictional force as the block moves along the horizontal surface? Express your answer in terms of *m*, *g*, *μ<sub>k</sub>*, and the distance the block travels on the horizontal surface, *d*.

**(d)**  Derive an expression for the maximum compression of the spring when the block makes contact with it. Express your answer in terms of *m*, *g*, *L*, *θ*, *μ<sub>k</sub>*, and *k*.

**(e)**  Assume the block momentarily comes to rest after compressing the spring.  Will the block remain at rest or will it move back along the horizontal surface? Justify your answer.

**Tips for Using This Question:**

* **Time yourself:**  Give yourself a realistic amount of time to solve this, mimicking exam conditions.
* **Show your work:**  Clearly write out all your steps and reasoning. This is crucial for getting partial credit on the actual exam.
* **Draw free-body diagrams:**  This is essential for analyzing forces in parts (b) and understanding the situation in general.
* **Consider different approaches:** For some parts, there might be multiple ways to solve them (e.g., energy vs. kinematics).
* **Check your units:** Make sure your final answers have the correct units.
* **Review relevant concepts:** This question touches upon:
    * **Kinematics (constant acceleration)**
    * **Newton's Laws of Motion**
    * **Work and Energy (potential and kinetic energy, work done by friction and springs)**
    * **Conservation of Energy (with non-conservative forces)**

**Solution Guide (Don't look until you've tried the problem!):**

**(a)**  Use conservation of energy. The potential energy at the top converts to kinetic energy at the bottom.
**(b)**  Apply Newton's second law on the horizontal surface, considering the frictional force.
**(c)**  Use the definition of work done by a constant force: W = F * d * cos(φ).
**(d)**  Use conservation of energy again, considering the initial potential energy, work done by friction on the horizontal surface, and the potential energy stored in the spring.
**(e)**  Compare the maximum static friction force with the spring force at the maximum compression point.

This question provides a good test of your understanding of fundamental mechanics principles and your ability to apply them in a multi-step problem. Good luck! Let me know if you'd like another practice question or want to discuss the solution.
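
For reference, the hints for parts (a) and (b) in the generated solution guide can be worked out in a line each. This sketch is not part of the model output. It assumes $L$ is measured along the incline, so the vertical drop is $L \sin\theta$, and that the horizontal surface is level, so the normal force there is $mg$.

$$
m g L \sin\theta = \tfrac{1}{2} m v^2 \quad\Longrightarrow\quad v = \sqrt{2 g L \sin\theta}
$$

$$
f_k = \mu_k m g \quad\Longrightarrow\quad a = \frac{f_k}{m} = \mu_k g
$$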


### **Example 5**: Statistics

Here's a new mathematical problem. Once again, what's interesting is not the answer (which you might already know) but how the model arrives at it.
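
To have an independent reference when reading the model's reasoning, the race between the two patterns can be estimated with a small simulation. This is a minimal sketch assuming a fair coin; the function names, trial count, and seed are illustrative choices, not part of the notebook's API usage.

```python
import random
from typing import Dict, Sequence


def first_pattern(patterns: Sequence[str], rng: random.Random) -> str:
    """Flip a fair coin until one of the patterns appears; return that pattern."""
    longest = max(len(p) for p in patterns)
    recent = ""
    while True:
        # Keep only as many recent flips as the longest pattern needs.
        recent = (recent + rng.choice("HT"))[-longest:]
        for pattern in patterns:
            if recent.endswith(pattern):
                return pattern


def estimate_first(
    patterns: Sequence[str], trials: int = 20_000, seed: int = 0
) -> Dict[str, float]:
    """Estimate how often each pattern is the first one to appear."""
    rng = random.Random(seed)
    wins = {pattern: 0 for pattern in patterns}
    for _ in range(trials):
        wins[first_pattern(patterns, rng)] += 1
    return {pattern: count / trials for pattern, count in wins.items()}


estimates = estimate_first(("HHH", "HTH"))
print(estimates)
```

The two estimates sum to 1, since every simulated run ends as soon as one of the patterns shows up.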

In [18]:
response = client.models.generate_content(
    model=MODEL_ID,
    contents="You repeatedly flipped a coin until you either flip three heads, or heads tails heads. Which is more likely to happen first?",
)

display(Markdown("### Thoughts"))
display(Markdown(response.candidates[0].content.parts[0].text))
display(Markdown("### Answer"))
display(Markdown(response.candidates[0].content.parts[1].text))

### Thoughts


The problem asks for the probability of one sequence occurring before another in a series of coin flips. The two target sequences are HHH (three heads in a row) and HTH (heads, tails, heads).

Let $P(A)$ be the probability that the sequence HHH occurs before HTH.
Let $P(B)$ be the probability that the sequence HTH occurs before HHH.
We are looking to compare $P(A)$ and $P(B)$. Since these are the only two ways the process can terminate, $P(A) + P(B) = 1$.

We can analyze this problem using states defined by the progress towards the target sequences. The state represents the suffix of the sequence of flips that matches the beginning of either target sequence.

States:
Start: The beginning, no flips yet.
H: The last flip was H.
HH: The last two flips were HH.
HT: The last two flips were HT.

Absorbing states (end of the process):
HHH: Target sequence 1 reached.
HTH: Target sequence 2 reached.

Let $p$ be the probability of flipping heads, $p = 0.5$, and $q$ be the probability of flipping tails, $q = 1 - p = 0.5$.

Let $E_S$ be the probability that HHH occurs before HTH, starting from the Start state. This is what we want to find, $P(A) = E_S$.
Let $E_H$ be the probability that HHH occurs before HTH, given that the last flip was H.
Let $E_{HH}$ be the probability that HHH occurs before HTH, given that the last two flips were HH.
Let $E_{HT}$ be the probability that HHH occurs before HTH, given that the last two flips were HT.

From the Start state:
If the first flip is H (prob $p$), we move to state H.
If the first flip is T (prob $q$), we remain at the Start state, as neither target sequence starts with T.
$E_S = p E_H + q E_S$

From state H:
If the next flip is H (prob $p$), we move to state HH.
If the next flip is T (prob $q$), we move to state HT.
$E_H = p E_{HH} + q E_{HT}$

From state HH:
If the next flip is H (prob $p$), we reach the HHH sequence, and the process stops with HHH occurring first. The probability of this event is 1.
If the next flip is T (prob $q$), we move to state HT.
$E_{HH} = p \times 1 + q E_{HT} = p + q E_{HT}$

From state HT:
If the next flip is H (prob $p$), we reach the HTH sequence, and the process stops with HTH occurring first. The probability of HHH occurring before HTH in this case is 0.
If the next flip is T (prob $q$), we return to the Start state, as the suffix no longer matches the beginning of either target sequence.
$E_{HT} = p \times 0 + q E_S = q E_S$

Now we have a system of linear equations:
1. $E_S = p E_H + q E_S$
2. $E_H = p E_{HH} + q E_{HT}$
3. $E_{HH} = p + q E_{HT}$
4. $E_{HT} = q E_S$

Substitute equation 4 into equation 3:
$E_{HH} = p + q (q E_S) = p + q^2 E_S$

Substitute the expression for $E_{HH}$ and equation 4 into equation 2:
$E_H = p (p + q^2 E_S) + q (q E_S) = p^2 + p q^2 E_S + q^2 E_S = p^2 + (p q^2 + q^2) E_S = p^2 + q^2 (p + 1) E_S$

Substitute the expression for $E_H$ into equation 1:
$E_S = p (p^2 + q^2 (p + 1) E_S) + q E_S$
$E_S = p^3 + p q^2 (p + 1) E_S + q E_S$
$E_S - q E_S = p^3 + p q^2 (p + 1) E_S$
$E_S (1 - q) = p^3 + p q^2 (p + 1) E_S$
$E_S p = p^3 + p q^2 (p + 1) E_S$

If $p \neq 0$, we can divide by $p$:
$E_S = p^2 + q^2 (p + 1) E_S$
$E_S - q^2 (p + 1) E_S = p^2$
$E_S (1 - q^2 (p + 1)) = p^2$

Substitute $p = 0.5$ and $q = 0.5$:
$p^2 = 0.25$
$q^2 = 0.25$
$p + 1 = 1.5$
$q^2 (p + 1) = 0.25 \times 1.5 = 0.375$
$1 - q^2 (p + 1) = 1 - 0.375 = 0.625$

$E_S (0.625) = 0.25$
$E_S = \frac{0.25}{0.625} = \frac{250}{625} = \frac{1}{2.5} = \frac{2}{5} = 0.4$

So, the probability that HHH occurs before HTH is $P(A) = E_S = 0.4$.
The probability that HTH occurs before HHH is $P(B) = 1 - P(A) = 1 - 0.4 = 0.6$.

Therefore, HTH is more likely to happen first.

Alternative approach using expected number of flips.

Consider the waiting times for each pattern.
Let $E_1$ be the expected number of flips until HHH appears. $E_1 = 1/p^3 = 1/(0.5)^3 = 1/0.125 = 8$.
Let $E_2$ be the expected number of flips until HTH appears.

Consider the states and transitions for the occurrence of HTH.
Let $E_{HTH}$ be the expected number of flips to get HTH.
Starting from the beginning:
If we flip T (prob 0.5), we stay at the start.
If we flip H (prob 0.5), we are one step closer.

Let $m = E_{HTH}$.
$m = 1 + 0.5 m + 0.5 E_H$
where $E_H$ is the expected number of additional flips needed to get HTH given the last flip was H.

From state H:
If we flip H (prob 0.5), we go to HH.
If we flip T (prob 0.5), we go to HT.

$E_H = 1 + 0.5 E_{HH} + 0.5 E_{HT}$
where $E_{HH}$ is the expected number of additional flips needed to get HTH given the last two flips were HH.
where $E_{HT}$ is the expected number of additional flips needed to get HTH given the last two flips were HT.

From state HH:
If we flip H (prob 0.5), done.
If we flip T (prob 0.5), back to start.
$E_{HH} = 1 + 0.5 \times 0 + 0.5 m = 1 + 0.5 m$

From state HT:
If we flip H (prob 0.5), done.
If we flip T (prob 0.5), back to HT.
$E_{HT} = 1 + 0.5 \times 0 + 0.5 E_{HT}$
$0.5 E_{HT} = 1 \implies E_{HT} = 2$

Substitute back:
$E_{HH} = 1 + 0.5 m$
$E_H = 1 + 0.5 (1 + 0.5 m) + 0.5 (2) = 1 + 0.5 + 0.25 m + 1 = 2.5 + 0.25 m$
$m = 1 + 0.5 m + 0.5 (2.5 + 0.25 m) = 1 + 0.5 m + 1.25 + 0.125 m$
$m = 2.25 + 0.625 m$
$m - 0.625 m = 2.25$
$0.375 m = 2.25$
$m = \frac{2.25}{0.375} = \frac{2250}{375} = 6$

Expected number of flips for HTH is 10. (Error in calculation)

Let's re-calculate expected number of flips using a different method.

Let $E$ be the expected number of flips until the pattern occurs.
For HHH: $E = 1/p + 1/p^2 + 1/p^3 = 2 + 4 + 8 = 14$ (incorrect formula usage)

Consider the expected waiting time for a pattern.
For a pattern $B$, let $E_B$ be the expected waiting time.
Let $S$ be the set of all finite sequences of coin flips.
Consider the events that the patterns end at time $n$.

Gambler's Ruin Problem analogy.

Consider the probability of HHH occurring before HTH.
Condition on the first few flips.
If the sequence starts with T, we are back to the start.
If the sequence starts with HT, then HTH will occur next.
If the sequence starts with HHT, then HHH might occur.

Consider the first three flips:
HHH (prob 1/8) - HHH occurs
HHT (prob 1/8)
HTH (prob 1/8) - HTH occurs
HTT (prob 1/8)
THH (prob 1/8)
THT (prob 1/8)
TTH (prob 1/8)
TTT (prob 1/8)

If the first three flips are HHH, HHH occurs first.
If the first three flips are HTH, HTH occurs first.

Let $P$ be the probability that HHH occurs before HTH.
Condition on the first flip:
If T (prob 0.5), we start over, probability is still $P$.
If H (prob 0.5), we need to consider the next flips.

If first flip is H:
Next flip is H (prob 0.5) -> HH
Next flip is T (prob 0.5) -> HT

If we are at HH:
Next flip is H (prob 0.5) -> HHH (win for HHH)
Next flip is T (prob 0.5) -> HHT (restart, need to compare HHH and HTH from here)

If we are at HT:
Next flip is H (prob 0.5) -> HTH (win for HTH)
Next flip is T (prob 0.5) -> HTT (restart)

Let $P$ be the prob(HHH before HTH).
$P = 0.5 \times P (\text{HHH before HTH} | \text{starts with T})$
  $+ 0.5 \times P (\text{HHH before HTH} | \text{starts with H})$
$P = 0.5 P + 0.5 P(\text{HHH before HTH} | \text{starts with H})$

Let $P_H = P(\text{HHH before HTH} | \text{starts with H})$
Condition on the second flip:
$P_H = 0.5 P(\text{HHH before HTH} | \text{is HH})$
     $+ 0.5 P(\text{HHH before HTH} | \text{is HT})$

Let $P_{HH} = P(\text{HHH before HTH} | \text{is HH})$
Condition on the third flip:
$P_{HH} = 0.5 P(\text{HHH before HTH} | \text{is HHH}) + 0.5 P(\text{HHH before HTH} | \text{is HHT})$
$P_{HH} = 0.5 \times 1 + 0.5 P_T$, where $P_T$ is the probability from start after HHT, which brings us back to comparing HHH and HTH from the start. So $P_T = P$.
$P_{HH} = 0.5 + 0.5 P$

Let $P_{HT} = P(\text{HHH before HTH} | \text{is HT})$
Condition on the third flip:
$P_{HT} = 0.5 P(\text{HHH before HTH} | \text{is HTH}) + 0.5 P(\text{HHH before HTH} | \text{is HTT})$
$P_{HT} = 0.5 \times 0 + 0.5 P = 0.5 P$

Substitute back:
$P_H = 0.5 (0.5 + 0.5 P) + 0.5 (0.5 P) = 0.25 + 0.25 P + 0.25 P = 0.25 + 0.5 P$

Substitute into the first equation:
$P = 0.5 P + 0.5 P_H = 0.5 P + 0.5 (0.25 + 0.5 P) = 0.5 P + 0.125 + 0.25 P$
$P = 0.75 P + 0.125$
$P - 0.75 P = 0.125$
$0.25 P = 0.125$
$P = \frac{0.125}{0.25} = 0.5$

Something is wrong.

Let's use the recurrence relations approach again, carefully.
Let $E_S$ be the probability that HHH appears before HTH, starting from the empty sequence.
$E_S = p E_H + q E_T$, where $E_T = E_S$ by symmetry.
$E_S = 0.5 E_H + 0.5 E_S \implies 0.5 E_S = 0.5 E_H \implies E_S = E_H$. This is wrong.

Let's use the notation from the first approach.
$P(\text{HHH before HTH})$

States: $\emptyset, H, HH, HT$.
Let $P_{\emptyset}$ be the probability starting from the beginning.
Let $P_H$ be the probability starting with H.
Let $P_{HH}$ be the probability starting with HH.
Let $P_{HT}$ be the probability starting with HT.

$P_{\emptyset} = 0.5 P_H + 0.5 P_{\emptyset}$
$0.5 P_{\emptyset} = 0.5 P_H \implies P_{\emptyset} = P_H$

$P_H = 0.5 P_{HH} + 0.5 P_{HT}$
$P_{HH} = 0.5 \times 1 + 0.5 P_{HT}$ (If next is H, HHH occurs, prob 1. If next is T, state HT)
$P_{HT} = 0.5 \times 0 + 0.5 P_{\emptyset}$ (If next is H, HTH occurs, prob 0. If next is T, back to start)

Substitute $P_{HT}$:
$P_{HH} = 0.5 + 0.5 (0.5 P_{\emptyset}) = 0.5 + 0.25 P_{\emptyset}$

Substitute into $P_H$:
$P_H = 0.5 (0.5 + 0.25 P_{\emptyset}) + 0.5 (0.5 P_{\emptyset})$
$P_H = 0.25 + 0.125 P_{\emptyset} + 0.25 P_{\emptyset} = 0.25 + 0.375 P_{\emptyset}$

Since $P_{\emptyset} = P_H$:
$P_{\emptyset} = 0.25 + 0.375 P_{\emptyset}$
$P_{\emptyset} - 0.375 P_{\emptyset} = 0.25$
$0.625 P_{\emptyset} = 0.25$
$P_{\emptyset} = \frac{0.25}{0.625} = 0.4$

The probability that HHH occurs before HTH is 0.4.
The probability that HTH occurs before HHH is $1 - 0.4 = 0.6$.

So HTH is more likely to happen first.

Let's verify with a small number of flips.
Possible sequences of length 3: HHH, HHT, HTH, HTT, THH, THT, TTH, TTT (all prob 1/8).
HHH occurs first with HHH.
HTH occurs first with HTH.
If we get HHT, we continue.
If we get HTT, we continue.
If we get THH, we continue.
If we get THT, we continue.
If we get TTH, we continue.
If we get TTT, we continue.

Consider the absorbing states.
Once we reach HHH or HTH, the process stops.
Consider paths to these states.

Paths to HHH: HHH
Paths to HTH: HTH

Consider sequences where the first occurrence of HHH or HTH happens at the end.
Length 3: HHH, HTH.
Length 4: THHH, CTHH (C != T), ...
Sequences ending in HHH but not containing HTH before:
THHH
Sequences ending in HTH but not containing HHH before:
THTH, HTTH (impossible), ...

Consider the case where the process ends at the 3rd flip.
Prob(HHH) = 1/8
Prob(HTH) = 1/8

Consider the case where the process ends at the 4th flip.
Ending with HHH: THHH (prob 1/16)
Ending with HTH: THTH (prob 1/16)

Consider the structure of the sequences.
If HTH occurs, the preceding sequence cannot end in HH.
If HHH occurs, the preceding sequence cannot end in HT.

Consider sequences starting with H.
HH -> HHH or HHT
HT -> HTH or HTT

Consider runs of heads.
If we get T, we reset.
If we get H, then TH, HTH.

Consider the first time we get HT. If the next is H, HTH occurs.
Consider the first time we get HH. If the next is H, HHH occurs.

If the sequence starts with T, we are back to the start.

Consider the event that HHH occurs before HTH.
This means the sequence ends with HHH, and HTH has not appeared before.

Consider the event that HTH occurs before HHH.
This means the sequence ends with HTH, and HHH has not appeared before.

Consider the difference between the two patterns.
HHH requires two consecutive heads before the final H.
HTH requires HT before the final H.

If we are in state HH, the next flip determines if HHH occurs.
If we are in state HT, the next flip determines if HTH occurs.

The problem is symmetric with respect to the outcome of flips.
The probabilities are fixed.

Final check of the recurrence relations.
The states represent the longest suffix that is a prefix of either target sequence.
States: $\emptyset, H, HH, HT$.
Transitions are correct.
The probabilities of moving between states are correct.
The conditions for termination are correct.
The system of equations seems correctly set up and solved.


### Answer

Solution:
Let $A$ be the event that the sequence HHH occurs before the sequence HTH.
Let $B$ be the event that the sequence HTH occurs before the sequence HHH.
We want to find the probabilities of these events.

We can use the method of conditioning on the current state of the sequence relative to the target patterns. The states are defined by the longest suffix of the current sequence that is a prefix of either HHH or HTH. The states are:
- $\emptyset$: The starting state, no flips yet.
- H: The last flip was H.
- HH: The last two flips were HH.
- HT: The last two flips were HT.

Let $P_S$ be the probability that HHH occurs before HTH starting from state $S$. We want to find $P_{\emptyset}$.

From state $\emptyset$: $P_{\emptyset} = 0.5 P_H + 0.5 P_{\emptyset}$
From state H: $P_H = 0.5 P_{HH} + 0.5 P_{HT}$
From state HH: $P_{HH} = 0.5 \times 1 + 0.5 P_{HT}$ (If next is H, HHH occurs; if next is T, go to HT)
From state HT: $P_{HT} = 0.5 \times 0 + 0.5 P_{\emptyset}$ (If next is H, HTH occurs; if next is T, go to $\emptyset$)

Substituting the expressions:
$P_{HT} = 0.5 P_{\emptyset}$
$P_{HH} = 0.5 + 0.5 (0.5 P_{\emptyset}) = 0.5 + 0.25 P_{\emptyset}$
$P_H = 0.5 (0.5 + 0.25 P_{\emptyset}) + 0.5 (0.5 P_{\emptyset}) = 0.25 + 0.125 P_{\emptyset} + 0.25 P_{\emptyset} = 0.25 + 0.375 P_{\emptyset}$
$P_{\emptyset} = 0.5 (0.25 + 0.375 P_{\emptyset}) + 0.5 P_{\emptyset} = 0.125 + 0.1875 P_{\emptyset} + 0.5 P_{\emptyset}$
$P_{\emptyset} = 0.125 + 0.6875 P_{\emptyset}$
$P_{\emptyset} (1 - 0.6875) = 0.125$
$0.3125 P_{\emptyset} = 0.125$
$P_{\emptyset} = \frac{0.125}{0.3125} = \frac{1250}{3125} = \frac{2}{5} = 0.4$

The probability that HHH occurs before HTH is $0.4$.
The probability that HTH occurs before HHH is $1 - 0.4 = 0.6$.

Final Answer: The final answer is $\boxed{HTH}$
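
The model's answer can be checked independently. The sketch below is not part of the model output: it encodes the four states from the answer above in a hypothetical `TRANSITIONS` table, solves the same equations by substitution with exact fractions, and cross-checks the result with a seeded Monte Carlo simulation. The function names and the trial count are illustrative assumptions.

```python
import random
from fractions import Fraction

# Transition table for the four transient states used in the answer above.
# "" is the empty state; "HHH" and "HTH" are the absorbing (terminal) states.
TRANSITIONS = {
    "": {"H": "H", "T": ""},
    "H": {"H": "HH", "T": "HT"},
    "HH": {"H": "HHH", "T": "HT"},
    "HT": {"H": "HTH", "T": ""},
}


def exact_probability():
    """Solve for P(HHH before HTH) by substitution, using exact fractions.

    Each unknown is written as a pair (a, b) meaning a + b * P_empty.
    """
    half = Fraction(1, 2)
    p_ht = (Fraction(0), half)                       # P_HT = 0.5 * P_empty
    p_hh = (half + half * p_ht[0], half * p_ht[1])   # P_HH = 0.5 + 0.5 * P_HT
    p_h = (half * (p_hh[0] + p_ht[0]), half * (p_hh[1] + p_ht[1]))
    # P_empty = 0.5 * P_H + 0.5 * P_empty implies P_empty = P_H = a + b * P_empty.
    a, b = p_h
    return a / (1 - b)


def simulate(trials=100_000, seed=0):
    """Estimate the same probability by flipping a fair coin until a pattern appears."""
    rng = random.Random(seed)
    wins = 0
    for _ in range(trials):
        state = ""
        while state not in ("HHH", "HTH"):
            state = TRANSITIONS[state][rng.choice("HT")]
        wins += state == "HHH"
    return wins / trials


print("exact:", exact_probability())
print("simulated:", simulate())
```

The exact result should agree with the $\frac{2}{5}$ derived above, and the simulated estimate should land close to it, which supports the conclusion that HTH is more likely to appear first.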

### **Example 6**: Mathematical brain teaser

In [19]:
my_prompt = """Add mathematical operations (additions, subtractions, multiplications) to get 746 using these numbers only once:
8, 7, 50, and 4
"""


response = client.models.generate_content(
    model=MODEL_ID,
    contents=my_prompt,
)

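# The response contains two parts: parts[0] holds the model's thoughts,
# and parts[1] holds the final answer.
display(Markdown("### Thoughts"))
display(Markdown(response.candidates[0].content.parts[0].text))
display(Markdown("### Answer"))
display(Markdown(response.candidates[0].content.parts[1].text))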

### Thoughts

Here's a thought process to arrive at the solution:

1. **Analyze the Goal and Numbers:**  The target is 746. The available numbers are 8, 7, 50, and 4.  Notice the target is a fairly large number, hinting that multiplication will likely be involved.

2. **Consider the Magnitudes:** 50 is the largest number.  Multiplying it by another number might get us closer to the target faster.

3. **Initial Multiplication Ideas with 50:**
   * 50 * 8 = 400 (Too small)
   * 50 * 7 = 350 (Too small)
   * 50 * 4 = 200 (Too small)

4. **Combining Multiplication:** Maybe multiplying two of the smaller numbers first, then multiplying by 50 could work.
   * 8 * 7 = 56.  Then 56 * 50 is much larger than 746.
   * 8 * 4 = 32.  Then 32 * 50 is larger than 746.
   * 7 * 4 = 28.  Then 28 * 50 is larger than 746.

5. **Re-evaluate the Role of 50:** Since multiplying 50 by another individual number isn't enough and multiplying it by products of the other numbers is too large, perhaps 50 should be added or subtracted later.

6. **Focus on Getting Closer to 746 without 50 (initially):**  Try combinations of 8, 7, and 4 to get a number that, when combined with 50, gets closer to 746.
   * 8 * 7 = 56.  What needs to be done to 56 to get near 700? Multiplying by a large number.
   * 8 * 4 = 32
   * 7 * 4 = 28

7. **Consider Combining Operations:**  Let's go back to multiplying 50, but consider adding or subtracting the other numbers.

8. **Trial and Error with 50 as a Multiplier:**
   * 50 * (something involving 8, 7, 4). We need the "something" to be roughly around 14 or 15 (746 / 50 ≈ 14.92).

9. **Trying to get 14 or 15 with 8, 7, 4:**
   * 8 + 7 - 4 = 11 (Too small)
   * 8 - 7 + 4 = 5 (Too small)
   * 8 * (7 - 4) = 8 * 3 = 24 (Too large)
   * (8 - 7) * 4 = 1 * 4 = 4 (Too small)

10. **Think About Getting Close to 700 with the other numbers:**
    * 8 * 7 * something... no, we can only use numbers once.

11. **Consider Using 50 as a Subtrahend:**  If we get a number *larger* than 746, we could subtract 50.

12. **Back to Multiplication with the other numbers:**
    * 8 * 7 = 56.  What do we need to add/multiply to get closer to 746?
    * Try multiplying 50 by one of the other numbers and then adjusting.

13. **Focus on getting a number slightly above 746:**
    * 50 * something + something else.
    * 50 * 8 = 400.  Need to add around 346.
    * 50 * 7 = 350. Need to add around 396.

14. **Consider Multiplying Two Numbers First:**
    * 50 * 8 = 400.  Can we get 346 with 7 and 4?  7 * 4 = 28 (not close).
    * 50 * 7 = 350. Can we get 396 with 8 and 4?  8 * 4 = 32 (not close).

15. **Think about multiplying two numbers to get close to a factor of 746:** Not easily apparent.

16. **Try a different arrangement of multiplication:**
    * 50 * (8 + something). We need "something" to be small.
    * 50 * (7 + something).
    * 50 * (4 + something).

17. **Consider a combination where multiplication creates a larger number, and subtraction brings it down:**
    * 50 * 8 = 400. If we multiply 7 and 4 we get 28. 400 + 28 is too small.
    * What if we multiply 50 by the *sum* of some numbers?

18. **Focus on getting something close to 700:**
    * 50 * (8 + 7) = 50 * 15 = 750.  We're very close! Now subtract 4: 750 - 4 = 746.  **Solution found!**

### Answer

Here's the solution:

**50 * (8 + 7) - 4 = 746**

**Explanation:**

1. **8 + 7 = 15**
2. **50 * 15 = 750**
3. **750 - 4 = 746**
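
A brute-force search is one way to confirm the model's answer. The sketch below is not part of the model output: the `solve` helper is a hypothetical function that repeatedly combines two remaining values with `+`, `-`, or `*` until a single value is left, and keeps every expression that equals the target while using each number exactly once.

```python
from itertools import combinations


def solve(numbers, target):
    """Return expression strings that use every number once and equal target."""
    found = set()

    def search(items):
        # items is a list of (value, expression) pairs still to be combined.
        if len(items) == 1:
            if items[0][0] == target:
                found.add(items[0][1])
            return
        for i, j in combinations(range(len(items)), 2):
            (a, expr_a), (b, expr_b) = items[i], items[j]
            rest = [items[k] for k in range(len(items)) if k not in (i, j)]
            candidates = [
                (a + b, f"({expr_a} + {expr_b})"),
                (a * b, f"({expr_a} * {expr_b})"),
                (a - b, f"({expr_a} - {expr_b})"),  # subtraction is not commutative,
                (b - a, f"({expr_b} - {expr_a})"),  # so both orders are tried
            ]
            for candidate in candidates:
                search(rest + [candidate])

    search([(n, str(n)) for n in numbers])
    return sorted(found)


solutions = solve([8, 7, 50, 4], 746)
for expression in solutions:
    print(expression, "=", eval(expression))
```

The fully parenthesized form of the model's answer, `((50 * (8 + 7)) - 4)`, should appear among the results. The search space is tiny for four numbers, so exhaustive enumeration is practical here; it would grow quickly with more inputs.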


## Next Steps

- Explore the Vertex AI [Cookbook](https://cloud.google.com/vertex-ai/generative-ai/docs/cookbook) for a curated, searchable gallery of notebooks for Generative AI.
- Explore other notebooks and samples in the [Google Cloud Generative AI repository](https://github.com/GoogleCloudPlatform/generative-ai).