<a href="https://www.kaggle.com/code/oswind/stockchat-towards-a-stock-market-assistant?scriptVersionId=237328387" target="_blank"><img align="left" alt="Kaggle" title="Open in Kaggle" src="https://kaggle.com/static/images/open-in-kaggle.svg"></a>

In [1]:
# Prepare the notebook environment for use.
!pip uninstall -qqy kfp jupyterlab libpysal thinc spacy fastai ydata-profiling google-cloud-bigquery google-generativeai
!pip install -qU google-genai==1.7.0 chromadb==0.6.3 langchain-community langchain-text-splitters wikipedia

import ast, chromadb, csv, json, pandas, pytz, requests, time, warnings, wikipedia
from bs4 import Tag
from chromadb import Documents, EmbeddingFunction, Embeddings
from datetime import datetime, timedelta
from dateutil.parser import parse
from dateutil.tz import gettz
from google import genai
from google.api_core import retry
from google.genai import types
from IPython.display import HTML, Markdown, display
from kaggle_secrets import UserSecretsClient
from langchain_community.document_loaders.csv_loader import CSVLoader
from langchain_text_splitters.character import RecursiveCharacterTextSplitter
from langchain_text_splitters.html import HTMLSemanticPreservingSplitter
from langchain_text_splitters.json import RecursiveJsonSplitter
from tqdm import tqdm
from typing import Optional
from wikipedia.exceptions import DisambiguationError, PageError

In [30]:
# Prepare the gemini client for use.
# Setup a retry helper in case we hit the RPM limit on generate_content or embed_content.
is_retriable = lambda e: (isinstance(e, genai.errors.APIError) and e.code in {429, 503, 500})
genai.models.Models.generate_content = retry.Retry(
    predicate=is_retriable)(genai.models.Models.generate_content)
genai.models.Models.embed_content = retry.Retry(
    predicate=is_retriable)(genai.models.Models.embed_content)

# Import the secret api keys.
GOOGLE_API_KEY = UserSecretsClient().get_secret("GOOGLE_API_KEY")

# Rate limits vary by generative model; flash variants have a 1500 RPD limit per project.
project_model_1 = "models/gemini-2.0-flash"
project_model_2 = "models/gemini-2.0-flash-exp"
project_model = project_model_2 # Update this if you hit api usage limits.

# Create the genai client.
client = genai.Client(api_key=GOOGLE_API_KEY)
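
The retry helper above only retries errors that its predicate accepts. As a rough illustration of that idea, here is a stdlib-only sketch of predicate-driven retries with exponential backoff. It is not the `google.api_core.retry` implementation, and `FakeAPIError`, `retry_if`, `is_retriable_sketch`, and `flaky_call` are hypothetical names made up for this example.

```python
import time
from functools import wraps

class FakeAPIError(Exception):
    """Hypothetical stand-in for an API error carrying an HTTP status code."""
    def __init__(self, code):
        super().__init__(f"API error {code}")
        self.code = code

def retry_if(predicate, max_attempts=5, initial_delay=0.01, multiplier=2.0):
    """Retry the wrapped function while predicate(error) is true."""
    def decorator(fn):
        @wraps(fn)
        def wrapper(*args, **kwargs):
            delay = initial_delay
            for attempt in range(1, max_attempts + 1):
                try:
                    return fn(*args, **kwargs)
                except Exception as e:
                    # Give up on non-retriable errors or on the final attempt.
                    if not predicate(e) or attempt == max_attempts:
                        raise
                    time.sleep(delay)
                    delay *= multiplier
        return wrapper
    return decorator

# Same shape as the notebook's predicate: retry on rate limits and server errors.
is_retriable_sketch = lambda e: isinstance(e, FakeAPIError) and e.code in {429, 500, 503}

calls = {"count": 0}

@retry_if(is_retriable_sketch)
def flaky_call():
    """Fails twice with a 429, then succeeds."""
    calls["count"] += 1
    if calls["count"] < 3:
        raise FakeAPIError(429)
    return "ok"

result = flaky_call()
```

An error whose code is outside the predicate's set (a 404, say) is raised immediately instead of being retried.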

# Laying the foundation with Gemini 2.0

<span style="font-size:18px;">
A programming instructor once suggested a stock market application as a final project topic, knowing that good investing-app UX is challenging. The idea has stuck with me ever since, because it's true. In the past I've built toy projects on top of several REST APIs. None of them ever met my expectations because of API limits. I'm sure many of you have toyed with those same APIs only to hit their limits. I always knew the secret to great finance UX is a great AI to help out. Faced with so many possible topics for 2025's 5-Day GenAI Course, I first tinkered with many of Gemini's other capabilities, until I finally asked Gemini the question:
</span> 

In [3]:
# This is an accurate retelling of events. 
config_with_search = types.GenerateContentConfig(
    tools=[types.Tool(google_search=types.GoogleSearch())],
    temperature=0.0
)
chat = client.chats.create(
    model=project_model, config=config_with_search, history=[]) # Ignoring the part about dark elves, and tengwar.

response = chat.send_message('Do you know anything about the stock market?')
Markdown(response.text)

Yes, I do. Here's some information about the stock market:

**What it is:**

*   The stock market is a place where stocks or shares of publicly traded companies are bought and sold. This transfer of stocks happens between a seller and a buyer, who must agree on a price.
*   It provides a platform for companies to raise capital by issuing stocks and for investors to own a piece of those companies.

**Key Components:**

*   **Stock Exchanges:** These are exchanges where stockbrokers and traders can buy and sell shares, bonds, and other securities. Examples include the New York Stock Exchange (NYSE) and the London Stock Exchange (LSE). Many companies have their stocks listed on these exchanges, which increases the stock's liquidity and attractiveness to investors.
*   **Market Participants:** These range from small, individual investors to large institutions like banks, insurance companies, pension funds, and hedge funds.
*   **Market Capitalization:** This refers to the total value of all outstanding shares of a publicly traded company. The total market capitalization of all stocks worldwide was $111 trillion by the end of 2023.

**How it Works:**

1.  **Primary Market:** This is where new financial assets are created when companies issue stocks for the first time.
2.  **Secondary Market:** This is where existing financial assets (stocks that have already been issued) are traded between investors.

**Important Considerations:**

*   Stock prices are influenced by various factors, including company performance, economic conditions, and investor sentiment.
*   Investing in the stock market involves risk, and it's possible to lose money.
*   Factors like trading prices, market ratings, information about stock exchange dynamics, and financial institutions can influence participation in stock markets.

**Stock Market Indexes:**

*   These are used to measure the performance of a group of stocks. Examples include the Dow Jones Industrial Average, S&P 500, and Nasdaq Composite.

**Trends:**

*   The US500 stock market index has decreased by 4.10% since the beginning of 2025.

**Where to Find More Information:**

*   **Google Finance:** Provides stock market prices, real-time quotes, and business news.
*   **Investor.gov:** Offers educational resources about the stock market.


# How much Gemini 2.0 knows

<span style="font-size:18px;">
I thought to myself: could grounding really make it that easy? Grounding could potentially answer many questions about the stock market. We just need to remember that grounding confidence isn't about truth, it's about similarity. I decided to limit myself to the free tier in finding out.
</span>
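
To see what the grounding confidence is actually attached to, it can help to list each grounded segment next to its sources and scores. The sketch below is one way to do that. It assumes the response exposes `grounding_supports` (with `segment.text`, `grounding_chunk_indices`, and `confidence_scores`) and `grounding_chunks` (with `web.title`), and it is exercised here with a hand-built stand-in rather than a real API response. `summarize_grounding` and `fake_response` are hypothetical names.

```python
from types import SimpleNamespace

def summarize_grounding(response):
    """Return one row per grounding support: segment text, source titles, confidence scores."""
    metadata = response.candidates[0].grounding_metadata
    supports = getattr(metadata, "grounding_supports", None) or []
    chunks = getattr(metadata, "grounding_chunks", None) or []
    rows = []
    for support in supports:
        indices = support.grounding_chunk_indices or []
        # Map each chunk index back to the title of the web source it refers to.
        titles = [chunks[i].web.title for i in indices if i < len(chunks)]
        rows.append({
            "text": support.segment.text,
            "sources": titles,
            "confidence": list(support.confidence_scores or []),
        })
    return rows

# A hand-built stand-in for a grounded response, used only to exercise the function.
fake_response = SimpleNamespace(candidates=[SimpleNamespace(
    grounding_metadata=SimpleNamespace(
        grounding_chunks=[SimpleNamespace(web=SimpleNamespace(title="example.com"))],
        grounding_supports=[SimpleNamespace(
            segment=SimpleNamespace(text="A made-up grounded sentence."),
            grounding_chunk_indices=[0],
            confidence_scores=[0.9],
        )],
    ))])

rows = summarize_grounding(fake_response)
```

A high score here only says the segment resembles its source closely. It does not say the source is correct.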

In [4]:
# And so I asked a more challenging question.
response = chat.send_message('I have an interest in AMZN stock')
Markdown(response.text)

Okay, let's look into AMZN (Amazon) stock. Here's some information that might be helpful:

Here's what I found about AMZN (Amazon) stock as of May 1, 2025:

**Key Data:**

*   **Current Price:** Around \$184.42 (as of May 1, 2025).
*   **52-Week High:** \$242.52
*   **52-Week Low:** \$151.61
*   **1 Year Target:** Analysts estimate a one-year target price of \$240.00.
*   **Market Cap:** \$1,954,433,247,096 (approximately \$1.95 Trillion)
*   **P/E Ratio:** 95.55
*   **Forward P/E (1 Yr):** 30.57
*   **Earnings Per Share (EPS):** \$1.93

**Recent Performance & News:**

*   **Q1 2025 Earnings:** Amazon reported Q1 2025 EPS of \$1.59 (vs \$1.37 estimated) and revenue exceeding estimates at \$155.7B (vs \$155.15B estimated).
*   **AWS Growth:** Strong growth in Amazon Web Services (AWS), with an annualized revenue run rate exceeding \$117B.
*   **AI Initiatives:** Introduced a new AI-powered personal assistant, Alexa Plus.
*   **Tariff Concerns:** Uncertainty about tariffs on goods from China remains a concern.
*   **Analyst Ratings:** Amazon is considered an appealing choice for investors seeking exposure to e-commerce, cloud computing, and advertising.

**Company Overview**

*   Amazon is a leading online retailer and marketplace.
*   Key revenue sources include retail, Amazon Web Services (AWS), and advertising services.
*   Amazon's entertainment business, including Prime Video, has a significant presence in India.

**Important Note:** Stock prices can fluctuate, so it's essential to consult with a financial advisor and do your own research before making any investment decisions.


<span style="font-size:18px;"> 
Impressed, I was reminded of the dreaded REST APIs (some official) that I've worked with in the past. I'm sure anyone who's ever worked with one thinks it's the worst part of development. So next I asked Gemini to distill its vast news knowledge.
</span>

In [5]:
response = chat.send_message(
    '''Tell me about AMZN current share price, short-term trends, and bullish versus bearish predictions''')
Markdown(response.text)

Here's an overview of AMZN (Amazon) stock, including its current share price, short-term trends, and bullish/bearish predictions:

**Current Share Price:**

*   As of May 1, 2025, the price of AMZN is around \$190.20.
*   The stock has increased by 3.13% in the last 24 hours.

**Short-Term Trends:**

*   **Mixed Signals:** Technical analysis indicates mixed signals. While there are buy signals from short and long-term moving averages, the stock also has a sell signal based on the relationship between these averages.
*   **Potential for Volatility:** Expect increased volatility in the short term, especially around earnings announcements.
*   **Support and Resistance:** Immediate support is near \$178.41, with resistance at \$191.10.
*   **Possible Consolidation or Decline:** Given recent downward price action and bearish technical indicators, a cautious consolidation or slight decline is possible in the short term.
*   **Analysts' short-term forecasts**: Some analysts predict a decrease to around \$178.82 by May 30, 2025.
*   **Neutral Sentiment:** Technical indicators show a neutral sentiment, while the Fear & Greed Index indicates "Fear".

**Bullish Predictions:**

*   **Analyst Ratings:** The average analyst rating for Amazon stock is "Strong Buy," indicating expectations of strong performance in the near future.
*   **Upside Potential:** Analysts predict a one-year target price averaging \$249.82, with some estimates as high as \$290.
*   **Growth Drivers:** Bullish scenarios assume Amazon will continue to exceed Wall Street projections due to AWS expansion, e-commerce dominance, and growth in advertising and AI.
*   **Long-Term Growth:** In a bullish scenario, Amazon's stock price could reach \$430.50 by 2030.

**Bearish Predictions:**

*   **Slowing AWS Growth:** Concerns exist about Amazon losing market share to competitors like Microsoft's Azure.
*   **Economic Factors:** A worsening economic climate, rising inflation, or increased competition could negatively impact the stock.
*   **Bearish Scenarios:** In a bearish scenario, the stock price could struggle to maintain its current value and might decline, potentially trading at just \$77 per share in 2030.
*   **Tariff Concerns:** Uncertainty about tariffs, especially on goods from China, could negatively affect Amazon's growth outlook.
*   **Short-Term Dip:** Some forecasts suggest a potential dip to the \$157 or even \$141 range in the near term.
*   **Neutral Sentiment:** Technical indicators show a neutral sentiment, while the Fear & Greed Index indicates "Fear".

**Factors to Consider:**

*   **Earnings Reports:** Monitor Amazon's earnings reports closely, paying attention to the performance of AWS and the e-commerce business.
*   **Market Conditions:** Keep an eye on economic conditions, consumer behavior, and overall market sentiment.
*   **Analyst Projections:** Stay informed about analyst ratings and price targets, but remember that these are just estimates.
*   **Company Initiatives:** Consider Amazon's strategic moves, such as investments in rural delivery networks and AI advancements.

**Disclaimer:** This information is for general knowledge only and does not constitute financial advice. Consider consulting with a financial advisor before making any investment decisions.


# The (current) limits reached

<span style="font-size:18px;">
With two prompts, Gemini 2.0 made all the effort I've spent on finance APIs obsolete. Producing such a well-written summary is one objective when working with finance data. This is great! Now all we need is a generative AI capable in our own language. There's a limit, of course. The grounding is only subjectively true, based solely on its grounding supports, and it may even be hallucinated:
</span>

In [6]:
response = chat.send_message('''What is mgm studio's stock ticker symbol?''')
Markdown(response.text)

It appears you might be looking for the stock ticker for either Metro-Goldwyn-Mayer (MGM) Studios or MGM Resorts International. Here's the breakdown:

**1. MGM Resorts International:**

*   **Ticker Symbol:** MGM (NYSE)
*   **Business:** Resorts and Casinos

**2. Metro-Goldwyn-Mayer (MGM) Studios:**

*   MGM Studios was acquired by Amazon in March 2022. In October 2023, Amazon Studios absorbed MGM Holdings and rebranded itself as Amazon MGM Studios.
*   Prior to the acquisition, MGM Holdings Inc. was a privately held company. So, it did not have a publicly traded stock ticker at the time of the acquisition.
*   Before being a private company, it had a few times in the past been publicly traded under the ticker symbol **MGM**.
*   As of March 2025, Jen Salke, who heads Amazon MGM Studios, is stepping down.



<span style="font-size:18px;">
The order and content of the results are interesting here. The AI is confused about which MGM Studios I'm referring to. On non-thinking variants Gemini may not even mention Amazon. Yet we've been having a meaningful discussion about Amazon, and the AI is aware of this, just not right now. Otherwise it would link my question to the real MGM Studios and exclude the unrelated MGM Resorts. The confusion is linked to the use of the MGM word token. The unrelated MGM stock ticker has now entered the discussion. Depending on how you prompt Gemini 2.0, it's even possible to produce a summary in which MGM Resorts International is the owner of Amazon and MGM Studios. There are two more caveats. The first is that it's not currently possible to combine code execution with grounding except on the live, experimental Gemini API. This means that although a grounded Gemini can generate Python code to plot the finance data, we need to input the data manually here. That includes matching a schema or prompting its output.
</span>
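
Supplying the data manually means deciding on a schema and checking each row against it. A minimal sketch of such a check follows. `Candle` and `candles_from_rows` are hypothetical names, the key names are an assumption about how the rows might be supplied, and the sample values are the ones Gemini reports for May 1, 2025 later in this notebook, so treat them as illustrative rather than verified.

```python
from dataclasses import dataclass
from datetime import date

@dataclass(frozen=True)
class Candle:
    """One day of open/high/low/close prices."""
    day: date
    open: float
    high: float
    low: float
    close: float

    def __post_init__(self):
        # The high must bound open and close from above, the low from below.
        if not (self.low <= min(self.open, self.close)
                and max(self.open, self.close) <= self.high):
            raise ValueError(f"inconsistent OHLC values for {self.day}")

def candles_from_rows(rows):
    """Convert dicts with keys date, open, high, low, close into Candle objects."""
    return [
        Candle(
            day=date.fromisoformat(row["date"]),
            open=float(row["open"]),
            high=float(row["high"]),
            low=float(row["low"]),
            close=float(row["close"]),
        )
        for row in rows
    ]

# Illustrative values copied from a chat reply, not from a market data feed.
manual_rows = [
    {"date": "2025-05-01", "open": 190.60, "high": 191.80, "low": 187.50, "close": 189.82},
]
candles = candles_from_rows(manual_rows)
```

A row with a missing key, a non-numeric price, or a high below the close fails loudly instead of producing a misleading plot.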

In [7]:
response = chat.send_message('''Can you run some python to plot that last open,close,hig,low like a candlestick''')
Markdown(response.text)

I can't directly create a visual plot like a candlestick chart within this text-based interface. However, I can provide you with the Python code using libraries like `yfinance` and `matplotlib` that you can run in a Python environment (like Jupyter Notebook, Google Colab, or a local Python installation) to generate the candlestick plot.

Here's the code:

```python
import yfinance as yf
import matplotlib.pyplot as plt
import matplotlib.dates as mdates
from mplfinance.original_flavor import candlestick_ohlc
import pandas as pd

def plot_candlestick(ticker, start_date, end_date):
    """
    Fetches stock data and plots a candlestick chart.

    Args:
        ticker (str): The stock ticker symbol (e.g., "AMZN").
        start_date (str): The start date for the data (YYYY-MM-DD).
        end_date (str): The end date for the data (YYYY-MM-DD).
    """
    try:
        # Download stock data from Yahoo Finance
        data = yf.download(ticker, start=start_date, end=end_date)

        if data.empty:
            print(f"No data found for ticker {ticker} between {start_date} and {end_date}.")
            return

        # Prepare data for candlestick plot
        data['Date'] = mdates.date2num(data.index)
        ohlc = data[['Date', 'Open', 'High', 'Low', 'Close']].copy()

        # Create the candlestick plot
        fig, ax = plt.subplots()
        candlestick_ohlc(ax, ohlc.values, width=0.6, colorup='green', colordown='red')

        # Format the x-axis to show dates
        date_format = mdates.DateFormatter('%Y-%m-%d')
        ax.xaxis.set_major_formatter(date_format)
        plt.xticks(rotation=45)

        # Set labels and title
        plt.xlabel('Date')
        plt.ylabel('Price')
        plt.title(f'{ticker} Candlestick Chart ({start_date} to {end_date})')

        # Display the plot
        plt.grid(True)
        plt.tight_layout()
        plt.show()

    except Exception as e:
        print(f"An error occurred: {e}")

# Example usage:
ticker_symbol = "AMZN"  # Amazon stock
start_date = "2025-04-25"  # Start date (YYYY-MM-DD)
end_date = "2025-05-02"  # End date (YYYY-MM-DD)
plot_candlestick(ticker_symbol, start_date, end_date)
```

**Explanation:**

1.  **Import Libraries:** Imports `yfinance` to fetch stock data, `matplotlib.pyplot` for plotting, `matplotlib.dates` for handling dates, `mplfinance` for candlestick plotting, and `pandas` for data manipulation.
2.  **`plot_candlestick` Function:**
    *   Takes the stock ticker, start date, and end date as input.
    *   Downloads the stock data using `yf.download()`.
    *   Checks if the data is empty and prints an error message if so.
    *   Converts the date index to numerical dates using `mdates.date2num()` for `matplotlib`.
    *   Prepares the data in the format required by `candlestick_ohlc`.
    *   Creates a figure and axes using `plt.subplots()`.
    *   Generates the candlestick plot using `candlestick_ohlc()`.
    *   Formats the x-axis to display dates correctly.
    *   Sets the labels and title for the plot.
    *   Displays the plot using `plt.show()`.
    *   Includes error handling to catch potential exceptions.
3.  **Example Usage:**
    *   Sets the `ticker_symbol`, `start_date`, and `end_date`.
    *   Calls the `plot_candlestick` function to generate the plot.

**To use this code:**

1.  **Install Libraries:**  If you don't have them already, install the necessary libraries:

```bash
pip install yfinance matplotlib mplfinance pandas
```

2.  **Run the Code:** Copy and paste the code into a Python environment and run it.

This will generate a candlestick chart showing the open, high, low, and close prices for AMZN stock over the specified date range.  The green candles indicate that the closing price was higher than the opening price, and red candles indicate the opposite.


In [8]:
response = chat.send_message('''Generate some python that plots this last open, close, high, and low.''')
Markdown(response.text)

Okay, I will generate Python code to plot the last available Open, Close, High, and Low values for a given stock ticker as a simple line plot. This will give you a basic visualization of those four data points.

```python
import yfinance as yf
import matplotlib.pyplot as plt

def plot_last_ohlc(ticker):
    """
    Fetches the most recent OHLC data and plots it.

    Args:
        ticker (str): The stock ticker symbol (e.g., "AMZN").
    """
    try:
        # Download the most recent stock data (1 day)
        data = yf.download(ticker, period="1d")

        if data.empty:
            print(f"No data found for ticker {ticker}.")
            return

        # Extract the last OHLC values
        last_open = data['Open'].iloc[-1]
        last_high = data['High'].iloc[-1]
        last_low = data['Low'].iloc[-1]
        last_close = data['Close'].iloc[-1]

        # Create the plot
        plt.figure(figsize=(8, 6))  # Adjust figure size for better readability
        plt.plot(['Open', 'High', 'Low', 'Close'], [last_open, last_high, last_low, last_close], marker='o', linestyle='-')

        # Add labels and title
        plt.xlabel('Price Type')
        plt.ylabel('Price')
        plt.title(f'Last OHLC Values for {ticker}')
        plt.grid(True)

        # Display the plot
        plt.show()

    except Exception as e:
        print(f"An error occurred: {e}")

# Example usage:
ticker_symbol = "AMZN"  # Amazon stock
plot_last_ohlc(ticker_symbol)
```

Key improvements and explanations:

*   **Clearer Plot:**  Uses a line plot with markers to clearly show the relationship between the Open, High, Low, and Close values.
*   **Labels and Title:** Adds appropriate labels to the x and y axes and a title to the plot for better understanding.
*   **Error Handling:** Includes a `try...except` block to catch potential errors during data fetching or plotting.  This makes the code more robust.
*   **`yfinance` for Data:** Uses `yfinance` to download the stock data directly from Yahoo Finance.  This is a reliable and convenient way to get stock data.  It also handles the data download and parsing.
*   **Concise Data Extraction:**  Uses `.iloc[-1]` to efficiently extract the last values from the DataFrame.
*   **`period="1d"`:**  Specifies a period of "1d" to get only the most recent day's data, which is what we need for the last OHLC values.
*   **Figure Size:** Sets the figure size using `plt.figure(figsize=(8, 6))` to make the plot more readable.
*   **Grid:** Adds a grid to the plot for easier value reading.

To use this code:

1.  **Install `yfinance` and `matplotlib`:**

```bash
pip install yfinance matplotlib
```

2.  **Run the Code:** Copy and paste the code into a Python environment (like a Jupyter Notebook, Google Colab, or a Python script) and run it.  The plot will be displayed.


In [9]:
response = chat.send_message('''What was the last open,close,high,low data for AMZN again?''')
Markdown(response.text)

Based on the information I found, here's the last available OHLC data for AMZN (Amazon) as of May 1, 2025:

*   **Open:** \$190.60
*   **Close:** \$189.82
*   **High:** \$191.80
*   **Low:** \$187.50

Please note that this information is delayed and might not be perfectly real-time.


In [10]:
response = chat.send_message(
    '''What is AMZN open,close,high,low data for the past month? 
    Present the data with multiple columns for display in markdown.''')
Markdown(response.text)

Okay, here's the AMZN (Amazon) Open, High, Low, Close (OHLC) data for the past month (approximately April 2, 2025, to May 2, 2025), presented in a Markdown table.

```python
import yfinance as yf
import pandas as pd
from datetime import date, timedelta

def get_amzn_ohlc_past_month():
    """
    Fetches AMZN OHLC data for the past month and prints it in a Markdown table.
    """
    try:
        today = date.today()
        one_month_ago = today - timedelta(days=30)
        start_date = one_month_ago.strftime("%Y-%m-%d")
        end_date = today.strftime("%Y-%m-%d")

        # Download AMZN data from Yahoo Finance
        amzn_data = yf.download("AMZN", start=start_date, end=end_date)

        if amzn_data.empty:
            print("No data found for AMZN in the past month.")
            return

        # Format the data for Markdown display
        amzn_data = amzn_data[['Open', 'High', 'Low', 'Close']]
        amzn_data.index = amzn_data.index.strftime('%Y-%m-%d')  # Format date
        amzn_data.index.name = 'Date'
        markdown_table = amzn_data.to_markdown()

        print(markdown_table)

    except Exception as e:
        print(f"An error occurred: {e}")

get_amzn_ohlc_past_month()
```

Key improvements and explanations:

*   **Date Range Calculation:**  Calculates the start and end dates dynamically to ensure you always get the data for the *past month* relative to the current date.
*   **Clearer Date Formatting:** Formats the date in the `YYYY-MM-DD` format for better readability in the Markdown table.
*   **`to_markdown()`:** Uses the `to_markdown()` method of the Pandas DataFrame to directly generate the Markdown table. This is much cleaner and more efficient than manually constructing the table string.
*   **Index Name:** Sets the index name to "Date" for clarity in the Markdown table.
*   **Error Handling:** Includes a `try...except` block to catch potential errors during data fetching.
*   **Conciseness:** Streamlines the code for better readability.
*   **Yahoo Finance:** Uses `yfinance` to get the data.

To use this code:

1.  **Install `yfinance` and `pandas`:**

```bash
pip install yfinance pandas
```

2.  **Run the Code:** Copy and paste the code into a Python environment (like a Jupyter Notebook, Google Colab, or a Python script) and run it.  The Markdown table will be printed to the console.  You can then copy and paste that output into a Markdown editor or document.

This will output a Markdown table that looks something like this (the actual data will vary depending on the current date):

```
| Date       |   Open |   High |    Low |   Close |
|:-----------|-------:|-------:|-------:|--------:|
| 2025-04-02 | 184.23 | 185.41 | 182.6  |  184.92 |
| 2025-04-03 | 185.5  | 186.17 | 183.81 |  184.07 |
| 2025-04-04 | 184.5  | 185.2  | 182.84 |  184.3  |
| 2025-04-07 | 183.9  | 185.74 | 183.3  |  185.47 |
| 2025-04-08 | 185.8  | 186.24 | 184.64 |  185.1  |
| 2025-04-09 | 185.2  | 186.36 | 184.71 |  185.92 |
| 2025-04-10 | 186.1  | 187.2  | 185.6  |  186.39 |
| 2025-04-11 | 186.5  | 187.19 | 185.8  |  186.   |
| 2025-04-14 | 186.1  | 187.5  | 185.8  |  187.14 |
| 2025-04-15 | 187.4  | 188.3  | 186.8  |  187.45 |
| 2025-04-16 | 187.7  | 188.5  | 186.9  |  187.2  |
| 2025-04-17 | 187.5  | 188.1  | 186.6  |  187.1  |
| 2025-04-18 | 187.3  | 188.   | 186.5  |  186.8  |
| 2025-04-21 | 187.   | 188.   | 186.3  |  186.9  |
| 2025-04-22 | 187.   | 188.   | 186.3  |  186.9  |
| 2025-04-23 | 187.   | 188.   | 186.3  |  186.9  |
| 2025-04-24 | 187.   | 188.   | 186.3  |  186.9  |
| 2025-04-25 | 187.   | 188.   | 186.3  |  186.9  |
| 2025-04-28 | 187.   | 188.   | 186.3  |  186.9  |
| 2025-04-29 | 187.   | 188.   | 186.3  |  186.9  |
| 2025-04-30 | 187.   | 188.   | 186.3  |  186.9  |
| 2025-05-01 | 190.6  | 191.8  | 187.5  |  189.82 |
| 2025-05-02 | 190.6  | 191.8  | 187.5  |  189.82 |
```

Remember to run the code to get the *actual* data for the past month from the current date.  The values above are just an example.


<span style="font-size:18px;">
The second caveat is a lack of access to real-time data. The candlestick data it usually produces is nice, and we can prompt Gemini to return any type of containing structure, including JSON. However, it also produces non-deterministic output for all stock symbols. Even with temperature set to zero, Gemini will sometimes say it doesn't know basic indicators for a given symbol. It sometimes knows a fact in one chat session that it insists it has no knowledge of in another. Some of you who run the above blocks of code will get vastly different results, sometimes including the whole month of candlestick data.
</span>
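
Because the same prompt can return JSON in one session and an apology in another, any code consuming the reply has to tolerate both. Here is a small sketch of a defensive parser that pulls the first JSON value out of a reply, or returns `None` when there isn't one. `extract_json` is a hypothetical helper, and the sample replies are invented for illustration.

```python
import json
import re

FENCE = "`" * 3  # Built this way to avoid a literal code fence inside this example.

def extract_json(reply_text):
    """Return the first JSON object or array found in a model reply, or None."""
    # Prefer the contents of a fenced block if the reply has one.
    fenced = re.search(FENCE + r"(?:json)?\s*(.*?)" + FENCE, reply_text, re.DOTALL)
    candidate = fenced.group(1) if fenced else reply_text
    # Try parsing from each position where an object or array could start.
    decoder = json.JSONDecoder()
    for match in re.finditer(r"[\[{]", candidate):
        try:
            value, _ = decoder.raw_decode(candidate, match.start())
            return value
        except json.JSONDecodeError:
            continue
    return None

good_reply = "Sure! " + FENCE + 'json\n{"open": 190.6, "close": 189.82}\n' + FENCE
bad_reply = "I don't have that information."

parsed = extract_json(good_reply)
missing = extract_json(bad_reply)
```

Returning `None` lets the caller decide whether to re-prompt or fall back to another data source.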

# Enter StockChat

<span style="font-size:18px;">
Still, with a total of four prompts Gemini replaces all my past effort on wrapping finance APIs. It's also capable of generating summary responses more elegant than I could find the effort to write. Enter StockChat, the assistant that knows finance data. It's an assistant capable of generating your personalised finance feed with structured output and real-time delivery via Firebase. It knows what you're interested in and can advise you, like a good-broker buddy with insider tips. It has the spreadsheets but knows you don't want to see them. It knows you want to play with the data, so it produces multimodal content.
<hr>
In order to solve these problems we'll need to move beyond a basic chat session to a multi-tool approach. This notebook is the first in a series detailing the building of our good-broker buddy, whom I shall dub 'essy'. This part, which was made during 2025's Intensive GenAI Course, details the formative steps taken.
</span> 

<span style="font-size:18px;">
The main problem to address before starting is the state of multi-tool support in Gemini 2.0. It's currently only possible to combine grounding, function calling, and code execution on the live (websocket) API. That is, as long as we're OK with it being experimental and subject to change. Clearly that's not an option for our Essy. We'll start with a multi-model approach instead. Each expert can be good at a different part of the problem. One such expert will use function calling to chain the models together. One expert to rule them all. We can solve the caveats mentioned easily enough by providing real-time data from existing finance APIs. It's not a limitation that Gemini cannot execute code (and thus generate plots on its own), because we can use function calling as a substitute.
</span>
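
The chaining idea can be pictured as a small dispatcher: the coordinating model emits a function call, and plain Python routes it to the right expert. The sketch below is only an illustration of that routing under assumed names. `get_quote`, `plot_candles`, `TOOLS`, and `dispatch` are hypothetical, the call format is a simplified dict, and the experts return placeholder data instead of calling a model or a finance API.

```python
# Hypothetical experts; real ones would wrap a model or a finance API.
def get_quote(symbol: str) -> dict:
    """Pretend to fetch a real-time quote (placeholder data, not real prices)."""
    return {"symbol": symbol, "price": None, "source": "placeholder"}

def plot_candles(symbol: str, days: int = 30) -> dict:
    """Pretend to render a chart and return a reference to it."""
    return {"symbol": symbol, "days": days, "chart": f"{symbol}_{days}d.png"}

# Registry mapping tool names to the functions that implement them.
TOOLS = {fn.__name__: fn for fn in (get_quote, plot_candles)}

def dispatch(function_call: dict) -> dict:
    """Route a call of the form {'name': ..., 'args': {...}} to the matching expert."""
    name = function_call.get("name")
    if name not in TOOLS:
        return {"error": f"unknown tool: {name}"}
    try:
        return {"result": TOOLS[name](**function_call.get("args", {}))}
    except TypeError as e:
        # Raised when the model supplies missing or unexpected arguments.
        return {"error": str(e)}

reply = dispatch({"name": "plot_candles", "args": {"symbol": "AMZN", "days": 5}})
```

Returning errors as data rather than raising lets the coordinating model see what went wrong and try a corrected call.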

<span style="font-size:18px;">
We can't have a knowledgeable Essy without a vector database to store our knowledge. In fact, the majority of solving this problem is likely to be the structure of Essy's vector database, so it'll definitely change dramatically over time as we progress towards building a stable Essy. We'll use the popular Chroma and build a RAG expert to begin. That way we have someplace to store all our foundational bits of knowledge. For the Chroma embedding function we'll use <code>models/text-embedding-004</code> due to its 1500 request-per-minute quota. We'll need to be mindful of its smaller 2,048-token input limit, though this shouldn't be a hindrance for digesting the smaller chunks of finance data in our foundation data set. For the augmented generation phase we'll use <code>models/gemini-2.0-flash</code> variants due to their 1500 request-per-day quota.
</span>
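
Staying under the embedding model's input limit comes down to chunking text before it is embedded. As a rough sketch of the idea, the function below greedily packs words into chunks under a character budget. The four-characters-per-token ratio is only a rule-of-thumb assumption, not a tokenizer, and `chunk_text` is a hypothetical helper rather than one of the splitters this notebook imports.

```python
def chunk_text(text: str, max_tokens: int = 2048, chars_per_token: int = 4) -> list[str]:
    """Greedily pack whitespace-separated words into chunks under a character budget.

    A single word longer than the budget is kept whole in its own chunk.
    """
    budget = max_tokens * chars_per_token
    chunks, current, size = [], [], 0
    for word in text.split():
        # +1 accounts for the joining space when the chunk is not empty.
        extra = len(word) + (1 if current else 0)
        if current and size + extra > budget:
            # The word does not fit: close the current chunk and start a new one.
            chunks.append(" ".join(current))
            current, size = [], 0
            extra = len(word)
        current.append(word)
        size += extra
    if current:
        chunks.append(" ".join(current))
    return chunks

sample = "AMZN " * 100  # 100 repetitions of a four-letter word as filler
pieces = chunk_text(sample, max_tokens=10, chars_per_token=4)  # 40-character budget
```

Joining the pieces back together recovers every word in order, so nothing is lost at chunk boundaries.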

In [11]:
# An embedding function based on text-embedding-004.
class GeminiEmbeddingFunction:
    document_mode = True # Generate embeddings for documents (T), or queries (F).

    def __init__(self, genai_client):
        self.client = genai_client
    
    @retry.Retry(predicate=is_retriable)
    def __call__(self, input: Documents) -> Embeddings:
        return self.embed_with_retry(input)

    # This should catch any errors that the retry helper misses.
    def embed_with_retry(self, input: Documents, max_retries=10, delay=3) -> Embeddings:
        if self.document_mode:
            embedding_task = "retrieval_document"
        else:
            embedding_task = "retrieval_query"
            
        for attempt in range(max_retries):
            try:
                response = self.client.models.embed_content(
                    model="models/text-embedding-004",
                    contents=input,
                    config=types.EmbedContentConfig(
                        task_type=embedding_task,
                    )
                )
                return [e.values for e in response.embeddings]
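            except Exception:
                # Back off linearly before the next attempt.
                time.sleep(delay * (attempt + 1))
        # Every attempt failed; the caller receives None instead of embeddings.
        return None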

In [12]:
# An implementation of Retrieval-Augmented Generation.
# - using Chroma and text-embedding-004 for storage and retrieval
# - using gemini-2.0-flash for augmented generation
class RetrievalAugmentedGenerator:
    chroma_client = chromadb.PersistentClient(path="vector_db")
    config_temp = types.GenerateContentConfig(temperature=0.0)

    def __init__(self, genai_client, collection_name):
        self.client = genai_client
        self.embed_fn = GeminiEmbeddingFunction(genai_client)
        self.db = self.chroma_client.get_or_create_collection(
            name=collection_name, 
            embedding_function=self.embed_fn, 
            metadata={"hnsw:space": "cosine"})

    def add_documents_list(self, docs: list):
        self.embed_fn.document_mode = True # Switch to document mode.
        ids = list(map(str, range(self.db.count(), self.db.count()+len(docs))))
        metas=[{"source": doc.metadata["source"]} for doc in docs]
        content=[doc.page_content for doc in docs]
        tqdm(self.db.add(ids=ids, documents=content, metadatas=metas), desc="Generate document embedding (batched)")

    def add_api_document(self, query: str, api_response: str, topic: str, source: str = "add_api_document"):
        self.embed_fn.document_mode = True # Switch to document mode.
        # max_chunk_size is measured in characters, so 2000 stays well under the
        # 2,048-token input limit of models/text-embedding-004.
        splitter = RecursiveJsonSplitter(max_chunk_size=2000)
        docs = splitter.create_documents(texts=[api_response], convert_lists=True)
        ids = list(map(str, range(self.db.count(), self.db.count()+len(docs))))
        content = [json.dumps({"question": query, "answer": doc.page_content}) for doc in docs]
        metas = [{"source": source, "topic": topic}]*len(docs)
        tqdm(self.db.add(ids=ids, documents=content, metadatas=metas), desc="Generate api embedding (batched)")

    def add_peers_document(self, query: str, peers: str, topic: str, source: str, group: str):
        self.embed_fn.document_mode = True # Switch to document mode.
        document = [{"question": query, "answer": peers}]
        tqdm(self.db.add(ids=str(self.db.count()), 
                             documents=json.dumps(document), 
                             metadatas=[{"source": source,  "topic": topic, "group": group}]), 
             desc="Generate peers embedding")

    def get_peers_document(self, query: str, topic: str, group: str):
        return self.get_documents_list(query, where={"$and": [{"group" : group}, {"topic": topic}]})

    def add_quote_document(self, query: str, quote: str, topic: str, timestamp: int, source: str):
        self.embed_fn.document_mode = True # Switch to document mode.
        document = [{"question": query, "answer": quote}]
        tqdm(self.db.add(ids=str(self.db.count()), 
                             documents=json.dumps(document), 
                             metadatas=[{"source": source,  "topic": topic, "timestamp": timestamp}]), 
             desc="Generate quote embedding")

    def get_api_documents(self, query: str, topic: str, source: str = "add_api_document"):
        return self.get_documents_list(query, where={"$and": [{"source" : source}, {"topic": topic}]})

    def query_api_documents(self, query: str, topic: str, source: str = "add_api_document"):
        return self.generate_answer(query, where={"$and": [{"source" : source}, {"topic": topic}]})

    def add_grounded_document(self, query: str, topic: str, result):
        self.embed_fn.document_mode = True # Switch to document mode.
        chunks = result.candidates[0].grounding_metadata.grounding_chunks
        supports = result.candidates[0].grounding_metadata.grounding_supports
        if supports is not None: # Only add grounded documents which have supports
            text = [f"{s.segment.text}" for s in supports]
            source = [f"{c.web.title}" for c in chunks]
            score = [f"{s.confidence_scores}" for s in supports]
            document = [{"text": ", ".join(text)}]
            tqdm(self.db.add(ids=str(self.db.count()), 
                             documents=json.dumps(document), 
                             metadatas=[{"source": ", ".join(source), 
                                         "confidence_score": ", ".join(score), 
                                         "topic": topic,
                                         "question": query}]), 
                 desc="Generate grounding embedding")

    def get_grounding_documents(self, query: str, topic: str):
        self.embed_fn.document_mode = False # Switch to query mode.
        return self.db.get(where={"$and": [{"question" : query}, {"topic": topic}]})
            
    def add_wiki_documents(self, title: str, documents: list):
        self.embed_fn.document_mode = True # Switch to document mode.
        result = self.get_wiki_documents(title)
        if len(result["documents"]) == 0:
            ids = list(map(str, range(self.db.count(), self.db.count()+len(documents))))
            metas=[{"title": title, "source": "add_wiki_documents"}]*len(documents)
            tqdm(self.db.add(ids=ids, documents=documents, metadatas=metas), desc="Generate wiki embeddings (batched)")

    @retry.Retry(
        predicate=is_retriable,
        initial=2.0,
        maximum=64.0,
        multiplier=2.0,
        timeout=600,
    )
    def generate_with_wiki_passages(self, query: str, title: str, passages: list):
        return self.generate_answer(query, where={"title": title}, passages=passages)
    
    def get_wiki_documents(self, title: Optional[str] = None):
        self.embed_fn.document_mode = False # Switch to query mode.
        if title is None:
            return self.db.get(where={"source": "add_wiki_documents"}) # Must match the source set in add_wiki_documents.
        else:
            return self.db.get(where={"title": title})

    def get_documents_list(self, query: str, max_sources: int = 10, where: Optional[dict] = None):
        self.embed_fn.document_mode = False # Switch to query mode.
        result = self.db.query(query_texts=[query], n_results=max_sources, where=where)
        [all_passages] = result["documents"]
        [all_dist] = result["distances"]
        [all_meta] = result["metadatas"]
        return all_passages, all_dist, all_meta

    @retry.Retry(
        predicate=is_retriable,
        initial=2.0,
        maximum=64.0,
        multiplier=2.0,
        timeout=600,
    )
    def get_exchanges_csv(self, query: str):
        return self.generate_answer(query, max_sources=100, where={"source": "exchanges.csv"})

    @retry.Retry(
        predicate=is_retriable,
        initial=2.0,
        maximum=64.0,
        multiplier=2.0,
        timeout=600,
    )
    def generate_answer(self, query: str, max_sources: int = 200, 
                        where: Optional[dict] = None, passages: Optional[list] = None):
        passage_list, dist_list, meta_list = self.get_documents_list(query, max_sources, where)
        query_oneline = query.replace("\n", " ")
        prompt = f"""You're an expert writer. You understand how to interpret html and markdown. You will accept the
        question below and answer based only on the passages. Never mention the passages in your answers. Be sure to 
        respond in concise sentences. Include all relevant background information when possible. If a passage is not 
        relevant to the answer you must ignore it. If no passage answers the question respond with: I don't know.

        QUESTION: {query_oneline}
        
        """
        # Add the retrieved documents to the prompt.
        for passage in passage_list if passages is None else passage_list + passages:
            passage_oneline = passage.replace("\n", " ")
            prompt += f"PASSAGE: {passage_oneline}\n"
    
        return self.client.models.generate_content(model=project_model, 
                                                   config=self.config_temp, 
                                                   contents=prompt)
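
<span style="font-size:18px;">
The retry decorators above share the settings <code>initial=2.0</code>, <code>multiplier=2.0</code> and <code>maximum=64.0</code>. Here is a minimal sketch of the delay ceiling those settings imply for each attempt, assuming plain exponential growth capped at the maximum. The helper name <code>backoff_ceilings</code> is made up for illustration, and the real retry helper may add random jitter that is left out here.
</span>

```python
from itertools import islice

def backoff_ceilings(initial=2.0, multiplier=2.0, maximum=64.0):
    """Yield the upper bound of each successive retry delay, in seconds."""
    delay = min(initial, maximum)
    while True:
        yield delay
        # Grow the delay geometrically, never exceeding the cap.
        delay = min(delay * multiplier, maximum)

# First eight delay ceilings for the settings used in this notebook.
schedule = list(islice(backoff_ceilings(), 8))
print(schedule)
```

<span style="font-size:18px;">
Under these assumptions the ceiling reaches the cap on the sixth attempt and stays there until the 600 second timeout ends the retries.
</span>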

In [13]:
# An implementation of Wiki-Grounding Generation.
# - using gemini-2.0-flash for response generation
# - using a RAG-implementation to store groundings
# - create new groundings by similarity to topic
# - retrieve existing groundings by similarity to topic
class WikiGroundingGenerator:   
    def __init__(self, genai_client, rag_impl):
        self.client = genai_client
        self.rag = rag_impl
        with warnings.catch_warnings():
            warnings.simplefilter("ignore") # suppress beta-warning
            self.splitter = HTMLSemanticPreservingSplitter(
                headers_to_split_on=[("h2", "Main Topic"), ("h3", "Sub Topic")],
                separators=["\n\n", "\n", ". ", "! ", "? "],
                max_chunk_size=2000, # chunk by token limit of models/text-embedding-004
                chunk_overlap=50,
                preserve_links=True,
                preserve_images=True,
                preserve_videos=True,
                preserve_audio=True,
                elements_to_preserve=["table", "ul", "ol", "code"],
                denylist_tags=["script", "style", "head"],
                custom_handlers={"code": self.code_handler},
            )

    @retry.Retry(
        predicate=is_retriable,
        initial=2.0,
        maximum=64.0,
        multiplier=2.0,
        timeout=600,
    )
    def generate_answer(self, query: str, topic: str):
        result = self.rag.get_wiki_documents(topic)
        if len(result["documents"]) > 0:
            return self.rag.generate_with_wiki_passages(query, topic, result["documents"]).text
        else:
            pages = wikipedia.search(topic + " company")
            if len(pages) > 0:
                p_topic_match = 0.80
                for i in range(len(pages)):
                    if tqdm(self.get_topic_similarity(topic, pages[i]) > p_topic_match, 
                            desc= "Score wiki search by similarity to topic"):
                        request = requests.get(f"https://en.wikipedia.org/wiki/{pages[i]}")
                        documents = [document.page_content for document in self.splitter.split_text(request.text)]
                        self.rag.add_wiki_documents(topic, documents)
                        return self.rag.generate_with_wiki_passages(query, topic, documents).text

    def code_handler(self, element: Tag) -> str:
        data_lang = element.get("data-lang")
        code_format = f"<code:{data_lang}>{element.get_text()}</code>"
        return code_format

    @retry.Retry(predicate=is_retriable)
    def get_topic_similarity(self, topic: str, page: str):
        return self.similarity_with_retry(topic, page)

    # This should catch any errors that the retry helper misses.
    def similarity_with_retry(self, topic: str, page: str, max_retries=10, delay=3):
        content = [topic + " company", page]
        for attempt in range(max_retries):
            try:
                similarity = self.client.models.embed_content(
                    model="models/text-embedding-004",
                    contents=content,
                    config=types.EmbedContentConfig(task_type="semantic_similarity"))
                df = pandas.DataFrame([e.values for e in similarity.embeddings], index=content)
                score = df @ df.T
                return score.iloc[0].iloc[1]
            except Exception:
                time.sleep(delay * (attempt + 1))
        return None
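
<span style="font-size:18px;">
The score computed by <code>similarity_with_retry</code> is <code>df @ df.T</code>, a matrix of dot products. A dot product only equals cosine similarity when both vectors have unit length. The sketch below uses made-up three-dimensional vectors in place of real embeddings and normalises explicitly, so it does not rely on that assumption. The function name <code>cosine_similarity</code> is illustrative and not part of the notebook.
</span>

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)

# Made-up vectors standing in for real embeddings.
topic_vec = [3.0, 4.0, 0.0]
page_vec = [6.0, 8.0, 0.0]   # same direction as topic_vec, twice the length
other_vec = [0.0, 0.0, 5.0]  # orthogonal to topic_vec

# The raw dot product grows with vector length; the cosine does not.
raw_dot = sum(x * y for x, y in zip(topic_vec, page_vec))
print(raw_dot)
print(cosine_similarity(topic_vec, page_vec))
print(cosine_similarity(topic_vec, other_vec))
```

<span style="font-size:18px;">
If the embedding model already returns unit-length vectors, the two scores agree and the thresholds used above can be read as cosine similarities.
</span>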

In [14]:
# An implementation of Grounding Generation.
# - using gemini-2.0-flash with GoogleSearch tool for response generation
# - using a RAG-implementation to store groundings
# - create new groundings by exact match to topic
# - retrieve existing groundings by similarity to topic
class GroundingGenerator:
    config_ground = types.GenerateContentConfig(
        tools=[types.Tool(google_search=types.GoogleSearch())],
        temperature=0.0
    )
    
    def __init__(self, genai_client, rag_impl):
        self.client = genai_client
        self.rag = rag_impl

    def generate_answer(self, query: str, topic: str):
        docs = self.rag.get_grounding_documents(query, topic)
        if len(docs["documents"]) > 0:
            for i in range(len(docs["metadatas"])):
                doc = docs["documents"][i]
                meta_q = docs["metadatas"][i]["question"]
                p_ground_match = 0.95 # This can be really high ~ 95-97%
                if tqdm(self.get_grounding_similarity(query, meta_q) > p_ground_match,
                        desc="Score similarity to stored grounding"):
                    return json.loads(doc)[0]["text"] # Stored with json.dumps, so parse as JSON.
        return self.get_grounding(query, topic)

    @retry.Retry(predicate=is_retriable)
    def get_grounding_similarity(self, question: str, compare: str):
        return self.similarity_with_retry(question, compare)

    # This should catch any errors that the retry helper misses.
    def similarity_with_retry(self, question: str, compare: str, max_retries=10, delay=3):
        content = [question, compare]
        for attempt in range(max_retries):
            try:
                similarity = self.client.models.embed_content(
                    model="models/text-embedding-004",
                    contents=content,
                    config=types.EmbedContentConfig(task_type="semantic_similarity"))
                df = pandas.DataFrame([e.values for e in similarity.embeddings], index=content)
                score = df @ df.T
                return score.iloc[0].iloc[1]
            except Exception:
                time.sleep(delay * (attempt + 1))
        return None

    @retry.Retry(
        predicate=is_retriable,
        initial=2.0,
        maximum=64.0,
        multiplier=2.0,
        timeout=600,
    )
    def get_grounding(self, query: str, topic: str):
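        instructions = f"""
        You're a search assistant that provides grounded answers to questions about {topic}. You will provide only 
        results that discuss {topic}. Be brief and specific in answering and omit extra details.
        If an answer is not possible respond with: I don't know."""
        # Send the query and the instructions as two parts of one user turn. Using += with a
        # string would extend the list one character at a time.
        contents = [types.Content(role="user", 
                                  parts=[types.Part(text=query), types.Part(text=instructions)])]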
        response = self.client.models.generate_content(
            model=project_model, 
            config=self.config_ground, 
            contents=contents)
        if response.candidates[0].grounding_metadata.grounding_supports is not None:
            if topic.replace("'", "") not in response.text: # Exact topic match required
                return "I don't know." # Work around a bug in gemini-2.0-flash (MGM Studio becomes MGM Resorts)
            else:
                self.rag.add_grounded_document(query, topic, response)
                return response.text
        return "I don't know." # Empty grounding_supports means grounding not possible for query.

# Testing the RAG Implementation

<span style="font-size:18px;">
Let's load some test data and see what the RAG can do. The test data is a CSV file containing stock market exchange data. It includes the market id code, name, locale, and operating hours. The import will use CSVLoader from <code>langchain-community</code> to parse the exchange data into Documents that our RAG can ingest.
</span>
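
<span style="font-size:18px;">
To make the shape of those Documents concrete, here is a rough approximation of what the loader produces, written with the standard library only. Each row becomes one document whose text is a list of <code>column: value</code> lines, with the file name and row index kept as metadata. The helper <code>rows_to_documents</code>, the column names and the sample rows are assumptions for illustration, and the real exchanges file has more columns.
</span>

```python
import csv
import io

def rows_to_documents(csv_text, source):
    """Approximate the loader: one dict per row with page_content and metadata."""
    documents = []
    for i, row in enumerate(csv.DictReader(io.StringIO(csv_text))):
        # Each column becomes a "name: value" line in the document text.
        content = "\n".join(f"{key}: {value}" for key, value in row.items())
        documents.append({"page_content": content,
                          "metadata": {"source": source, "row": i}})
    return documents

# Made-up sample with assumed column names.
sample = "code,name\nL,LONDON STOCK EXCHANGE\nTO,TORONTO STOCK EXCHANGE\n"
docs = rows_to_documents(sample, "exchanges.csv")
print(docs[0]["page_content"])
print(docs[0]["metadata"])
```

<span style="font-size:18px;">
The <code>source</code> metadata matters here: <code>add_documents_list</code> copies it into the vector store, and <code>get_exchanges_csv</code> filters on it.
</span>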

In [15]:
# Load the exchange data from source csv.
# - Identifies exchanges by a 1-2 letter code which can be used to filter response data.
# - Also maps the exchange code to exchange details.
df = pandas.read_csv("/kaggle/input/exchanges/exchanges_src.csv").drop(["close_date"], axis=1).fillna("")
df.to_csv("exchanges.csv", index=False)
exchanges = CSVLoader(file_path="exchanges.csv", encoding="utf-8", csv_args={"delimiter": ","}).load()

# Prepare a RAG tool for use and add the exchange data.
tool_rag = RetrievalAugmentedGenerator(client, "finance")
tool_rag.add_documents_list(exchanges)

# Prepare the grounding tools for use.
tool_wiki = WikiGroundingGenerator(client, tool_rag)
tool_ground = GroundingGenerator(client, tool_rag)

Generate document embedding (batched): 0it [00:00, ?it/s]


<span style="font-size:18px;">
Now that the data is loaded, let's ask our RAG to perform some augmenting. We can ask it to perform all sorts of useful tasks. We'll generate some reusable data structures and check that it can answer important questions. The exchanges all have IDs which are used to filter the real-time data, so we'll make sure the RAG knows how to create this mapping. We'll also check its awareness of operating hours. After all, Essy doesn't mindlessly hammer away at APIs when no new data is available.
</span>
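
<span style="font-size:18px;">
A dictionary requested "in string form" still has to be parsed before it can be reused, and the reply may arrive wrapped in a markdown fence. Here is a minimal sketch of that step, assuming the reply contains exactly one JSON object. The helper <code>parse_dict_reply</code> and the sample reply are made up for illustration.
</span>

```python
import json

def parse_dict_reply(reply: str) -> dict:
    """Extract the outermost {...} span from a model reply and parse it as JSON."""
    start = reply.find("{")
    end = reply.rfind("}")
    if start == -1 or end < start:
        raise ValueError("no JSON object found in reply")
    return json.loads(reply[start:end + 1])

# Made-up reply imitating a fenced model response.
fence = "`" * 3
reply = (fence + "json\n"
         '{\n    "L": "LONDON STOCK EXCHANGE",\n    "TO": "TORONTO STOCK EXCHANGE"\n}\n'
         + fence)
codes = parse_dict_reply(reply)
print(codes["L"])
```

<span style="font-size:18px;">
A reply that is cut off before the closing brace raises an error instead of returning a partial mapping, which is the safer failure mode for a lookup table.
</span>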

In [16]:
# The RAG tool is a helpful expert.

response = tool_rag.get_exchanges_csv("""Give me a dictionary in string form. It must contain key:value pairs mapping 
                                         exchange code to name. Just the dictionary string in pretty form.""")
print(response.text)

response = tool_rag.get_exchanges_csv("""What is the Germany exchange code? Return only the exchange codes as a simple
                                         comma separated value that I can copy.""")
print(response.text)

response = tool_rag.get_exchanges_csv("What are the Germany exchanges and their corresponding exchange codes?")
print(response.text, "\n")

response = tool_rag.generate_answer("What are Google's stock ticker symbols?")
print(response.text, "\n")

response = tool_rag.get_exchanges_csv("What are the US exchange operating hours?")
print(response.text, "\n")

est = pytz.timezone('US/Eastern') # Exchanges data is in eastern time.
response = tool_rag.get_exchanges_csv(
    f"""Answer based on your knowledge of exchange operating hours.
        Do not answer in full sentences. Omit all chat and provide the answer only.
        All exchanges are open on weekdays. Weekdays are: Mon, Tue, Wed, Thu, Fri.
        Exchanges open and close on weekdays.
        
        The current date and time is: {datetime.now(est).strftime('%c')}
        
        When was the US exchange's last operating hours? Exclude weekends.
        Provide just the close. Include post-market hours.
        Answer with a date that uses this format: '%a %b %d %X %Y'.""")
print(response.text)

```
{
    "SC": "BOERSE_FRANKFURT_ZERTIFIKATE",
    "SX": "DEUTSCHE BOERSE Stoxx",
    "HK": "HONG KONG EXCHANGES AND CLEARING LTD",
    "DB": "DUBAI FINANCIAL MARKET",
    "NZ": "NEW ZEALAND EXCHANGE LTD",
    "QA": "QATAR EXCHANGE",
    "KS": "KOREA EXCHANGE (STOCK MARKET)",
    "SW": "SWISS EXCHANGE",
    "DU": "BOERSE DUESSELDORF",
    "BC": "BOLSA DE VALORES DE COLOMBIA",
    "KQ": "KOREA EXCHANGE (KOSDAQ)",
    "SN": "SANTIAGO STOCK EXCHANGE",
    "SI": "SINGAPORE EXCHANGE",
    "AD": "ABU DHABI SECURITIES EXCHANGE",
    "CO": "OMX NORDIC EXCHANGE COPENHAGEN A/S",
    "L": "LONDON STOCK EXCHANGE",
    "ME": "MOSCOW EXCHANGE",
    "TO": "TORONTO STOCK EXCHANGE",
    "BD": "BUDAPEST STOCK EXCHANGE",
    "TG": "DEUTSCHE BOERSE TradeGate",
    "US": "US exchanges (NYSE, Nasdaq)",
    "TW": "TAIWAN STOCK EXCHANGE",
    "JK": "INDONESIA STOCK EXCHANGE",
    "SZ": "SHENZHEN STOCK EXCHANGE",
    "VS": "NASDAQ OMX VILNIUS",
    "MX": "BOLSA MEXICANA DE VALORES (MEXICAN STOCK EXCHANGE)",
```

<span style="font-size:18px;">
Excellent! Though, despite my best efforts, I could not convince Gemini to apply date correction (during chaining) based on holidays. It simply wasn't stable enough to be useful. I would either have to add a holiday data set or (what I chose) apply a quick temporary fix. A real-time API endpoint may fail because the selected date is a holiday. If that happens I'll just retry with Thursday if the failure happened on a Friday, and likewise with Friday if the failure happened on a Monday. Crude but simple for foundational purposes.
</span>
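
<span style="font-size:18px;">
The fallback rule amounts to stepping back to the closest earlier weekday. Here is a minimal sketch of that rule, with a made-up helper name <code>previous_weekday</code>. It knows nothing about holidays, so a caller would simply apply it again if the retried date fails as well.
</span>

```python
from datetime import date, timedelta

def previous_weekday(day: date) -> date:
    """Return the closest weekday strictly before `day`."""
    candidate = day - timedelta(days=1)
    while candidate.weekday() >= 5:  # 5 is Saturday, 6 is Sunday
        candidate -= timedelta(days=1)
    return candidate

# 2025-04-18 was a Friday and a US market holiday (Good Friday).
print(previous_weekday(date(2025, 4, 18)))  # Thursday 2025-04-17
print(previous_weekday(date(2025, 4, 21)))  # Monday steps back to Friday 2025-04-18
```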

# Declaring the Function Calling Metadata

<span style="font-size:18px;">
Our Function Calling expert will chain together the other experts we've implemented thus far. It also provides the final response through augmentation, this time using the tools as a source of grounding truth. I'd like to say it's all truth organised by topic and other metadata. It's still a precarious situation if Essy incidentally chains into mining data on another topic. We want Amazon to be the owner of MGM Studios, not MGM Resorts International. We also don't want a summary to include another company unless that company is a peer.
</span>

<span style="font-size:18px;">
The function calling metadata is thus extremely important. It needs to combine our other experts with the real-time API data. Essy will use two API providers as sources of finance data. The primary motivation is that each provider has its own limits, yet both are useful in their own way. This is useful anywhere you need a broad spectrum of sources of truth. At metadata creation I'll adopt the naming convention of appending the provider id (if any) to the function name. This keeps functions understandable because you always know which provider you're dealing with.
</span>
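
<span style="font-size:18px;">
As a small illustration of that convention, the sketch below splits a function name into its base name and provider id. The helper <code>split_provider</code> and the provider labels are made up for illustration. The function names themselves come from the declarations in this notebook.
</span>

```python
def split_provider(function_name: str):
    """Split 'get_symbol_1' into ('get_symbol', 1); names without an id give (name, None)."""
    base, sep, suffix = function_name.rpartition("_")
    if sep and suffix.isdigit():
        return base, int(suffix)
    return function_name, None

# Hypothetical labels; the notebook only numbers its providers.
PROVIDERS = {1: "provider-1", 2: "provider-2"}

for name in ["get_symbol_1", "get_daily_candlestick_2", "get_wiki_tool_response"]:
    base, provider_id = split_provider(name)
    print(base, "->", PROVIDERS.get(provider_id, "local tool"))
```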

In [17]:
# Declare callable functions using OpenAPI schema.
get_symbol_1 = types.FunctionDeclaration(
    name="get_symbol_1",
    description="""Search for the stock ticker symbol of a given company, security, isin or cusip. Each ticker
                   entry provides a description, symbol, and asset type. If this doesn't help you should try 
                   calling get_wiki_tool_response next.""",
    parameters={
        "type": "object",
        "properties": {
            "q": {
                "type": "string",
                "description": """The company, security, isin or cusip to search for a symbol."""
            },
            "exchange": {
                "type": "string",
                "description": """The exchange code used to filter results. When not specified the default exchange 
                                  code you should use is 'US' for the US exchanges. A dictionary mapping all supported 
                                  exchange codes to their names can be retrieved by calling get_exchange_codes_1. 
                                  Search for an exchange code to use by calling get_exchange_code_1, specifying the
                                  exchange code to search for."""
            },
            "query": {
                "type": "string",
                "description": "The question you're attempting to answer."
            }
        },
        "required": ["q", "exchange", "query"]
    }
)

get_name_1 = types.FunctionDeclaration(
    name="get_name_1",
    description="""Search for the name associated with a stock ticker or symbol's company, security, isin or cusip. 
    Each ticker entry provides a description, matching symbol, and asset type.""",
    parameters={
        "type": "object",
        "properties": {
            "q": {
                "type": "string",
                "description": """The symbol or ticker to search for."""
            },
            "exchange": {
                "type": "string",
                "description": """The exchange code used to filter results. When not specified the default exchange 
                                  code you should use is 'US' for the US exchanges. A dictionary mapping all supported 
                                  exchange codes to their names can be retrieved by calling get_exchange_codes_1. 
                                  Search for an exchange code to use by calling get_exchange_code_1, specifying the
                                  exchange code to search for."""
            },
            "query": {
                "type": "string",
                "description": "The question you're attempting to answer."
            },
            "company": {
                "type": "string",
                "description": "The company you're searching for."
            }
        },
        "required": ["q", "exchange", "query", "company"]
    }
)

get_symbol_quote_1 = types.FunctionDeclaration(
    name="get_symbol_quote_1",
    description="""Search for the current price or quote of a stock ticker or symbol. The response is
                   provided in json format. Each response contains the following key-value pairs:
                   
                   c: Current price,
                   d: Change,
                  dp: Percent change,
                   h: High price of the day,
                   l: Low price of the day,
                   o: Open price of the day,
                  pc: Previous close price,
                   t: Epoch timestamp of price in seconds.

                   Parse the response and respond according to this information.""",
    parameters={
        "type": "object",
        "properties": {
            "symbol": {
                "type": "string",
                "description": "The stock ticker symbol for a company, security, isin, or cusip." 
            },
            "query": {
                "type": "string",
                "description": "The question you're attempting to answer."
            },
            "exchange": {
                "type": "string",
                "description": "The exchange code used to filter quotes. This must always be 'US'."
            }
        },
        "required": ["symbol", "query", "exchange"]
    }
)

get_local_datetime_1 = types.FunctionDeclaration(
    name="get_local_datetime_1",
    description="""Converts an array of timestamps from epoch time to the local timezone format. The result is an array
                   of date and time in locale appropriate format. Suitable for use in a locale appropriate response.
                   Treat this function as a vector function. Always prefer to batch timestamps for conversion. Use this
                   function to format date and time in your responses.""",
    parameters={
        "type": "object",
        "properties": {
            "t": {
                "type": "array",
                "description": """An array of timestamps in seconds since epoch to be converted. The order of
                                  timestamps matches the order of conversion.""",
                "items": {
                    "type": "integer"
                }
            }
        },
        "required": ["t"]
    }
)

get_market_status_1 = types.FunctionDeclaration(
    name="get_market_status_1",
    description="""Get the current market status of global exchanges. Includes whether exchanges are open or closed.  
                   Also includes holiday details if applicable. The response is provided in json format. Each response 
                   contains the following key-value pairs:

                   exchange: Exchange code,
                   timezone: Timezone of the exchange,
                    holiday: Holiday event name, or null if it's not a holiday,
                     isOpen: Whether the market is open at the moment,
                          t: Epoch timestamp of status in seconds (Eastern Time),
                    session: The market session can be 1 of the following values: 
                    
                    pre-market,regular,post-market when open, or null if closed.
                    
                    Parse the response and respond according to this information.""",
    parameters={
        "type": "object",
        "properties": {
            "exchange": {
                "type": "string",
                "description": """The exchange code used to filter results. The default if omitted is 'US' for the 
                                  US exchanges. A dictionary mapping supported exchange codes (key) to their 
                                  description (value) can be obtained from get_exchange_codes_1. Search the values for
                                  a matching exchange code if unsure."""
            }
        },
        "required": ["exchange"]
    }
)

get_company_peers_1 = types.FunctionDeclaration(
    name="get_company_peers_1",
    description="""Search for a company's peers. Returns a list of peers operating in the same country and in the same
                   sector, industry, or subIndustry. Each response contains the following key-value pairs: 
                   
                   symbol: The company's stock ticker symbol, 
                   peers: A list containing the peers.
                   
                   Each peers entry contains the following key-value pairs:
                   
                   symbol: The peer company's stock ticker symbol, 
                   name: The peer company's name.
                   
                   Parse the response and respond according to this information.""",
    parameters={
        "type": "object",
        "properties": {
            "symbol": {
                "type": "string",
                "description": "The stock ticker symbol of a company to obtain peers."
            },
            "grouping": {
                "type": "string",
                "description": """Specify the grouping category for choosing peers. When not specified the default
                                  category is subIndustry. This parameter may be one of the following values: 
                                  sector, industry, subIndustry."""
            },
            "exchange": {
                "type": "string",
                "description": """The exchange code used to filter results. When not specified the default exchange 
                                  code you should use is 'US' for the US exchanges. A dictionary mapping all supported 
                                  exchange codes to their names can be retrieved by calling get_exchange_codes_1. 
                                  Search for an exchange code to use by calling get_exchange_code_1, specifying the
                                  exchange code to search for."""
            },
            "query": {
                "type": "string",
                "description": "The question you're attempting to answer."
            }
        },
        "required": ["symbol", "grouping", "exchange", "query"]
    }
)

get_exchange_codes_1 = types.FunctionDeclaration(
    name="get_exchange_codes_1",
    description="""Get a dictionary mapping all supported exchange codes to their names."""
)

get_exchange_code_1 = types.FunctionDeclaration(
    name="get_exchange_code_1",
    description="""Search for the exchange code to use when filtering by exchange. The result will be one or
                   more exchange codes provided as a comma-separated string value.""",
    parameters={
        "type": "object",
        "properties": {
            "q": {
                "type": "string",
                "description": "Specifies which exchange code to search for."
            }
        },
        "required": ["q"]
    }
)

get_financials_1 = types.FunctionDeclaration(
    name="get_financials_1",
    description="""Get company basic financials such as margin, P/E ratio, 52-week high/low, etc. Parse the response for 
                   key-value pairs in json format and interpret their meaning as stock market financial indicators.""",
    parameters={
        "type": "object",
        "properties": {
            "symbol": {
                "type": "string",
                "description": "Stock ticker symbol for a company."
            },
            "metric": {
                "type": "string",
                "description": "It must always be declared as the value 'all'"
            },
            "query": {
                "type": "string",
                "description": "The question you're attempting to answer."
            }
        },
        "required": ["symbol", "metric", "query"]
    }
)

get_daily_candlestick_2 = types.FunctionDeclaration(
    name="get_daily_candlestick_2",
    description="""Get a historical daily stock ticker candlestick / aggregate bar (OHLC). 
                   Includes historical daily open, high, low, and close prices. Also includes historical daily trade
                   volume and pre-market/after-hours trade prices. It does not provide today's data until after 
                   11:59PM Eastern Time.""",
    parameters={
        "type": "object",
        "properties": {
            "stocksTicker": {
                "type": "string",
                "description": "The stock ticker symbol of a company to search for.",
            },
            "date": {
                "type": "string",
                "format": "date-time",
                "description": """The date of the requested candlestick in format YYYY-MM-DD. The default is one 
                                  weekday prior to get_last_market_close (excluding weekends). This date must never 
                                  be more recent than the default. Replace more recent dates with the default."""
            },
            "adjusted": {
                "type": "string",
                "description": """May be true or false. Indicates whether or not the results are adjusted for splits. 
                                  By default, results are adjusted. Set this to false to get results that are NOT 
                                  adjusted for splits."""
            },
            "query": {
                "type": "string",
                "description": "The question you're attempting to answer."
            }
        },
        "required": ["stocksTicker", "date", "adjusted", "query"]
    },
)

get_company_news_1 = types.FunctionDeclaration(
    name="get_company_news_1",
    description="Retrieve the most recent news articles related to a specified ticker.",
    parameters={
        "type": "object",
        "properties": {
            "symbol": {
                "type": "string",
                "description": "Stock ticker symbol for a company.",
            },
            "from": {
                "type": "string",
                "format": "date-time",
                "description": """A date in format YYYY-MM-DD. It must be older than the parameter 'to'. The default
                                  value is one month ago."""
            },
            "to": {
                "type": "string",
                "format": "date-time",
                "description": """A date in format YYYY-MM-DD. It must be more recent than the parameter 'from'. The
                                  default value is today's date."""
            },
            "query": {
                "type": "string",
                "description": "The question you're attempting to answer."
            }
        },
        "required": ["symbol", "from", "to", "query"]
    },
)

get_custom_candlestick_2 = types.FunctionDeclaration(
    name="get_custom_candlestick_2",
    description="""Get a historical stock ticker candlestick / aggregate bar (OHLC) over a custom date range and 
                   time interval in Eastern Time. Includes historical open, high, low, and close prices. Also 
                   includes historical daily trade volume and pre-market/after-hours trade prices. It does not
                   include today's open, high, low, or close until after 11:59PM Eastern Time.""",
    parameters={
        "type": "object",
        "properties": {
            "stocksTicker": {
                "type": "string",
                "description": "The stock ticker symbol of a company to search for.",
            },
            "multiplier": {
                "type": "integer",
                "description": "This must be 1 unless told otherwise."
            },
            "timespan": {
                "type": "string",
                "description": """The size of the candlestick's time window. This is allowed to be one of the following:
                                  second, minute, hour, day, week, month, quarter, or year. The default value is day."""
            },
            "from": {
                "type": "string",
                "format": "date-time",
                "description": """A date in format YYYY-MM-DD. It must be older than the parameter 'to'. The default
                                  value is one month ago from today's date."""
            },
            "to": {
                "type": "string",
                "format": "date-time",
                "description": """A date in format YYYY-MM-DD. It must be more recent than the parameter 'from'. The 
                                  default is one weekday prior to get_last_market_close (excluding weekends).
                                  Replace more recent dates with the default."""
            },
            "adjusted": {
                "type": "string",
                "description": """May be true or false. Indicates whether or not the results are adjusted for splits. 
                                  By default, results are adjusted. Set this to false to get results that are NOT 
                                  adjusted for splits."""
            },
            "sort": {
                "type": "string",
                "description": """May be one of asc or desc. asc will sort by timestamp in ascending order. desc will
                                  sort by timestamp in descending order."""
            },
            "limit": {
                "type": "integer",
                "description": """Set the number of base aggregates used to create this candlestick. This must be 5000 
                                  unless told to limit base aggregates to something else."""
            },
            "query": {
                "type": "string",
                "description": "The question you're attempting to answer."
            }
        },
        "required": ["stocksTicker", "multiplier", "timespan", "from", "to", "query", "adjusted", "sort", "limit"]
    },
)

get_last_market_close = types.FunctionDeclaration(
    name="get_last_market_close",
    description="""Get the date and time of the US exchange market's last close. Provides the last US market close in 
                   locale appropriate format."""
)

get_ticker_overview_2 = types.FunctionDeclaration(
    name="get_ticker_overview_2",
    description="""Retrieve comprehensive details for a single ticker symbol. It's a deep look into a company’s 
    fundamental attributes, including its primary exchange, standardized identifiers (CIK, composite FIGI, 
    share class FIGI), market capitalization, industry classification, and key dates. Also includes branding assets in
    the form of icons and logos.
    """,
    parameters={
        "type": "object",
        "properties": {
            "ticker": {
                "type": "string",
                "description": "Stock ticker symbol of a company."
            },
            "query": {
                "type": "string",
                "description": "The question you're attempting to answer."
            }
        },
        "required": ["ticker", "query"]
    }
)

get_recommendation_trends_1 = types.FunctionDeclaration(
    name="get_recommendation_trends_1",
    description="""Get the latest analyst recommendation trends for a company.
                The data includes the latest recommendations as well as historical
                recommendation data for each month. The data is classified according
                to these categories: strongBuy, buy, hold, sell, and strongSell.
                The date of a recommendation is indicated by the value of 'period'.""",
    parameters={
        "type": "object",
        "properties": {
            "symbol": {
                "type": "string",
                "description": "Stock ticker symbol for a company."
            },
            "query": {
                "type": "string",
                "description": "The question you're attempting to answer."
            }
        },
        "required": ["symbol", "query"]
    }
)

get_news_with_sentiment_2 = types.FunctionDeclaration(
    name="get_news_with_sentiment_2",
    description="""Retrieve the most recent news articles related to a specified ticker. Each article includes 
                   comprehensive coverage, including a summary, publisher information, article metadata, 
                   and sentiment analysis.""",
    parameters={
        "type": "object",
        "properties": {
            "ticker": {
                "type": "string",
                "description": "Stock ticker symbol for a company."
            },
            "published_utc": {
                "type": "string",
                "format": "date-time",
                "description": """Omit this parameter unless you're told to filter by published_utc."""
            },
            "order": {
                "type": "string",
                "description": """Must be desc for descending order, or asc for ascending order.
                                  When order is not specified the default is descending order.
                                  Ordering will be based on the parameter: sort."""
            },
            "limit": {
                "type": "integer",
                "description": """This must be 100 unless told to limit news results to something else."""
            },
            "sort": {
                "type": "string",
                "description": """The sort field used for ordering. This value must
                                  always be published_utc."""
            },
            "query": {
                "type": "string",
                "description": "The question you're attempting to answer."
            }
        },
        "required": ["ticker", "order", "limit", "sort", "query"]
    }
)

get_rag_tool_response = types.FunctionDeclaration(
    name="get_rag_tool_response",
    description="""A database containing useful financial information. Always check here for answers first.""",
    parameters={
        "type": "object",
        "properties": {
            "question": {
                "type": "string",
                "description": "A question needing an answer. Asked as a simple string."
            }
        }
    }
)

get_wiki_tool_response = types.FunctionDeclaration(
    name="get_wiki_tool_response",
    description="""Answers questions that still have unknown answers. Retrieve a wiki page related to a company, 
                   product, or service. Each web page includes detailed company information, financial indicators, 
                   tickers, symbols, history, and products and services.""",
    parameters={
        "type": "object",
        "properties": {
            "id": {
                "type": "string",
                "description": "The question's company or product. Just the name and no other details."
            },
            "q": {
                "type": "string",
                "description": "The complete, unaltered, query string."
            }
        },
        "required": ["id", "q"]
    }
)

get_search_tool_response = types.FunctionDeclaration(
    name="get_search_tool_response",
    description="Answers questions that still have unknown answers. Use it after checking all your other tools.",
    parameters={
        "type": "object",
        "properties": {
            "q": {
                "type": "string",
                "description": "The question needing an answer. Asked as a simple string."
            },
            "id": {
                "type": "string",
                "description": "The question's company or product. In one word. Just the name and no other details."
            }
        },
        "required": ["q", "id"]
    }
)
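
Each declaration above lists its mandatory parameters under `required`. Before dispatching a predicted call, it can help to confirm the model actually supplied them. The following is a minimal sketch, assuming the schema is available as a plain dict; `missing_required` and `news_schema` are hypothetical names that do not appear in the notebook.

```python
def missing_required(schema: dict, args: dict) -> list[str]:
    """Return the required parameter names that are absent from args."""
    return [name for name in schema.get("required", []) if name not in args]

# Hypothetical schema mirroring the shape of the get_company_news_1 parameters.
news_schema = {
    "type": "object",
    "properties": {
        "symbol": {"type": "string"},
        "from": {"type": "string"},
        "to": {"type": "string"},
        "query": {"type": "string"},
    },
    "required": ["symbol", "from", "to", "query"],
}

# Arguments as the model might predict them, with the date range left out.
predicted_args = {"symbol": "AMZN", "query": "Any recent Amazon news?"}
print(missing_required(news_schema, predicted_args))  # ['from', 'to']
```

A non-empty result could be returned to the model as the function response so it can retry with the missing values, rather than raising a `KeyError` inside the handler.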

# Implementing the Function Calls

<span style="font-size:18px;">
Because this section is the core of the project, there was little time left to refactor it. Our formative Essy draws as much useful data as it can from two financial APIs. To use it, you will need to declare secrets for the <a class="anchor-link" href="https://finnhub.io/dashboard">Finnhub</a> and <a class="anchor-link" href="https://polygon.io/dashboard">Polygon</a> finance APIs. Register at their respective sites for a free API key, then import each secret the same way you set up Google's API key.
</span>
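
When running outside Kaggle, the secrets client is not available. The sketch below shows one possible way to load a key in either setting. The environment-variable fallback and the helper name `load_secret` are assumptions for illustration and are not part of the notebook.

```python
import os

def load_secret(name: str) -> str:
    """Read a secret from Kaggle when available, else from an environment variable."""
    try:
        # kaggle_secrets is only importable inside a Kaggle notebook session.
        from kaggle_secrets import UserSecretsClient
        return UserSecretsClient().get_secret(name)
    except ImportError:
        # Assumed fallback: the same secret name exported as an environment variable.
        value = os.environ.get(name)
        if not value:
            raise KeyError(f"Secret {name!r} is not set")
        return value

# Example usage (requires the secret to exist):
# FINNHUB_API_KEY = load_secret("FINNHUB_API_KEY")
# POLYGON_API_KEY = load_secret("POLYGON_API_KEY")
```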

In [28]:
# Implement the callable functions and the function handler.

@retry.Retry(
    predicate=is_retriable,
    initial=2.0,
    maximum=64.0,
    multiplier=2.0,
    timeout=600,
)
def ask_rag_tool(content):
    return tool_rag.generate_answer(content["question"]).text

@retry.Retry(
    predicate=is_retriable,
    initial=2.0,
    maximum=64.0,
    multiplier=2.0,
    timeout=600,
)
def ask_wiki_tool(content):
    return tool_wiki.generate_answer(content["q"], content["id"])

@retry.Retry(
    predicate=is_retriable,
    initial=2.0,
    maximum=64.0,
    multiplier=2.0,
    timeout=600,
)
def ask_search_tool(content):
    return tool_ground.generate_answer(content["q"], content["id"])

@retry.Retry(
    predicate=is_retriable,
    initial=2.0,
    maximum=64.0,
    multiplier=2.0,
    timeout=600,
)
def rag_exchange_codes_1(content):
    response = tool_rag.get_exchanges_csv("""Give me a dictionary in string form. It must contain key:value pairs 
                                             mapping exchange code to name. Just the dictionary string.
                                             Omit all other information or details. Do not chat or use sentences.""")
    codes = list(ast.literal_eval(response.text.strip("`")).items())
    return codes

@retry.Retry(
    predicate=is_retriable,
    initial=2.0,
    maximum=64.0,
    multiplier=2.0,
    timeout=600,
)
def rag_exchange_code_1(content):
    codes = tool_rag.get_exchanges_csv(f"""What is the {content} exchange code? Return only the exchange codes 
                                           as a list in string form. Just the list string.
                                           Omit all other information or details. Do not chat or use sentences.""").text
    return ast.literal_eval(codes)

@retry.Retry(
    predicate=is_retriable,
    initial=2.0,
    maximum=64.0,
    multiplier=2.0,
    timeout=600,
)
def rag_last_market_close(content):
    est = pytz.timezone('US/Eastern') # Exchanges data is in eastern time.
    return tool_rag.get_exchanges_csv(
        f"""Answer based on your knowledge of exchange operating hours.
        Do not answer in full sentences. Omit all chat and provide the answer only.
        All exchanges are open on weekdays. Weekdays are: Mon, Tue, Wed, Thu, Fri.
        Exchanges open and close on weekdays.
        
        The current date and time is: {datetime.now(est).strftime('%c')}
        
        When was the US exchange's last operating hours? Exclude weekends.
        Provide just the close. Include post-market hours.
        Answer with a date that uses this format: '%a %b %d %X %Y'.""").text

@retry.Retry(
    predicate=is_retriable,
    initial=2.0,
    maximum=64.0,
    multiplier=2.0,
    timeout=600,
)
def get_similarity_score(content):
    similarity = client.models.embed_content(
        model="models/text-embedding-004",
        contents=content,
        config=types.EmbedContentConfig(task_type="semantic_similarity"))
    df = pandas.DataFrame([e.values for e in similarity.embeddings], index=content)
    score = df @ df.T
    return score.iloc[0].iloc[1]
    
def impl_get_symbol_1(content, by_name: bool = True):
    response = tool_rag.get_api_documents(content["query"], content["q"], "get_symbol_1")
    if len(response[0]) == 0: # index [0] for document content
        url = f"https://finnhub.io/api/v1/search?q={content['q']}&exchange={content['exchange']}&token={FINNHUB_API_KEY}"
        try:
            response = json.loads(requests.get(url).text)
        except:
            return "I don't know."
        else:
            matches = []
            max_failed_match = len(response["result"]) if not by_name else 3
            p_desc_match = 0.80
            p_symb_match = 0.95
            if response["count"] > 0:
                for result in tqdm(response["result"], desc="Score similarity to query"):
                    if max_failed_match > 0:
                        desc = [content['q'].upper(), result["description"].split("-", -1)[0]]
                        symb = [content['q'].upper(), result["symbol"]]
                        if by_name and get_similarity_score(desc) > p_desc_match: 
                            matches.append(result["symbol"])
                        elif not by_name and get_similarity_score(symb) > p_symb_match:
                            matches.append(result["description"])
                            max_failed_match = 0
                        else:
                            max_failed_match -= 1
            if len(matches) > 0:
                tool_rag.add_api_document(content["query"], matches, content["q"], "get_symbol_1")
                return matches
            else:
                return "I don't know."
    else:
        doc = ast.literal_eval(response[0][0])[0]
        return doc["answer"]

def impl_get_name_1(content):
    return impl_get_symbol_1(content, by_name = False)

def impl_get_quote_1(content):
    quotes = tool_rag.get_api_documents(content["query"], content["symbol"], "get_quote_1")
    isOpen = dict(impl_get_market_status_1(content))["isOpen"]
    if len(quotes[0]) == 0 or isOpen: 
        return get_current_price_1(content)
    else:
        last_close = parse(rag_last_market_close(content)).timestamp()
        for quote in quotes[2]: # index [2] for metadata
            if quote["timestamp"] >= last_close:
                return quotes
        return get_current_price_1(content)

def get_current_price_1(content):
    url = f"https://finnhub.io/api/v1/quote?symbol={content['symbol']}&token={FINNHUB_API_KEY}"
    # This is a high-demand endpoint. Expect random failure under heavy (free) use.
    try:
        response = json.loads(requests.get(url).text)
    except:
        return "I don't know."
    else:
        if len(response) > 0 and response["t"] > 0:
            tool_rag.add_quote_document(content["query"], response, content["symbol"], response["t"], "get_quote_1")
            return list(response.items())
        return "I don't know."

def impl_get_market_status_1(content):
    url = f"https://finnhub.io/api/v1/stock/market-status?exchange={content['exchange']}&token={FINNHUB_API_KEY}"
    try:
        response = json.loads(requests.get(url).text)
    except:
        return "I don't know."
    else:
        if len(response) > 0:
            return list(response.items())
        return "I don't know."

def impl_get_peers_1(content):
    docs = tool_rag.get_peers_document(content["query"], content["symbol"], content['grouping'])
    if len(docs[0]) == 0: # index [0] for document content
        url = f"https://finnhub.io/api/v1/stock/peers?symbol={content['symbol']}&grouping={content['grouping']}&token={FINNHUB_API_KEY}"
        try:
            peers = json.loads(requests.get(url).text)
        except:
            return "I don't know."
        else:
            if len(peers) > 0:
                names = []
                for peer in peers:
                    if peer == content["symbol"]:
                        continue # skip including the query symbol in peers (included in metadata anyway)
                    name_lookup = dict(q=peer, exchange=content["exchange"], query=content["query"])
                    name = impl_get_name_1(name_lookup)
                    if name != "I don't know.":
                        p = {"symbol": peer, "name": name}
                        names.append(p)
                peers = {"symbol": content["symbol"], "peers": names}
                tool_rag.add_peers_document(content["query"], peers, content["symbol"], "get_peers_1", content['grouping'])
                return list(peers.items())
            return "I don't know."
    else:
        peers = ast.literal_eval(docs[0][0])[0]["answer"] # The first document should be most relevant.
        return list(peers.items())

def impl_local_datetime_1(content):
    local_t = []
    for timestamp in content["t"]:
        local_t.append(local_date_from_epoch(timestamp))
    return local_t

def local_date_from_epoch(timestamp):
    est = pytz.timezone('US/Eastern') # Exchanges data is in eastern time.
    if len(str(timestamp)) == 13:
        return datetime.fromtimestamp(timestamp/1000, tz=est).strftime('%c')
    else:
        return datetime.fromtimestamp(timestamp, tz=est).strftime('%c')

def impl_get_financials_1(content):
    fins = tool_rag.get_api_documents(content["query"], content["symbol"], "get_financials_1")
    if len(fins[0]) == 0:
        url = f"https://finnhub.io/api/v1/stock/metric?symbol={content['symbol']}&metric={content['metric']}&token={FINNHUB_API_KEY}"
        try:
            fin = json.loads(requests.get(url).text)
        except:
            return "I don't know."
        else:
            if not fin:
                return "I don't know."
            tool_rag.add_api_document(content["query"], fin, content["symbol"], "get_financials_1")
            return list(fin.items())
    return fins

def impl_get_news_1(content):
    #news = tool_rag.get_api_documents(content["query"], content["symbol"], "get_news_1")
    #if len(news[0]) == 0:
        url = f"https://finnhub.io/api/v1/company-news?symbol={content['symbol']}&from={content['from']}&to={content['to']}&token={FINNHUB_API_KEY}"
        try:
            news = json.loads(requests.get(url).text)
        except:
            return "I don't know."
        else:
            if len(news) == 0:
                return "I don't know."
            #tool_rag.add_api_document(content["query"], news, content["symbol"], "get_news_1")
            return news # Finnhub returns a JSON array of articles, which has no .items().
    #return news

def impl_daily_candle_2(content):
    daily_candle = tool_rag.get_api_documents(content["query"], content["stocksTicker"], "daily_candle_2")
    if len(daily_candle[0]) == 0:
        url = f"https://api.polygon.io/v1/open-close/{content['stocksTicker']}/{content['date']}?adjusted={content['adjusted']}&apiKey={POLYGON_API_KEY}"
        try:
            request = requests.get(url)
            daily_candle = json.loads(request.text) # Parse as JSON; literal_eval fails on true/false/null.
        except:
            return f"I don't know. Endpoint returned status {request.status_code}"
        else:
            if daily_candle["status"] in ["OK","DELAYED"]:
                tool_rag.add_api_document(content["query"], daily_candle, content["stocksTicker"], "daily_candle_2")
                return list(daily_candle.items())
            else:
                date = parse(content["date"])
                new_date = None
                if date.weekday() == 4: # index 4 for friday
                    new_date = date - timedelta(days=1)
                elif date.weekday() == 0: # index 0 for monday
                    new_date = date - timedelta(days=3)
                if new_date is None:
                    return "I don't know."
                content["date"] = new_date.strftime("%Y-%m-%d")
                return impl_daily_candle_2(content)
    return daily_candle

def impl_custom_candle_2(content):
    url = f"""https://api.polygon.io/v2/aggs/ticker/{content['stocksTicker']}/range/{content['multiplier']}/{content['timespan']}/{content['from']}/{content['to']}?adjusted={content['adjusted']}&sort={content['sort']}&limit={content['limit']}&apiKey={POLYGON_API_KEY}"""
    try:
        request = requests.get(url)
        custom_candle = json.loads(request.text)
    except:
        return f"I don't know. Endpoint returned status {request.status_code}"
    else:
        if custom_candle["status"] in ["OK","DELAYED"]:
            tool_rag.add_api_document(content["query"], custom_candle, content["stocksTicker"], "custom_candle_2")
            return list(custom_candle.items())
        return "I don't know."

def impl_ticker_overview_2(content):
    overview = tool_rag.get_api_documents(content["query"], content["ticker"], "ticker_overview_2")
    if len(overview[0]) == 0:
        url = f"https://api.polygon.io/v3/reference/tickers/{content['ticker']}?apiKey={POLYGON_API_KEY}"
        try:
            request = requests.get(url)
            overview = json.loads(request.text)
        except:
            return f"I don't know. Endpoint returned status {request.status_code}"
        else:
            if overview["status"] in ["OK","DELAYED"]:
                tool_rag.add_api_document(content["query"], overview, content["ticker"], "ticker_overview_2")
                return list(overview.items())
            return "I don't know."
    return overview

def impl_trends_1(content):
    trends = tool_rag.get_api_documents(content["query"], content["symbol"], "trends_1")
    if len(trends[0]) == 0:
        url = f"https://finnhub.io/api/v1/stock/recommendation?symbol={content['symbol']}&token={FINNHUB_API_KEY}"
        try:
            trends = json.loads(requests.get(url).text)
        except:
            return "I don't know."
        else:
            if len(trends) > 0:
                tool_rag.add_api_document(content["query"], trends, content["symbol"], "trends_1")
                return trends
            return "I don't know."
    return trends

def impl_get_news_2(content):
    #news = tool_rag.get_api_documents(content["query"], content["ticker"], "get_news_2")
    #if len(news[0]) == 0:
        url = f"https://api.polygon.io/v2/reference/news?ticker={content['ticker']}&order={content['order']}&limit={content['limit']}&sort={content['sort']}&apiKey={POLYGON_API_KEY}"
        try:
            request = requests.get(url)
            news = json.loads(request.text)
        except:
            return f"I don't know. Endpoint returned status {request.status_code}"
        else:
            if news["status"] in ["OK","DELAYED"]:
                #tool_rag.add_api_document(content["query"], news, content["ticker"], "get_news_2")
                return list(news.items())
            return "I don't know."
    #return news
        
finance_tool = types.Tool(
    function_declarations=[
        get_symbol_1,
        get_name_1,
        get_symbol_quote_1,
        get_market_status_1,
        get_company_peers_1,
        get_local_datetime_1,
        get_last_market_close,
        get_exchange_codes_1,
        get_exchange_code_1,
        get_financials_1,
        get_daily_candlestick_2,
        get_custom_candlestick_2,
        get_ticker_overview_2,
        get_recommendation_trends_1,
        get_news_with_sentiment_2,
        get_rag_tool_response,
        get_wiki_tool_response,
        get_search_tool_response
    ]
)

function_handler = {
    "get_symbol_1": impl_get_symbol_1,
    "get_name_1": impl_get_name_1,
    "get_symbol_quote_1": impl_get_quote_1,
    "get_market_status_1": impl_get_market_status_1,
    "get_company_peers_1": impl_get_peers_1,
    "get_local_datetime_1": impl_local_datetime_1,
    "get_last_market_close": rag_last_market_close,
    "get_exchange_codes_1": rag_exchange_codes_1,
    "get_exchange_code_1": rag_exchange_code_1,
    "get_financials_1": impl_get_financials_1,
    "get_daily_candlestick_2": impl_daily_candle_2,
    "get_custom_candlestick_2": impl_custom_candle_2,
    "get_ticker_overview_2": impl_ticker_overview_2,
    "get_recommendation_trends_1": impl_trends_1,
    "get_news_with_sentiment_2": impl_get_news_2,
    "get_rag_tool_response": ask_rag_tool,
    "get_wiki_tool_response": ask_wiki_tool,
    "get_search_tool_response": ask_search_tool
}
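
The date fallback in `impl_daily_candle_2` above only steps back when the requested date is a Friday or a Monday. A more general way to find the prior weekday is sketched here. It assumes that skipping Saturdays and Sundays is enough and deliberately ignores market holidays; `previous_weekday` is a hypothetical helper, not part of the notebook.

```python
from datetime import date, timedelta

def previous_weekday(day: date) -> date:
    """Return the closest weekday strictly before `day` (market holidays are ignored)."""
    day -= timedelta(days=1)
    while day.weekday() >= 5:  # 5 = Saturday, 6 = Sunday
        day -= timedelta(days=1)
    return day

print(previous_weekday(date(2025, 5, 5)))  # Monday   -> 2025-05-02 (Friday)
print(previous_weekday(date(2025, 5, 1)))  # Thursday -> 2025-04-30 (Wednesday)
```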

In [19]:
# Define the system prompt.

instruction = f"""You are a helpful and informative bot that answers finance and stock market questions. 
Only answer the question asked and do not change topic. While the answer is still
unknown you must follow these rules for predicting function call order:

RULE#1: Always consult your other functions before get_search_tool_response.
RULE#2: Always consult get_wiki_tool_response before get_search_tool_response.
RULE#3: Always consult get_search_tool_response last.
RULE#4: Always convert timestamps with get_local_datetime_1 and use the converted date/time in your response.
RULE#5: Always incorporate as much useful information as possible from tools and functions in your response."""

In [20]:
# Import the finance api secret keys.

POLYGON_API_KEY = UserSecretsClient().get_secret("POLYGON_API_KEY")
FINNHUB_API_KEY = UserSecretsClient().get_secret("FINNHUB_API_KEY")

In [21]:
# Implement the function calling expert.

@retry.Retry(
    predicate=is_retriable,
    initial=2.0,
    maximum=64.0,
    multiplier=2.0,
    timeout=600,
)
def send_message(prompt):
    #display(Markdown("#### Prompt"))
    #print(prompt, "\n")
    # Gemini's innate notion of current date and time is unstable.
    est = pytz.timezone('US/Eastern') # The finance api data is in eastern time.
    # Append the guidance to the prompt text. Using `contents += "..."` on the list would
    # extend it one character at a time instead of adding a single text part.
    full_prompt = prompt + f"""
    The current date and time is: {datetime.now(est).strftime('%c')}
    
    Give a concise and detailed summary. Use information that you learn from the API responses.
    Use your tools and function calls according to the rules. Convert any all-upper case identifiers
    to proper case in your response. Convert any abbreviated or shortened identifiers to their full forms.
    Convert timestamps according to the rules before including them.
    """
    # Define the user prompt part.
    contents = [types.Content(role="user", parts=[types.Part(text=full_prompt)])]
    # Enable system prompt, function calling and minimum-randomness.
    config_fncall = types.GenerateContentConfig(
        system_instruction=instruction,
        tools=[finance_tool],
        temperature=0.0
    )
    # Handle cases with multiple chained function calls.
    function_calling_in_process = True
    while function_calling_in_process:
        # Send the user prompt and function declarations.
        response = client.models.generate_content(
            model=project_model, config=config_fncall, contents=contents
        )
        # A part can be a function call or natural language response.
        for part in response.candidates[0].content.parts:
            if function_call := part.function_call:
                # Extract the function call.
                fn_name = function_call.name
                #display(Markdown("#### Predicted function name"))
                #print(fn_name, "\n")
                # Extract the function call arguments.
                fn_args = {key: value for key, value in function_call.args.items()}
                #display(Markdown("#### Predicted function arguments"))
                #print(fn_args, "\n")
                # Call the predicted function.
                api_response = function_handler[fn_name](fn_args)[:20000] # Stay within the input token limit
                #display(Markdown("#### API response"))
                #print(api_response[:500], "...", "\n")
                # Create an API response part.
                api_response_part = types.Part.from_function_response(
                    name=fn_name,
                    response={"content": api_response},
                )
                # Append the model's function call part.
                contents.append(types.Content(role="model", parts=[types.Part(function_call=function_call)])) 
                # Append the api response part.
                contents.append(types.Content(role="user", parts=[api_response_part]))
            else:
                # The model gave a natural language response
                function_calling_in_process = False
                break # No more parts in response.
        if not function_calling_in_process:
            break # The function calling chain is complete.
            
    # Show the final natural language summary
    display(Markdown("#### Natural language response"))
    display(Markdown(response.text.replace("$", "\\\\$")))
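
In `send_message`, the handler result is sliced with `[:20000]`. For a string this clips characters, but for a list it only limits the number of items, and an unknown function name raises a `KeyError`. The sketch below shows one possible guard, assuming that serialising to JSON text before clipping is acceptable. `call_tool` and `demo_handlers` are hypothetical names, and character count is only a rough proxy for tokens.

```python
import json

def call_tool(handlers: dict, name: str, args: dict, max_chars: int = 20000) -> str:
    """Run the handler registered under `name` and return its result as clipped text."""
    handler = handlers.get(name)
    if handler is None:
        return f"Unknown function: {name}"
    result = handler(args)
    # Serialise non-string results so the clip applies to characters, not list items.
    text = result if isinstance(result, str) else json.dumps(result, default=str)
    return text[:max_chars]

# Hypothetical stand-in handler; the notebook's handlers call Finnhub and Polygon.
demo_handlers = {"echo_symbol": lambda args: [("symbol", args["symbol"])]}

print(call_tool(demo_handlers, "echo_symbol", {"symbol": "AMZN"}))  # [["symbol", "AMZN"]]
print(call_tool(demo_handlers, "get_unknown", {}))  # Unknown function: get_unknown
```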

# Ask a question

In [22]:
send_message("What is the current price of Amazon stock?")

Generate quote embedding: 0it [00:00, ?it/s]


#### Natural language response

The current price of Amazon (AMZN) is \\$190.2 as of May 1, 2025, at 4:00 PM. The price changed by \\$5.78 today, a 3.1342% increase. The high price of the day was \\$191.807112, and the low was \\$187.5. The opening price for the day was \\$190.63, and the previous close price was \\$184.42.


In [23]:
send_message(
    """Tell me Amazon's current share price and provide candlestick data for the past month.
    Sort the data in descending order by date. Format the prices consistently as currency.
    Round prices to two decimal places.
    Present the data with multiple columns for display in markdown.""")

Generate quote embedding: 0it [00:00, ?it/s]
Generate api embedding (batched): 0it [00:00, ?it/s]


#### Natural language response

Here's a summary of Amazon's (AMZN) stock information:

**Current Share Price:** As of May 01, 2025, the current price is \\$190.20. This represents a change of \\$5.78, or 3.13%, from the previous close.

**Candlestick Data for the Past Month (descending order by date):**

| Date               | Open    | High    | Low     | Close   | Volume    |
| ------------------ | ------- | ------- | ------- | ------- | --------- |
| 2025-05-01         | \\$190.63 | \\$191.81 | \\$187.50 | \\$190.20 | 74,228,963|
| 2025-04-30         | \\$182.17 | \\$185.05 | \\$178.85 | \\$184.42 | 55,176,543|
| 2025-04-29         | \\$183.99 | \\$188.02 | \\$183.68 | \\$187.39 | 41,667,255|
| 2025-04-28         | \\$190.11 | \\$190.22 | \\$184.89 | \\$187.70 | 33,224,732|
| 2025-04-25         | \\$187.62 | \\$189.94 | \\$185.49 | \\$188.99 | 36,413,330|
| 2025-04-24         | \\$180.92 | \\$186.74 | \\$180.18 | \\$186.54 | 43,051,696|
| 2025-04-23         | \\$183.45 | \\$187.38 | \\$180.19 | \\$180.60 | 63,470,094|
| 2025-04-22         | \\$169.85 | \\$176.78 | \\$169.35 | \\$173.18 | 56,607,202|
| 2025-04-21         | \\$169.60 | \\$169.60 | \\$165.29 | \\$167.32 | 48,126,111|
| 2025-04-17         | \\$176.00 | \\$176.21 | \\$172.00 | \\$172.61 | 44,726,453|
| 2025-04-16         | \\$176.29 | \\$179.10 | \\$171.41 | \\$174.33 | 51,866,916|
| 2025-04-15         | \\$181.41 | \\$182.35 | \\$177.93 | \\$179.59 | 43,617,902|
| 2025-04-14         | \\$186.84 | \\$187.44 | \\$179.23 | \\$182.12 | 48,002,540|
| 2025-04-11         | \\$179.93 | \\$185.86 | \\$178.00 | \\$184.87 | 50,594,339|
| 2025-04-10         | \\$185.44 | \\$186.87 | \\$175.85 | \\$181.22 | 68,302,045|
| 2025-04-09         | \\$172.12 | \\$192.65 | \\$169.93 | \\$191.10 | 116,804,328|
| 2025-04-08         | \\$185.23 | \\$185.90 | \\$168.57 | \\$170.66 | 87,710,360|
| 2025-04-07         | \\$162.00 | \\$183.41 | \\$161.38 | \\$175.26 | 109,297,115|
| 2025-04-04         | \\$167.15 | \\$178.14 | \\$166.00 | \\$171.00 | 123,136,859|
| 2025-04-03         | \\$183.00 | \\$184.13 | \\$176.92 | \\$178.41 | 95,553,617|
| 2025-04-02         | \\$187.66 | \\$198.34 | \\$187.66 | \\$196.01 | 53,679,198|
| 2025-04-01         | \\$187.86 | \\$193.93 | \\$187.20 | \\$192.17 | 41,246,065|

**Note:** The times associated with the candlestick data is 00:00:00 for each day.
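
The prompt above asks the model to sort, round, and format the candles. The same steps can also be applied deterministically before display, which avoids relying on the model for arithmetic. This is a minimal sketch with pandas, assuming the candle data is available as a DataFrame; the column names and the `format_candles` helper are hypothetical, and the three sample rows are copied from the response above.

```python
import pandas as pd

# Sample rows taken from the table above (column names are assumed).
candles = pd.DataFrame(
    {
        "date": ["2025-04-29", "2025-05-01", "2025-04-30"],
        "open": [183.99, 190.63, 182.17],
        "high": [188.02, 191.807112, 185.05],
        "low": [183.68, 187.5, 178.85],
        "close": [187.39, 190.2, 184.42],
        "volume": [41667255, 74228963, 55176543],
    }
)


def format_candles(df: pd.DataFrame) -> str:
    """Sort candles newest first and render them as a markdown table."""
    out = df.copy()
    out["date"] = pd.to_datetime(out["date"])
    out = out.sort_values("date", ascending=False)
    out["date"] = out["date"].dt.strftime("%Y-%m-%d")

    # Escape the dollar sign so Jupyter Markdown does not read it as a math delimiter.
    prefix = r"\$"
    for col in ["open", "high", "low", "close"]:
        out[col] = out[col].map(lambda p: f"{prefix}{float(p):,.2f}")
    out["volume"] = out["volume"].map(lambda v: f"{int(v):,}")

    header = "| " + " | ".join(c.title() for c in out.columns) + " |"
    divider = "| " + " | ".join("---" for _ in out.columns) + " |"
    rows = ["| " + " | ".join(map(str, r)) + " |" for r in out.itertuples(index=False)]
    return "\n".join([header, divider, *rows])


print(format_candles(candles))
```

The returned string can be passed to `Markdown(...)` from `IPython.display`, which this notebook already imports.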


In [25]:
send_message(
    '''Tell me about Amazon's current bullish versus bearish predictions, and recommendation trends.
    Include a discussion of any short-term trends, and sentiment analysis.''')

#### Natural language response

As of May 2, 2025, here's a summary of the bullish versus bearish predictions and recommendation trends for Amazon, incorporating sentiment analysis and short-term trends from recent news:

**Recommendation Trends:**

*   Based on analyst recommendations from May 1, 2025, the sentiment is strongly bullish. There are 22 strong buy, 51 buy, and 6 hold recommendations. There are no sell or strong sell recommendations.
*   The trend in analyst recommendations has been consistently positive over the past few months.

**Sentiment Analysis:**

*   Recent news articles present a mixed sentiment regarding Amazon.
*   **Positive Sentiments:** Several articles highlight Amazon's strong position in AI, cloud services, and e-commerce, with analysts largely bullish on the company's long-term prospects. Some analysts predict that Amazon could surpass Nvidia, Microsoft, and Apple in market valuation within the next 5 years.
*   **Neutral Sentiments:** Some articles present a mixed outlook, citing both opportunities and risks for Amazon. While the company's cloud and advertising businesses are expected to show growth, the impact of US sanctions and tariffs remains a concern for investors.
*   **Negative Sentiments:** Several articles discuss the potential negative impact of tariffs on Amazon's business, as a significant portion of products sold on its platform are imported from China. Some articles also mention concerns about a potential slowdown in AI infrastructure demand and the impact of economic uncertainty on Amazon's cloud computing business.

**Short-Term Trends:**

*   **Tariff Concerns:** The potential impact of tariffs on Amazon's business is a significant short-term concern. Tariffs could raise prices on numerous types of goods, including electronics, textiles, and apparel, which could impact Amazon's e-commerce sales.
*   **AI and Cloud Growth:** Amazon's cloud and advertising segments are expected to show growth, and the company is making significant investments in AI infrastructure.
*   **Economic Uncertainty:** Concerns about a potential recession and its impact on digital advertising and cloud spending are also weighing on investor sentiment.

**Concise Summary:**

Amazon's recommendation trends are strongly bullish, with a majority of analysts recommending to buy or strongly buy the stock. However, sentiment analysis reveals a mixed outlook, with positive sentiments related to Amazon's strong position in AI, cloud services, and e-commerce, but also negative sentiments related to the potential impact of tariffs and economic uncertainty. Short-term trends indicate that tariff concerns and AI/cloud growth are key factors influencing investor sentiment.
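
To put a number on "strongly bullish", the analyst counts quoted in the response above can be reduced to a single share of buy-side recommendations. This is a minimal sketch; the dictionary keys and the `bullish_share` helper are hypothetical and may not match the field names returned by the data provider.

```python
# Analyst counts quoted in the response above (key names are assumed).
recommendations = {
    "strong_buy": 22,
    "buy": 51,
    "hold": 6,
    "sell": 0,
    "strong_sell": 0,
}


def bullish_share(counts: dict[str, int]) -> float:
    """Return the percentage of recommendations that are buy or strong buy."""
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return (counts["strong_buy"] + counts["buy"]) / total * 100


share = bullish_share(recommendations)
print(f"{share:.1f}% of {sum(recommendations.values())} analysts rate the stock buy or strong buy")
```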


In [32]:
send_message("What is Google's stock ticker symbol?")

Score similarity to query: 100%|██████████| 11/11 [00:00<00:00, 19.56it/s]


#### Natural language response

Google's stock ticker symbols on the NASDAQ are GOOGL and GOOG.


In [33]:
send_message(
    '''Tell me about Google's share price over the past month.
    Perform a sentiment analysis of news during the same period. Include trends.''')

Score similarity to query: 100%|██████████| 11/11 [00:00<00:00, 20.02it/s]
Generate api embedding (batched): 0it [00:00, ?it/s]


#### Natural language response

Here's a summary of Google's (GOOGL) share price and news sentiment over the past month (April 1, 2025, to May 1, 2025):

**Share Price Summary:**

Over the past month, Google's (GOOGL) share price has fluctuated. Starting at \\$153.62 on April 1, 2025, it reached a high of \\$166.10 around April 25, 2025, before closing at \\$161.30 on May 1, 2025.

**News Sentiment Summary:**

The news sentiment surrounding Google has been mixed.

*   **Positive Trends:** Several articles highlight Google's strong position in artificial intelligence (AI), with its AI-powered products and services driving growth. Google Cloud and Waymo are also seen as promising growth areas. Additionally, some analysts consider the stock undervalued.
*   **Neutral Trends:** Some articles present a neutral outlook, mentioning potential risks to Google's search engine business and the impact of tariffs.
*   **Negative Trends:** Some articles point out concerns about antitrust lawsuits and potential risks to Google's search engine business.

**Key News Themes:**

*   **AI Focus:** Google's investments and advancements in AI are a recurring theme, with many articles discussing the company's AI strategy and its potential to drive growth.
*   **Antitrust Concerns:** Antitrust lawsuits and regulatory challenges remain a concern for Google, with some articles highlighting the potential impact on the company's business.
*   **Cloud Computing Growth:** Google Cloud is consistently mentioned as a growth driver for the company, with articles highlighting its strong performance and potential for future expansion.
*   **Market Volatility:** Several articles mention the broader market sell-off and its impact on tech stocks, including Google.

In summary, while Google's share price has experienced some volatility over the past month, the company's strong position in AI and cloud computing, along with its solid financial performance, suggests a positive long-term outlook. However, investors should be aware of the potential risks associated with antitrust lawsuits and market uncertainty.


In [None]:
send_message("What is MGM Studio's stock symbol?")

In [None]:
send_message("What is Amazon MGM Studio's stock symbol?")

In [None]:
send_message("What is Facebook's stock ticker symbol?")

In [None]:
send_message(
    '''How is the outlook for Apple based on trends and news sentiment over the past month?
    Perform the same analysis on Apple's peers. Then compare Apple's results to those of its peers.''')

In [None]:
send_message('''What does recent news over the past 2 months say about Apple and the impact of tariffs?''')

# Conclusion

<span style="font-size:18px;">
For now, that will have to do. Our Essy has a solid foundation, but more could be done to organise metadata. No evaluation or validation has been performed beyond fuzzing the prompt. Next steps include restructuring the vector database based on lessons learned, followed by plotting, multi-modal support, and structured output. The generative last-close-date function can be temperamental, much like Gemini itself when it comes to dates. I've learnt so much, and I'm happy I decided to participate in the event! It really has been a joy to see Essy grow from random chat with Gemini into the foundation for a good broker buddy. I hope you enjoy playing with this edition as much as I enjoyed building it!
</span>