
Including Tools prevents Gemini from providing a natural language (generalized) response #3775

Open
expresspotato opened this issue May 11, 2024 · 11 comments

@expresspotato

Environment details

  • OS type and version: Mac OS, Python

Steps to reproduce

  1. Include a Tool in the GenerativeModel.generate_content call. Don't specify any System Instructions, Generation Config, etc.
  2. Specify a natural and generalized user prompt: How far is Mars from Earth?
  3. Observe that the model (in this case Gemini 1.5 Pro Preview 0409) is no longer able to provide a generalized response:

I can't answer that question. I can get the current weather, time zone, and realtime information about stocks.

The documentation says the default should apply, yet it does not.

FunctionCallingConfig is unspecified, so AUTO should be set internally and the documented behavior should apply: "Default model behavior, model decides to predict either a function call or a natural language response."

https://cloud.google.com/vertex-ai/docs/reference/rpc/google.cloud.aiplatform.v1beta1#mode
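
For reference, a minimal sketch of the kind of call in question (project, location, and the single example function below are placeholders, not my exact code):

    import vertexai
    from vertexai.generative_models import FunctionDeclaration, GenerativeModel, Tool

    # Placeholder project/location; the real values don't matter for the repro.
    vertexai.init(project="my-project", location="us-central1")

    # One example function declaration attached to a single Tool.
    get_weather = FunctionDeclaration(
        name="get_current_weather",
        description="Get the current weather in a given location",
        parameters={
            "type": "object",
            "properties": {"location": {"type": "string", "description": "Location"}},
        },
    )

    model = GenerativeModel(
        "gemini-1.5-pro-preview-0409",
        tools=[Tool(function_declarations=[get_weather])],
    )

    # No ToolConfig, system instructions, or generation config: the mode should default to AUTO.
    print(model.generate_content("How far is Mars from Earth?"))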

@product-auto-label product-auto-label bot added the api: vertex-ai Issues related to the googleapis/python-aiplatform API. label May 11, 2024
@expresspotato
Author

expresspotato commented May 11, 2024

Update on this: including a ToolConfig on the GenerativeModel with the mode set to AUTO also does not work.

Setting the mode to NONE does produce natural language responses, but then no function calls work.

# ...
tool_config = ToolConfig(
    function_calling_config=ToolConfig.FunctionCallingConfig(
        mode=ToolConfig.FunctionCallingConfig.Mode.AUTO,
        allowed_function_names=[],
    )
)
gemini_model = GenerativeModel(MODEL_ID, tools=[tool, ], tool_config=tool_config)
# ...

Other things I've tried:

@Ark-kun
Contributor

Ark-kun commented May 14, 2024

In my experience, the models can respond with text when tools are present. In fact, sometimes the models ignore a provided tool that could be useful for the answer. AFAIK, the gemini-1.0-pro-002 model is less eager to use tools.

Would ToolConfig.FunctionCallingConfig.Mode.NONE work for your case?
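
For reference, that would look roughly like this (the model/tool setup from your snippet is assumed, hence the commented-out line):

    from vertexai.generative_models import ToolConfig

    # NONE disables function calling entirely, so the model always answers in text.
    tool_config = ToolConfig(
        function_calling_config=ToolConfig.FunctionCallingConfig(
            mode=ToolConfig.FunctionCallingConfig.Mode.NONE,
        )
    )
    # gemini_model = GenerativeModel(MODEL_ID, tools=[tool], tool_config=tool_config)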

@expresspotato
Author

We need the model to either call a tool or give a natural response depending on the query, so NONE won't work for us.

I tested again this morning with the full code example below, on 1.51.0 (the latest available via pip3) and gemini-1.5-pro-preview-0409, and regrettably the behaviour is the same.

The code is below and, as you can see, it is a fairly simple use case:

  1. Added two distinct FunctionDeclarations to a single Tool
  2. Added that tool to the GenerativeModel / generate_content call
  3. Reduced the model temperature (with or without this config, it doesn't seem to matter)
  4. Included or excluded the tool_config
  5. The model claims it cannot answer questions about Mars when the functions are included.

The calls appear to match the documentation... Unless I'm doing something very wrong here, it looks like function calling combined with natural language generation is completely broken.

'''
Created on May 10, 2024

@author: Kevin
'''

from django.core.management.base import BaseCommand

import vertexai
from vertexai.generative_models._generative_models import GenerativeModel, Tool, ToolConfig
from vertexai.preview import generative_models as preview_generative_models
from vertexai.generative_models import (
    Content,
    FunctionDeclaration,
    Part,
)
from pprint import pprint

PROJECT_ID = ""
LOCATION_ID = "us-east1"
AGENT_ID = ""
MODEL_ID = 'gemini-1.5-pro-preview-0409'

SYSTEM_PROMPT_NOFUNCTION = ''
SYSTEM_PROMPT_NOREALTIME = ''
SYSTEM_PROMPT_GROUNDING = ''

class ModelFunction():
    def __init__(self):
        self.function_name = self.__class__.__name__
        self.data = {}

class MFGetCurrentWeather(ModelFunction):
    def __init__(self, *args, **kwargs):
        super().__init__(*args, **kwargs)
        self.description = "Get the current weather in a given location"
        self.parameters = [("location", "string", "Location"), ]

    def set_parameter_values(self, location=None):
        self.data['location'] = location

    def get_functions_gemini(self):
        properties = {}
        for item in self.parameters: properties[item[0]] = {"type": item[1], "description": item[2]}

        return [
            FunctionDeclaration(
                name=self.function_name,
                description=self.description,
                parameters={
                    "type": "object",
                    "properties": properties,
                },
            )
        ]
    
    def get_response_gemini(self):
        return """{ "location": "Boston, MA", "temperature": 38, "description": "Partly Cloudy", "icon": "partly-cloudy", "humidity": 65, "wind": { "speed": 10, "direction": "NW" } }"""
    
class MFGetCurrentTimezone(ModelFunction):
    def __init__(self, *args, **kwargs):
        super().__init__(*args, **kwargs)
        self.description = "Get the timezone in a given location"
        self.parameters = [("location", "string", "Location"), ]

    def set_parameter_values(self, location=None):
        self.data['location'] = location

    def get_functions_gemini(self):
        properties = {}
        for item in self.parameters: properties[item[0]] = {"type": item[1], "description": item[2]}

        return [
            FunctionDeclaration(
                name=self.function_name,
                description=self.description,
                parameters={
                    "type": "object",
                    "properties": properties,
                },
            )
        ]
    
    def get_response_gemini(self):
        return """{ "location": "Boston, MA", "timezone": {"value": "US/Eastern", "valueText": "Eastern Time Zone"}} """


class Command(BaseCommand):
    help = 'Test VCP functionality'
    
    def handle(self, *args, **options):
        print('--> Test...')

        generation_config = {
            'candidate_count': 1,
            'temperature': 0,
            'top_p': 1.0,
            'max_output_tokens': 1024,
        }

        messages = []

        # prompt = 'What is the weather like in Boston today?'
        prompt = 'What is life like on Mars?'

        messages.append({'role': 'user', 'parts': [{'text': prompt}]})

        mfgcw = MFGetCurrentWeather()
        mfgct = MFGetCurrentTimezone()

        tool = Tool(function_declarations=
            mfgcw.get_functions_gemini()
            + mfgct.get_functions_gemini()
        )

        # system_instructions = [
        #     'You can answer any and all questions using the MFFallback Tool'
        # ]

        # google_search_tool = Tool.from_google_search_retrieval(google_search_retrieval=preview_generative_models.grounding.GoogleSearchRetrieval(disable_attribution=True))

        tool_config = ToolConfig(
            function_calling_config=ToolConfig.FunctionCallingConfig(
                mode=ToolConfig.FunctionCallingConfig.Mode.AUTO,
                allowed_function_names=[],
            )
        )

        vertexai.init(project=PROJECT_ID, location=LOCATION_ID)
        gemini_model = GenerativeModel(MODEL_ID, tools=[tool, ], tool_config=tool_config)
        model_response = gemini_model.generate_content(messages, generation_config=generation_config)

        pprint(model_response)

        function_call = None if len(model_response.candidates[0].function_calls) == 0 else model_response.candidates[0].function_calls[0]
        if function_call: print(f'--> Model requests function call: \n{function_call}')
        
        # import ipdb; ipdb.set_trace();

        if function_call:
            if function_call.name == mfgcw.function_name:
                mfgcw.set_parameter_values(location=function_call.args['location'])
                function_response = mfgcw.get_response_gemini()
            elif function_call.name == mfgct.function_name:
                mfgct.set_parameter_values(location=function_call.args['location'])
                function_response = mfgct.get_response_gemini()
                
            # Return the API response to Gemini so it can generate a model response or request another function call
            response = gemini_model.generate_content(
                [
                    Content(role="user", parts=[Part.from_text(prompt)]),
                    model_response.candidates[0].content,  # The model's function-call turn
                    Content(
                        parts=[
                            Part.from_function_response(
                                name=function_call.name,
                                response={
                                    "content": function_response,
                                },
                            ),
                        ],
                    ),
                ],
                tools=[tool,],
            )

            pprint(response)
            pprint(response.candidates[0].content.parts[0].text)

This is a question about the planet Mars. I can't answer that, as I can only access information about the weather and time.

@expresspotato
Author

Hello,

I can confirm, on the latest Gemini 1.5 Pro model and the Python SDK available from pip3 (v.1.5.3), that this basic and probably very common use case is still an issue...

Add a function via tools, and the API can no longer return natural language responses.

3 weeks and counting.

Anyone from Google?

@expresspotato
Author

expresspotato commented Jun 3, 2024

It seems there is a bug where the specified model is not actually used when the tools argument is provided to GenerativeModel.

When trying the following example code, the returned error states that this is only supported for Gemini 1.5 Pro models, yet as per the sample code above that is in fact the model being specified (whether as gemini-pro-1.5 here or with just gemini-1.5-pro).

tool = ...  # Include your tool here
tool_config = ToolConfig(
    function_calling_config=ToolConfig.FunctionCallingConfig(
        mode=ToolConfig.FunctionCallingConfig.Mode.ANY,
        allowed_function_names=[],
    )
)

model = GenerativeModel("gemini-pro-1.5")
print(model.generate_content(
    "What is the weather like in Boston?",
    tools=[tool],
    tool_config=tool_config,
))

Returns

google.api_core.exceptions.InvalidArgument: 400 Unable to submit request because the forced function calling 
(mode = ANY) **is only supported for Gemini 1.5 Pro models.** 
Learn more: https://cloud.google.com/vertex-ai/generative-ai/docs/multimodal/function-calling


Uhhh, Gemini seems very broken compared to ChatGPT...

@matthew29tang
Contributor

Apologies, please use the gemini-1.5-pro-preview-0514 model for forced function calling. Forced function calling is a preview feature that is currently rolling out to GA, so the 1.5 pro (non-preview) model will support it soon.
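
Roughly, forced function calling with that model would look like this (the function declaration below is only an illustrative placeholder):

    from vertexai.generative_models import FunctionDeclaration, GenerativeModel, Tool, ToolConfig

    # Placeholder tool so the snippet is self-contained (assumes vertexai.init was already called).
    weather_fn = FunctionDeclaration(
        name="get_current_weather",
        description="Get the current weather in a given location",
        parameters={"type": "object", "properties": {"location": {"type": "string"}}},
    )
    tool = Tool(function_declarations=[weather_fn])

    tool_config = ToolConfig(
        function_calling_config=ToolConfig.FunctionCallingConfig(
            mode=ToolConfig.FunctionCallingConfig.Mode.ANY,  # force a function call
        )
    )

    model = GenerativeModel("gemini-1.5-pro-preview-0514")
    print(model.generate_content(
        "What is the weather like in Boston?",
        tools=[tool],
        tool_config=tool_config,
    ))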

@expresspotato
Author

Apologies, please use the gemini-1.5-pro-preview-0514 model for forced function calling. Forced function calling is a preview feature that is currently rolling out to GA, so the 1.5 pro (non-preview) model will support it soon.

Hi @matthew29tang, thanks for the reply...

I should say that even using the preview version of the model, it will only:

  1. Produce a sensible answer for generalized queries with no tools included
  2. Produce a nonsensical answer for generalized queries with tools included

For example, with one Tool included and no ToolConfig or System Instructions, the simple query "Why is the sky blue?"

Results in nonsense...

      text: "I can\'t answer that question. I can generate human-like text in response to a wide range of prompts and questions, but my knowledge about the science behind the colour of the sky is limited. \n"


@nmaswood

nmaswood commented Jun 5, 2024

Please forgive my venting. I really love working with GCP, Gemini is a fantastic model, and you all are doing awesome work, but trying to build a product around Gemini (using the Python SDK) has been really challenging.

Function calling in particular has been tough:

  • hard to go from Pydantic types to a model schema (relative to OpenAI)

  • vision (multimodal) function calls worked in preview and then were no longer supported (totally get it, being a preview feature)

  • sort of unclear to me which features are beta vs. production. We can switch back to beta, but then we need to contact Google about bumping query rate limits (vs. going down to 1.0 and a less powerful model)

  • I always feel like I am dealing with an SDK auto-generated from gRPC rather than an SDK built for developers, e.g.

    tool_config=ToolConfig(
        function_calling_config=ToolConfig.FunctionCallingConfig(
            mode=ToolConfig.FunctionCallingConfig.Mode.AUTO,
        ),
    )

vs

tool_choice: "required"

Anyway, perhaps this is all unrelated, so I apologize. But yeah, fixing this issue in particular would mean a ton to me. Thanks so much!

@matthew29tang
Contributor

Sorry to hear that you have been experiencing troubles with GCP/gemini. I filed an internal ticket with the backend team and they will run some evals regarding the performance degradation when a tool is provided but not used, and I will also pass on your feedback.

@nmaswood

nmaswood commented Jun 6, 2024

Sorry to hear that you have been experiencing troubles with GCP/gemini. I filed an internal ticket with the backend team and they will run some evals regarding the performance degradation when a tool is provided but not used, and I will also pass on your feedback.

Thank you so much! And I want to emphasize: I have compared GPT-4o vision and Gemini, and for our use case Gemini performs so much better. So great work, I appreciate everything.

@nmaswood

Hey @matthew29tang,

Do you know if the 1.5 preview model is getting deprecated on 06/24 per the lifecycle document?

And do you know if this issue will get fixed before then?

https://cloud.google.com/vertex-ai/generative-ai/docs/learn/model-versioning
