RFC: Add `return_direct` to Tools, Actions, and FunctionCalling Agents by eob · Pull Request #532 · steamship-core/python-client

eob · 2023-08-27T19:06:51Z

This PR adds a return_direct flag to Tool the style of LangChain that permits a tool to signal that its response should be delivered as the "final answer" to the user.

This prevents a common error condition in which a reasoning agent runs a "terminal" tool, but then instead of returning its output, it either:

Summarizes for the user what it did
Runs other tools unnecessarily

Here's an example of that in action with the Prompt Golf bot:

What happened here is:

The Agent correctly ran the GameTurnTool which generated a new image
The Agent then replied with a summary of what it did rather than replying with the output of the tool

eob · 2023-08-27T19:07:52Z

@douglas-reid does this seem like a reasonable way to achieve this? It would result in:

FinishAction being the way the Agent signals completion.
Tool.return_direct being a way a Tool can signal that it always should end reasoning

douglas-reid · 2023-08-28T19:09:27Z

@eob my first reaction is that this is fine, as it seems to address an immediate need. it feels like "llm smell" in the sense that a summarizing action seems like something that should be ideally smoothed out of workflows at a different level. and i worry that there are likely tools that don't always want to return FinishActions. but both of those concerns seem minimal in the face of our existing reality. i'll take a closer look at the code impl now.

douglas-reid

Some initial reactions.

douglas-reid · 2023-08-28T19:14:05Z

    output: Optional[List[Block]]
    """Any direct output produced by the Tool."""

+    return_direct: bool = False


curious: should we label this more like is_final or tool_is_direct similar to track that state (and update the comment -- as i'm not expecting users to set return_direct on Actions) ?

aside: Would this be "simpler" if the Action returned by an Agent/AgentExecutor when Tool.return_direct == True was a FinishAction instead? Or even a new type of Action (ex: DirectReturnAction) ? Maybe that would be more complex (as the AgentExecutor would have to be involved) ??

douglas-reid · 2023-08-28T19:14:19Z

+    return_direct: bool = False
+    """Whether to return the tool's output directly.
+
+    Setting this to True means that after the tool is called, the AgentExecutor will stop looping.


Suggested change

Setting this to True means that after the tool is called, the AgentExecutor will stop looping.

Setting this to True means that after the Tool is run, the Tool's output will be returned without further Agent interactions. This will preempt the execution of other tools, as well as any subsequent calls to LLMs, etc., to interpret the Tool output.

douglas-reid · 2023-08-28T19:28:39Z

            self.run_action(agent=agent, action=action, context=context)
+
+            # If this action wishes to end the loop, comply with that wish
+            if action.return_direct:


aside (take it or leave it): my api spidey-sense is tingling seeing this code block. i think seeing this, i'd prefer something like action.is_final (or even if isinstance(action, DirectReturnAction)) for clarity (as Actions aren't really "returning" anything per se).

douglas-reid · 2023-08-28T19:32:37Z

+
+            # If this action wishes to end the loop, comply with that wish
+            if action.return_direct:
+                logging.info(


curious: do you think we need this log statement for full coverage? WDYT about just adding "is_final" or similar to the existing "next tool" log? that might lead to more readable code (no floating break) and fewer lines of logging code?

douglas-reid · 2023-08-28T19:33:10Z

+            # If this action wishes to end the loop, comply with that wish
+            if action.return_direct:
+                logging.info(
+                    f"Exciting reasoning loop by request of: {action.tool}",


Suggested change

f"Exciting reasoning loop by request of: {action.tool}",

f"Exiting Agent execution by request of: {action.tool}",

eob · 2023-08-28T19:53:26Z

@douglas-reid thanks a ton -- will do some tinkering down these threads.

A few thoughts I had that are in the same vein as yours:

I'm not sure if it should feel weird that a the is_final property of Tool is static as opposed to part of its response.
I like the idea of Action.is_final... that feels cleaner and also aligns it with FinalAction

Add return direct capability

b86350a

eob requested a review from douglas-reid August 27, 2023 19:06

douglas-reid reviewed Aug 28, 2023

View reviewed changes

eob and others added 4 commits August 28, 2023 16:08

Sketch at is_final

27bf8ac

Direct tools

2baa975

Add a test

7855323

Merge branch 'main' into add-return-direct-to-tools

5c1eab1

eob enabled auto-merge September 1, 2023 17:53

Yes

3421603

eob added this pull request to the merge queue Sep 1, 2023

eob removed this pull request from the merge queue due to a manual request Sep 1, 2023

eob merged commit 655ae07 into main Sep 1, 2023

eob deleted the add-return-direct-to-tools branch September 1, 2023 20:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RFC: Add `return_direct` to Tools, Actions, and FunctionCalling Agents#532

RFC: Add `return_direct` to Tools, Actions, and FunctionCalling Agents#532
eob merged 6 commits intomainfrom
add-return-direct-to-tools

eob commented Aug 27, 2023 •

edited

Loading

Uh oh!

eob commented Aug 27, 2023

Uh oh!

douglas-reid commented Aug 28, 2023

Uh oh!

douglas-reid left a comment

Uh oh!

douglas-reid Aug 28, 2023

Uh oh!

douglas-reid Aug 28, 2023

Uh oh!

douglas-reid Aug 28, 2023

Uh oh!

douglas-reid Aug 28, 2023

Uh oh!

douglas-reid Aug 28, 2023

Uh oh!

eob commented Aug 28, 2023

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	Setting this to True means that after the tool is called, the AgentExecutor will stop looping.
	Setting this to True means that after the Tool is run, the Tool's output will be returned without further Agent interactions. This will preempt the execution of other tools, as well as any subsequent calls to LLMs, etc., to interpret the Tool output.

	f"Exciting reasoning loop by request of: {action.tool}",
	f"Exiting Agent execution by request of: {action.tool}",

Conversation

eob commented Aug 27, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

eob commented Aug 27, 2023

Uh oh!

douglas-reid commented Aug 28, 2023

Uh oh!

douglas-reid left a comment

Choose a reason for hiding this comment

Uh oh!

douglas-reid Aug 28, 2023

Choose a reason for hiding this comment

Uh oh!

douglas-reid Aug 28, 2023

Choose a reason for hiding this comment

Uh oh!

douglas-reid Aug 28, 2023

Choose a reason for hiding this comment

Uh oh!

douglas-reid Aug 28, 2023

Choose a reason for hiding this comment

Uh oh!

douglas-reid Aug 28, 2023

Choose a reason for hiding this comment

Uh oh!

eob commented Aug 28, 2023

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

eob commented Aug 27, 2023 •

edited

Loading