Dedicated Observer/Supervisor Class #4242

Boostrix · 2023-05-16T08:47:13Z

Duplicates

I have searched the existing issues

Summary 💡

Recently, using separate agents to observe some constraints/performance evaluation is one of the most recurring ideas around here (and on Discord) - so, it might make sense to come up with a dedicated helper class to wrap all related functionality. We could use two types of classes: passive observation and active/enforcing (supervisor).

For starters, look at agent/agent.py and specifically the self_feedback() function there.
This is what we would want to extract (copy/paste) and generalize (GPT?), and maybe make it a part of llm/llm_utils.py

The details of which would be coordinated with the llm folks

The goal would be instantiate arbitrary agents to obtain feedback for some given constraints/requirements and goals

Examples 🌈

No response

Motivation 🔦

No response

dewasahu2003 · 2023-05-17T11:27:45Z

hi @Boostrix 👋
can i try this issue

dewasahu2003 · 2023-05-17T15:50:29Z

so is this similar to how an agent in reinforcement learning evaluate its performance?

Boostrix · 2023-05-17T16:02:05Z

Basically, see self feedback - it's a way to get a different feedback. Imagine it like asking how good a certain response is based on asking another llm session and providing the constraints / evaluation criteria

dewasahu2003 · 2023-05-18T13:48:10Z

does this pseudo code make sense if so i can start making changes

Boostrix · 2023-05-18T13:53:26Z

You seem to be on the right track, play around with the idea in conjunction with looking at threads mentioning observer / supervisor / feedback etc
And before doing any significant amount of work related to this, please reach out to the team via discord to discuss your ideas and the sccope of the work to ensure that it aligns well with other ongoing efforts.

bbonifacio-at-mudd · 2023-05-19T04:48:56Z

Hi @Boostrix @dewasahu2003, can I join in on this issue too? Are you working off stable or master? (I'll try working on it on stable for now)

Boostrix · 2023-05-19T05:34:30Z

I've added you to the issue, I'd suggest to keep the conversation here or both of you should consider joininig discord.
Either way, don't do any significant coding without first checking back with other contributors.
Implementing a simple proof-of-concept should not take much longer than ~1 hr. And at that point, it makes sense to review what others have said about the idea of using separate agents to evaluate an agent's performance.

bbonifacio-at-mudd · 2023-05-19T05:35:29Z

Sounds good! Other than @dewasahu2003, do you know of anyone else working on this that we can contact on discord?

Boostrix · 2023-05-19T05:46:02Z

not yet, but you can simply announce the project and gather all feedback, feel free to update this issue.
Be aware that there's an upcoming hackathon, so there might be fewer folks available today and during the weekend, since people are trying to participate in 2 hackathons actually

Either way, it would make sense to research what others have said about the idea, before writing any code.
So if there are any related issues, feel free to cross-reference those here.

bbonifacio-at-mudd · 2023-05-19T05:58:49Z

Okay! What channel on the discord would be good for discussing this?

Boostrix · 2023-05-19T06:16:34Z

any of dev-contributors / dev-general / dev-autogpt would seem appropriate, I think ?

See also: #4220

FYI: @anonhostpi has collated a ton of related info, and even came up with a dedicated gist for that: https://github.com/anonhostpi

for the gist, see: https://gist.github.com/anonhostpi/97d4bb3e9535c92b8173fae704b76264#file-_topics-0011-cmds-0002-web-md

"self moderation": https://gist.github.com/anonhostpi/97d4bb3e9535c92b8173fae704b76264#observerregulatory-agents-and-restrictions-proposals

It would make sense to base any future work on evaluating these talks first. BEFORE writing any code.

dewasahu2003 · 2023-05-22T18:07:48Z

Hi @Boostrix 👋
I have a idea regarding implementing evaluation

Agent Class --> we can add evaluate method

so, we can evaluate whichever agent we want..
it would become very easy to evaluate agent as we would have to call method on instance of agent class

evaluate method

we will create a newAgent in evaluate method which will have different constraints
and thus evaluate the agent

some doubts that i have

on what kind of parameters are we going to evaluate
- will we check the agent that how repetitive and wrong command it give and then suggest some new commands

hopefully this makes some sense
open to listen any kind of feedback 🙏

github-actions · 2023-09-06T20:51:04Z

This issue has automatically been marked as stale because it has not had any activity in the last 50 days. You can unstale it by commenting or removing the label. Otherwise, this issue will be closed in 10 days.

github-actions · 2023-11-04T01:45:44Z

This issue has automatically been marked as stale because it has not had any activity in the last 50 days. You can unstale it by commenting or removing the label. Otherwise, this issue will be closed in 10 days.

estefysc · 2023-11-07T18:37:44Z

Hi, @Boostrix. I was looking into agent/agent.py, searching for the self_feedback() function you referred to, but I cannot find it. I wanted take a look because I would like to see if I can work on this.

github-actions · 2024-02-21T01:45:20Z

This issue has automatically been marked as stale because it has not had any activity in the last 50 days. You can unstale it by commenting or removing the label. Otherwise, this issue will be closed in 10 days.

github-actions · 2024-03-04T01:48:06Z

This issue was closed automatically because it has been stale for 10 days with no activity.

Boostrix added the good first issue Good for newcomers label May 16, 2023

Boostrix assigned dewasahu2003 May 17, 2023

bbonifacio-at-mudd mentioned this issue May 19, 2023

Automated Self Feedback #4220

Open

1 task

Boostrix assigned bbonifacio-at-mudd May 19, 2023

Boostrix mentioned this issue Jun 10, 2023

Adds risk avoidance mode and relevant config. #934

Closed

5 tasks

github-actions bot added the Stale label Sep 6, 2023

Pwuts mentioned this issue Sep 10, 2023

Auto-GPT Performance 📈 #5190

Open

Pwuts removed the Stale label Sep 14, 2023

github-actions bot added the Stale label Nov 4, 2023

github-actions bot removed the Stale label Nov 8, 2023

github-actions bot added the Stale label Feb 21, 2024

github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Mar 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dedicated Observer/Supervisor Class #4242

Dedicated Observer/Supervisor Class #4242

Boostrix commented May 16, 2023 •

edited

dewasahu2003 commented May 17, 2023

dewasahu2003 commented May 17, 2023

Boostrix commented May 17, 2023

dewasahu2003 commented May 18, 2023

Boostrix commented May 18, 2023 •

edited

bbonifacio-at-mudd commented May 19, 2023 •

edited

Boostrix commented May 19, 2023

bbonifacio-at-mudd commented May 19, 2023

Boostrix commented May 19, 2023 •

edited

bbonifacio-at-mudd commented May 19, 2023

Boostrix commented May 19, 2023 •

edited

dewasahu2003 commented May 22, 2023

github-actions bot commented Sep 6, 2023

github-actions bot commented Nov 4, 2023

estefysc commented Nov 7, 2023

github-actions bot commented Feb 21, 2024

github-actions bot commented Mar 4, 2024

Dedicated Observer/Supervisor Class #4242

Dedicated Observer/Supervisor Class #4242

Comments

Boostrix commented May 16, 2023 • edited

Duplicates

Summary 💡

Examples 🌈

Motivation 🔦

dewasahu2003 commented May 17, 2023

dewasahu2003 commented May 17, 2023

Boostrix commented May 17, 2023

dewasahu2003 commented May 18, 2023

Boostrix commented May 18, 2023 • edited

bbonifacio-at-mudd commented May 19, 2023 • edited

Boostrix commented May 19, 2023

bbonifacio-at-mudd commented May 19, 2023

Boostrix commented May 19, 2023 • edited

bbonifacio-at-mudd commented May 19, 2023

Boostrix commented May 19, 2023 • edited

dewasahu2003 commented May 22, 2023

github-actions bot commented Sep 6, 2023

github-actions bot commented Nov 4, 2023

estefysc commented Nov 7, 2023

github-actions bot commented Feb 21, 2024

github-actions bot commented Mar 4, 2024

Boostrix commented May 16, 2023 •

edited

Boostrix commented May 18, 2023 •

edited

bbonifacio-at-mudd commented May 19, 2023 •

edited

Boostrix commented May 19, 2023 •

edited

Boostrix commented May 19, 2023 •

edited