Dedicated Observer/Supervisor Class #4242
Comments
hi @Boostrix 👋 |
so is this similar to how an agent in reinforcement learning evaluates its performance? |
Basically, yes; see self feedback. It's a way to get a second, independent opinion: imagine asking another LLM session how good a certain response is, while providing the constraints / evaluation criteria. |
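To make the idea above concrete, here is a minimal sketch of asking a second LLM session to rate a response against some criteria. Everything in it is an assumption for illustration: `AskLLM`, `build_feedback_prompt`, `request_feedback`, and the "score on the first line" reply format are hypothetical and are not taken from Auto-GPT's code. The LLM call is replaced by a stub so the sketch runs offline.

```python
from typing import Callable, Sequence, Tuple

# Hypothetical type: any function that sends a prompt to a fresh LLM session
# and returns the text reply. This is NOT an Auto-GPT API.
AskLLM = Callable[[str], str]


def build_feedback_prompt(response: str, criteria: Sequence[str]) -> str:
    """Combine a response and its evaluation criteria into one evaluator prompt."""
    bullet_list = "\n".join(f"- {c}" for c in criteria)
    return (
        "Rate the following response from 1 (poor) to 10 (excellent) "
        "against these criteria:\n"
        f"{bullet_list}\n\n"
        f"Response:\n{response}\n\n"
        "Reply with the score on the first line, then a short justification."
    )


def request_feedback(
    response: str, criteria: Sequence[str], ask_llm: AskLLM
) -> Tuple[int, str]:
    """Ask a separate LLM session to score a response; returns (score, justification)."""
    reply = ask_llm(build_feedback_prompt(response, criteria))
    # Assumed reply format: first line is the integer score, the rest is the reason.
    first_line, _, rest = reply.strip().partition("\n")
    return int(first_line.strip()), rest.strip()


def fake_llm(prompt: str) -> str:
    """Stub standing in for a real LLM call."""
    return "7\nMostly on task, but ignores the budget constraint."


score, why = request_feedback("Buy 3 servers.", ["stay within budget"], fake_llm)
print(score, why)
```

A real evaluator would need to handle replies that do not follow the assumed format; the sketch leaves that out to keep the shape of the idea visible.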
You seem to be on the right track. Play around with the idea while also looking at threads that mention observer / supervisor / feedback, etc. |
Hi @Boostrix @dewasahu2003, can I join in on this issue too? Are you working off stable or master? (I'll try working on it on stable for now) |
I've added you to the issue. I'd suggest keeping the conversation here, or both of you should consider joining Discord. |
Sounds good! Other than @dewasahu2003, do you know of anyone else working on this that we can contact on discord? |
not yet, but you can simply announce the project and gather all feedback, feel free to update this issue. Either way, it would make sense to research what others have said about the idea, before writing any code. |
Okay! What channel on the discord would be good for discussing this? |
any of dev-contributors / dev-general / dev-autogpt would seem appropriate, I think?
See also: #4220
FYI: @anonhostpi has collated a ton of related info, and even came up with a dedicated gist for that: https://github.com/anonhostpi
For the gist, see: https://gist.github.com/anonhostpi/97d4bb3e9535c92b8173fae704b76264#file-_topics-0011-cmds-0002-web-md
"self moderation": https://gist.github.com/anonhostpi/97d4bb3e9535c92b8173fae704b76264#observerregulatory-agents-and-restrictions-proposals
It would make sense to evaluate these discussions first and base any future work on them, BEFORE writing any code. |
Hi @Boostrix 👋
hopefully this makes some sense |
This issue has automatically been marked as stale because it has not had any activity in the last 50 days. You can unstale it by commenting or removing the label. Otherwise, this issue will be closed in 10 days. |
Hi, @Boostrix. I was looking into agent/agent.py, searching for the self_feedback() function you referred to, but I cannot find it. I wanted to take a look because I would like to see if I can work on this. |
This issue has automatically been marked as stale because it has not had any activity in the last 50 days. You can unstale it by commenting or removing the label. Otherwise, this issue will be closed in 10 days. |
This issue was closed automatically because it has been stale for 10 days with no activity. |
Duplicates
Summary 💡
Using separate agents to monitor constraints or evaluate performance has recently been one of the most frequently recurring ideas around here (and on Discord), so it might make sense to come up with a dedicated helper class to wrap all related functionality. We could use two types of classes: a passive one for observation (observer) and an active/enforcing one (supervisor).
For starters, look at agent/agent.py and specifically the self_feedback() function there.
This is what we would want to extract (copy/paste) and generalize (GPT?), and maybe make part of llm/llm_utils.py.
The details would be coordinated with the llm folks.
The goal would be to instantiate arbitrary agents to obtain feedback for some given constraints/requirements and goals.
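As a rough illustration of the passive/active split proposed above, here is a minimal sketch. All names in it (`Observer`, `Supervisor`, `Evaluator`, `ConstraintViolation`, `keyword_evaluator`) are hypothetical and do not exist in the code base; the evaluator is a plain callable that, in practice, could wrap a call to another LLM session.

```python
from dataclasses import dataclass, field
from typing import Callable, List

# Hypothetical: an evaluator takes (action, constraints) and returns the list
# of violated constraints. In practice it could wrap another LLM session.
Evaluator = Callable[[str, List[str]], List[str]]


class ConstraintViolation(Exception):
    """Raised by a Supervisor when a proposed action breaks a constraint."""


@dataclass
class Observer:
    """Passive: records violations but never interferes."""

    constraints: List[str]
    evaluate: Evaluator
    log: List[str] = field(default_factory=list)

    def review(self, action: str) -> List[str]:
        violations = self.evaluate(action, self.constraints)
        self.log.extend(f"{action!r} violates {v!r}" for v in violations)
        return violations


@dataclass
class Supervisor(Observer):
    """Active: performs the same review, but blocks violating actions."""

    def review(self, action: str) -> List[str]:
        violations = super().review(action)
        if violations:
            raise ConstraintViolation(f"blocked {action!r}: {violations}")
        return violations


def keyword_evaluator(action: str, constraints: List[str]) -> List[str]:
    """Toy evaluator: treats each constraint as a forbidden substring."""
    return [c for c in constraints if c in action]


observer = Observer(["rm -rf"], keyword_evaluator)
observer.review("rm -rf /tmp/x")  # only logged, nothing is blocked

supervisor = Supervisor(["rm -rf"], keyword_evaluator)
try:
    supervisor.review("rm -rf /tmp/x")
except ConstraintViolation as err:
    print(err)
```

Making the supervisor a subclass keeps the logging behaviour shared, so the only difference between the two is whether a violation stops the action.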
Examples 🌈
No response
Motivation 🔦
No response