[rllib] Accessing the reward of other agents in custom multi-agent algorithm #9949
Labels
enhancement
Request for new feature and/or capability
question
Just a question :)
stale
The issue is stale. It will be closed within 7 days unless there are further conversation
What is your question?
My goal is to implement a custom multi-agent algorithm. However, to update the policy for any individual agent requires access to the rewards of other agents. Is there any way to access the rewards issued to other agents? I imagine this may be similar to the problem of accessing the observation of other agents, which seems to be an open problem (see #7341, #4748).
The text was updated successfully, but these errors were encountered: