You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi!
This is an amazing work!I want to use the social norm to optimize the reward function.
I wonder if there is any possible that you could tell me the value of scalar penalty when you train the network! I didn't find the Specific value in this paper.
Thank you!
The text was updated successfully, but these errors were encountered:
Hi @xiaoxianSun , the IROS '18 paper didn't use the social reward, but in the IROS '17 paper the constant qn in Eqns 9-12 can be tuned according to the tradeoff described in the paper. Looking at some old code I believe we settled on 0.5*(some term based on how close the two agents are):
Hi!
This is an amazing work!I want to use the social norm to optimize the reward function.
I wonder if there is any possible that you could tell me the value of scalar penalty when you train the network! I didn't find the Specific value in this paper.
Thank you!
The text was updated successfully, but these errors were encountered: