Socially Aware Motion Planning #9

xiaoxianSun · 2019-11-02T07:55:05Z

Hi!
This is amazing work! I want to use the social norm to optimize the reward function.
Could you tell me the value of the scalar penalty you used when training the network? I couldn't find the specific value in the paper.
Thank you!

mfe7 · 2019-11-09T07:03:54Z

Hi @xiaoxianSun , the IROS '18 paper didn't use the social reward, but in the IROS '17 paper the constant qn in Eqns 9-12 can be tuned according to the tradeoff described in the paper. Looking at some old code I believe we settled on 0.5*(some term based on how close the two agents are):

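import numpy as np

weight = 0.5; GAMMA = 0.97; DT_NORMAL = 0.5
# Euclidean distance between the two agents (positions are NumPy arrays)
d = np.linalg.norm(agent_i_pos - agent_j_pos)
v = agent_i_pref_speed
# Shrinks in magnitude as d grows; negative for v > 0 because GAMMA < 1
getting_close_penalty = GAMMA ** (d/DT_NORMAL) * (1.0 - GAMMA ** (-v/DT_NORMAL))
penalty = weight * getting_close_penalty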
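
For anyone who wants to try this, the snippet above can be wrapped into a self-contained function. This is only a minimal sketch: the function name `social_penalty`, its keyword arguments, and the assumption that positions are 2D coordinates are all hypothetical and are not taken from the original training code. Only the formula and the three constants come from the snippet above.

```python
import numpy as np

# Constants quoted in the reply above.
WEIGHT = 0.5
GAMMA = 0.97
DT_NORMAL = 0.5


def social_penalty(agent_i_pos, agent_j_pos, agent_i_pref_speed,
                   weight=WEIGHT, gamma=GAMMA, dt_normal=DT_NORMAL):
    """Hypothetical wrapper around the getting-close penalty formula.

    Positions are assumed to be array-likes of equal length (e.g. [x, y]).
    """
    pos_i = np.asarray(agent_i_pos, dtype=float)
    pos_j = np.asarray(agent_j_pos, dtype=float)
    d = np.linalg.norm(pos_i - pos_j)  # distance between the two agents
    v = agent_i_pref_speed
    # gamma ** (d / dt_normal) decays with distance; the second factor is
    # negative for v > 0 because gamma < 1, so the result acts as a penalty.
    getting_close_penalty = gamma ** (d / dt_normal) * (1.0 - gamma ** (-v / dt_normal))
    return weight * getting_close_penalty


if __name__ == "__main__":
    # Print the penalty at a few separations for a preferred speed of 1.0.
    for dist in (0.5, 2.0, 5.0):
        print(dist, social_penalty([0.0, 0.0], [dist, 0.0], 1.0))
```

With these constants the returned value is negative for any positive preferred speed, and its magnitude shrinks as the agents move apart, so it would be added to the reward rather than subtracted.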

xiaoxianSun · 2019-11-10T04:41:05Z

Thanks for your kind reply. I am going to try it!

mfe7 closed this as completed Nov 20, 2019
