How to calculate the reward of every epoch?Will it make sense? #2

ghost · 2018-03-26T02:02:29Z

I have read the code, and I got a question that described as the title. There was an variable called coTheta which is used to calculated the reward. Anyone who know why?
Any help is appreciated.

qingyun-wu · 2018-03-28T15:58:58Z

Hi,
If users are independent in the environment, theta should be used to calculate the reward. But in a collaborative environment, CoTheta, which considers the user connection matrix W, should be used to calculate the environment.

So to make it more general, we decided to use CoTheta to compute the reward and at the same time controls W to support both independent and collaborative environment. In other words, if you need an environment, in which users are independent, we only need to set W to be identical matrix. And in this case, it is equivalent to use theta to compute reward.

Feel free to add more comments if there is still confusion.

Thanks!

huazhengwang assigned qingyun-wu Mar 28, 2018

huazhengwang added the question label Apr 1, 2018

huazhengwang closed this as completed Apr 1, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to calculate the reward of every epoch?Will it make sense? #2

How to calculate the reward of every epoch?Will it make sense? #2

ghost commented Mar 26, 2018

qingyun-wu commented Mar 28, 2018

How to calculate the reward of every epoch?Will it make sense? #2

How to calculate the reward of every epoch?Will it make sense? #2

Comments

ghost commented Mar 26, 2018

qingyun-wu commented Mar 28, 2018