Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to calculate the reward of every epoch?Will it make sense? #2

Closed
ghost opened this issue Mar 26, 2018 · 1 comment
Closed

How to calculate the reward of every epoch?Will it make sense? #2

ghost opened this issue Mar 26, 2018 · 1 comment
Assignees
Labels

Comments

@ghost
Copy link

ghost commented Mar 26, 2018

I have read the code, and I got a question that described as the title. There was an variable called coTheta which is used to calculated the reward. Anyone who know why?
Any help is appreciated.

@qingyun-wu
Copy link
Collaborator

Hi,
If users are independent in the environment, theta should be used to calculate the reward. But in a collaborative environment, CoTheta, which considers the user connection matrix W, should be used to calculate the environment.

So to make it more general, we decided to use CoTheta to compute the reward and at the same time controls W to support both independent and collaborative environment. In other words, if you need an environment, in which users are independent, we only need to set W to be identical matrix. And in this case, it is equivalent to use theta to compute reward.

Feel free to add more comments if there is still confusion.

Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants