-
Notifications
You must be signed in to change notification settings - Fork 89
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Is Per-FedAvg implemented properly? #8
Comments
When the code was implemented, the previous version of this paper only propose the first way to approximate Hessian, so we didn't implement the second approach. |
The fundamental idea of both pFedMe and Per-FedAvg is to generate a well-generalized global model which allowed personalizing this global model to make the personalized model using 1-k steps (small steps). Also in Per-Fedavg, it is trained one batch of test data before evaluation. So, the personalized model is always better than the global only. |
Please see above for details. |
Hi @sshpark and @chuanting; Thank all. Yes, I think my implementation is not correct in terms of the second update in PerfedAvg and @chuanting found out this issue. Regards, |
Hi @CharlieDinh; Regards, |
Hi @sshpark, Thanks for your commit. |
Hi @CharlieDinh, I am sorry for not providing a full performance test for this commit. I have applied it to my current project. In my project, the updated version of Per-FedAvg does not differ much from its previous version in some aspects such as accuracy. I would give a full performance test after finishing my current project. |
Thank @sshpark. |
In the code, when training Per-FedAvg, there are two steps, and each step sample a batch of data and perform parameter update. But in the MAML framework, I think the first step is to obtain a fast weight, and the second step is to update the parameters based on the fast weight of the first step. So why do you update the parameters two times? Are the fundamental differences between Per-FedAvg and FedAvg lie in that the former performs two steps update and the latter performs a one-step update? Is this fair for FedAvg?
The text was updated successfully, but these errors were encountered: