Add MR-MTL Method #125
Conversation
@@ -1,4 +1,4 @@
-# FedProx Federated Learning Example
+# Ditto Federated Learning Example
I think this was left over from a copy-paste, so I fixed it.
 # Shutdown the client gracefully
 client.shutdown()
+client.metrics_reporter.dump()
@lotif: Do you think we should just add this client.metrics_reporter.dump() call to shutdown so that it always happens?
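For illustration, a minimal sketch of what folding the dump into shutdown could look like, assuming the shutdown method and metrics_reporter attribute shown in the diff above; the class name and constructor here are hypothetical stand-ins, not the repo's actual client class:

```python
class Client:  # hypothetical stand-in; the real client class and reporter API live in this repo
    def __init__(self, metrics_reporter=None):
        self.metrics_reporter = metrics_reporter

    def shutdown(self) -> None:
        # Persist collected metrics as part of a graceful shutdown so callers
        # do not need to remember to call metrics_reporter.dump() separately.
        if self.metrics_reporter is not None:
            self.metrics_reporter.dump()
        # ... any existing shutdown logic (closing resources, etc.) would follow here
```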
 device: torch.device,
 loss_meter_type: LossMeterType = LossMeterType.AVERAGE,
 checkpointer: Optional[TorchCheckpointer] = None,
+lam: float = 1.0,
This isn't part of the original implementation (and shouldn't be in this PR), but it would be interesting to see what effect adapting this value, either via the adaptive FedProx approach or the generalization gap of FedDG-GA, would have on the method. A similar idea applies to the Ditto parameter.
I am confused by this comment; which part do you mean exactly?
Apologies, this was just me recording a thought about something to investigate in the future. That is, would MR-MTL or Ditto benefit from an adaptive implementation of the FedProx-like parameter?
TL;DR: No need to do anything. I was just thinking about potential future experimentation.
Overall, I think the implementation looks good. Just a few small comments before we can merge.
emersodb left a comment
Good to go for me.
PR Type
[Feature]
Short Description
Clickup Ticket(s): Link
This is the implementation of MR-MTL: On Privacy and Personalization in Cross-Silo Federated Learning.
The method is closely related to FedProx and Ditto. At the start of each client training round, we do not overwrite the local model weights with the global model weights; instead, we save the initial global weights, computed by averaging the client model weights at the end of the previous round, and constrain the local weights to stay close to them during training. This mean-regularized training is implemented by adding a penalty term to the loss function that penalizes the distance between the local model weights and those initial global weights.
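For concreteness, a minimal sketch of the mean-regularized penalty described above; the helper name and the exact weighting convention (lam / 2 times the squared l2 distance) are illustrative assumptions, not necessarily the precise form used in this PR:

```python
import torch
import torch.nn as nn


def mr_penalty(local_model: nn.Module, initial_global_weights: list[torch.Tensor], lam: float) -> torch.Tensor:
    # Squared l2 distance between the current local weights and the frozen
    # initial global weights saved at the start of the round.
    distance = sum(
        torch.sum((local_param - global_param.detach()) ** 2)
        for local_param, global_param in zip(local_model.parameters(), initial_global_weights)
    )
    return (lam / 2.0) * distance


# Hypothetical usage inside a training step:
# total_loss = criterion(model(x), y) + mr_penalty(model, initial_global_weights, lam=1.0)
```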
Tests Added
Three tests are added to tests/clients/test_mr_mtl_client.py covering: setting the global weights (we do not set them on the local model, but only save them for computing the MR loss), forming the MR loss, and computing the total loss.
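As a rough illustration of the kind of check the MR-loss test performs, here is a sketch that reuses the hypothetical mr_penalty helper from the sketch above; the real tests exercise the client class itself, so the construction details here are assumptions:

```python
import torch
import torch.nn as nn


def test_mr_penalty_matches_manual_computation() -> None:
    # Tiny linear model with known weights so the penalty can be computed by hand.
    model = nn.Linear(2, 1, bias=False)
    with torch.no_grad():
        model.weight.copy_(torch.tensor([[1.0, 2.0]]))
    initial_global_weights = [torch.tensor([[0.0, 0.0]])]

    # With lam = 2.0, penalty = (2 / 2) * (1^2 + 2^2) = 5.0
    penalty = mr_penalty(model, initial_global_weights, lam=2.0)
    assert torch.isclose(penalty, torch.tensor(5.0))
```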