Opinion about dataset split #14

Closed
djskwh opened this issue Jun 7, 2023 · 2 comments

djskwh commented Jun 7, 2023

Hi, KarhouTam.
I recently discussed dataset splitting for traditional FL (FedAvg, FedProx, FedDyn, etc.) with my colleague.
My colleague insisted that, in order to evaluate an FL algorithm, I should evaluate the model on an isolated dataset
(i.e. for MNIST, first split off ~6000 images as a test set for global model evaluation, and then assign the remaining images to the clients for testing and evaluation).
As far as I know, the global server can't see the entire dataset because of privacy concerns, right?
I think that's why your dataset creation setup also doesn't assign a test set for global model evaluation.

What is your opinion on this?
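
For reference, here is a rough sketch of the split my colleague proposed. The sizes, seed, number of clients, and the IID client split are only illustrative assumptions, not FL-bench's actual partitioning code.

# Rough sketch only: hold out a global testset, then split the rest across clients.
# Sizes, seed, and the IID split are illustrative assumptions, not FL-bench code.
import torch
from torch.utils.data import random_split
from torchvision import datasets, transforms

full_train = datasets.MNIST("data", train=True, download=True,
                            transform=transforms.ToTensor())

# 1) Hold out ~6000 images as the global testset for server-side evaluation.
global_test_size = 6000
client_pool_size = len(full_train) - global_test_size
global_testset, client_pool = random_split(
    full_train, [global_test_size, client_pool_size],
    generator=torch.Generator().manual_seed(42),
)

# 2) Assign the remaining images to the clients (IID here, purely for illustration).
num_clients = 10
sizes = [client_pool_size // num_clients] * num_clients
sizes[-1] += client_pool_size - sum(sizes)  # absorb any remainder
client_datasets = random_split(
    client_pool, sizes, generator=torch.Generator().manual_seed(42)
)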

KarhouTam (Owner) commented Jun 8, 2023

Hi, @djskwh.

First, I think you are right. In my opinion, the global server should not hold a global testset for evaluation. The way I imagine it, in industry the party responsible for FL training cannot obtain a testset with the same data distribution as the trainset. In academia, however, the global-testset setting is used for evaluation convenience, and that seems permissible.

Let's review the code in FL-bench for testing FL methods (whether traditional or personalized).

for client_id in self.test_clients:
    client_local_params = self.generate_client_params(client_id)
    stats = self.trainer.test(client_id, client_local_params)
    # collect each client's correct counts, losses, and testset size
    correct_before.append(stats["before"]["test_correct"])
    correct_after.append(stats["after"]["test_correct"])
    loss_before.append(stats["before"]["test_loss"])
    loss_after.append(stats["after"]["test_loss"])
    num_samples.append(stats["before"]["test_size"])

loss_before = torch.tensor(loss_before)
loss_after = torch.tensor(loss_after)
correct_before = torch.tensor(correct_before)
correct_after = torch.tensor(correct_after)
num_samples = torch.tensor(num_samples)

# aggregate across clients: total correct / total samples (size-weighted)
self.test_results[self.current_epoch + 1] = {
    "loss": "{:.4f} -> {:.4f}".format(
        loss_before.sum() / num_samples.sum(),
        loss_after.sum() / num_samples.sum(),
    ),
    "accuracy": "{:.2f}% -> {:.2f}%".format(
        correct_before.sum() / num_samples.sum() * 100,
        correct_after.sum() / num_samples.sum() * 100,
    ),
}

Suppose I have a global testset of size $S$, and the final model predicts $C$ of its samples correctly.

According to your colleague's opinion, the final accuracy of a traditional FL method should be calculated as
$$\frac{C}{S}$$

Now suppose I have only two FL clients, $A$ and $B$, whose testset parts have sizes $S_A$ and $S_B$ ($S = S_A + S_B$), with $C_A$ and $C_B$ correctly predicted samples ($C = C_A + C_B$).

What my code calculates is based on
$$\frac{C_A + C_B}{S_A + S_B} = \frac{C}{S}$$

So in my opinion, as long as the global testset is simply the union of the clients' testset parts, the result my code calculates for traditional FL methods should be the same as the result calculated with a global testset.
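
A quick numeric check of that equality, with toy numbers that are made up purely for illustration (not from any real run):

# Toy check: the size-weighted per-client aggregation equals pooled (global) accuracy.
# All numbers below are made up for illustration.
correct = {"A": 450, "B": 930}   # correctly predicted test samples per client
sizes = {"A": 500, "B": 1000}    # testset size per client

client_level = sum(correct.values()) / sum(sizes.values())  # (C_A + C_B) / (S_A + S_B)
global_level = (450 + 930) / (500 + 1000)                   # C / S on the pooled testset
assert client_level == global_level                         # both are 0.92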

Of course, this discussion does not apply to personalized FL methods; they are incompatible with the global-testset setting.

djskwh (Author) commented Jun 8, 2023

Thanks for the detailed review of your code and the explanation.
Good luck with your FL research!

This issue was closed.