Evaluating performance of FedGrad under edge-case backdoor attack

Overview


FedGrad: Mitigating Backdoor Attacks in Federated Learning through Local Ultimate Gradients Inspection.
Federated learning (FL) enables multiple clients to train a model without compromising sensitive data. The decentralized nature of FL, however, makes it susceptible to adversarial attacks, especially backdoor insertion during training. Recently, the edge-case backdoor attack, which exploits the tail of the data distribution, has been proposed as a powerful attack, raising questions about the robustness guarantees of current defenses. Specifically, most existing defenses either cannot eliminate edge-case backdoor attacks or suffer from a trade-off between backdoor-defending effectiveness and overall performance on the primary task. To tackle this challenge, we propose FedGrad, a novel defense for FL that is resistant to cutting-edge backdoor attacks, including the edge-case attack, and performs effectively under heterogeneous client data and a large number of compromised clients. FedGrad is designed as a two-layer filtering mechanism that thoroughly analyzes the ultimate layer’s gradients to identify suspicious local updates and remove them from the aggregation process. We evaluate FedGrad under different attack scenarios and show that it significantly outperforms state-of-the-art defense mechanisms.

Dependencies (tentative)


The requirements are listed in the requirements.txt file.
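To install them (a minimal sketch, assuming a standard pip-based Python environment):

pip install -r requirements.txt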

Data Preparation


  1. For the Southwest Airline (CIFAR-10) and traditional Cretan costume (ImageNet) edge-case examples, most of the collected edge-case datasets are available in ./saved_datasets.
  2. For the ARDIS (EMNIST) edge-case example, the datasets can be generated by running get_ardis_data.sh and then generating_poisoned_DA.py, as sketched below.
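A plausible invocation for step 2 (a sketch; both script names come from this repository, but we assume they take no required arguments):

bash get_ardis_data.sh
python generating_poisoned_DA.py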

Running Experiments


The main script is ./simulated_averaging.py. To launch the jobs, we provide the script ./run_simulated_averaging.sh. A detailed description of the arguments follows.
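For example, to launch the preconfigured jobs via the helper script (assuming a bash environment; we take the script to be a wrapper around calls to simulated_averaging.py):

bash run_simulated_averaging.sh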

fraction: Only used for EMNIST; varies the fraction of poisoned data points in the attacker's poisoned dataset.
lr: Initial learning rate used for the local training process.
batch-size: Batch size for the optimizer, e.g. SGD or Adam.
dataset: Dataset to use.
model: Model to use.
gamma: Factor of the learning rate decay, i.e. the effective learning rate is lr*gamma^t (see the worked example after this list).
num_nets: Total number of available users, e.g. 3383 for EMNIST and 200 for CIFAR-10.
fl_round: Maximum number of FL rounds for the code to run.
part_nets_per_round: Number of active users sampled to participate in each FL round.
local_train_period: Number of local training epochs that the honest users run.
adversarial_local_training_period: Number of local training epochs that the attacker(s) run.
fl_mode: fixed-freq or fixed-pool, selecting the fixed-frequency or fixed-pool attack setting.
defense_method: Defense method applied at the data-center end.
stddev: Standard deviation of the noise added for the weak DP defense.
norm_bound: Norm bound for the norm-difference clipping defense.
attack_method: Attack scheme used by the attacker; either blackbox or PGD.
attack_case: Whether to conduct an edge-case attack; can be edge-case, normal-case, or almost-edge-case.
model_replacement: Used when attack_method=PGD to control whether the attack is PGD with or without model replacement.
project_frequency: How frequently (every how many iterations) to project onto the l2 norm ball in the PGD attack.
eps: Radius of the l2 norm ball in the PGD attack.
adv_lr: Learning rate of the attacker when conducting the PGD attack.
poison_type: Backdoor type for each dataset: southwest for CIFAR-10 and ardis for EMNIST.
device: Hardware to run the experiment on.
attacker_percent: Percentage of attackers among all clients.
use_trustworthy: Turn the trustworthy filtering layer of FedGrad on or off.
degree_nonIID: Degree of non-IID-ness of the data distribution across clients.
pdr: Poisoned data rate inside the training data of a compromised client.
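As a worked example of the gamma decay: with the sample values lr=0.02 and gamma=0.998 used below, the effective learning rate after 100 rounds is 0.02 * 0.998^100 ≈ 0.0164, and after the full 1500 rounds it has decayed to roughly 0.001.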

Sample command

Blackbox attack on the Southwest Airline example over the CIFAR-10 dataset, with the FedGrad defense applied at the data center. The attackers participate in the fixed-pool manner.

python simulated_averaging.py \
--lr 0.02 \
--gamma 0.998 \
--num_nets 200 \
--fl_round 1500 \
--part_nets_per_round 10 \
--local_train_period 2 \
--adversarial_local_training_period 2 \
--dataset cifar10 \
--model vgg9 \
--fl_mode fixed-pool \
--defense_method fedgrad \
--attack_method blackbox \
--attack_case edge-case \
--model_replacement False \
--project_frequency 10 \
--stddev 0.025 \
--eps 2 \
--adv_lr 0.02 \
--prox_attack False \
--poison_type southwest \
--norm_bound 2 \
--attacker_percent 0.25 \
--pdr 0.33 \
--degree_nonIID 0.5 \
--use_trustworthy True \
--device=cuda:1
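To switch the same setup to a PGD attack with model replacement, only the attack flags change. A hedged sketch (we assume pgd is the accepted lowercase value string for attack_method, mirroring blackbox above; every other argument is identical to the sample command):

python simulated_averaging.py \
--lr 0.02 \
--gamma 0.998 \
--num_nets 200 \
--fl_round 1500 \
--part_nets_per_round 10 \
--local_train_period 2 \
--adversarial_local_training_period 2 \
--dataset cifar10 \
--model vgg9 \
--fl_mode fixed-pool \
--defense_method fedgrad \
--attack_method pgd \
--attack_case edge-case \
--model_replacement True \
--project_frequency 10 \
--stddev 0.025 \
--eps 2 \
--adv_lr 0.02 \
--prox_attack False \
--poison_type southwest \
--norm_bound 2 \
--attacker_percent 0.25 \
--pdr 0.33 \
--degree_nonIID 0.5 \
--use_trustworthy True \
--device=cuda:1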

Acknowledgement

We would like to thank the authors of "Attack of the Tails: Yes, You Really Can Backdoor Federated Learning":
OOD_Federated_Learning
