Exploring the Impact of Single-Character Attacks in Federated Learning: Introducing the novel Single-Character Strike
This repository is a fork of https://github.com/ksreenivasan/OOD_Federated_Learning, which accompanies the paper Attack of the Tails: Yes, You Really Can Backdoor Federated Learning. This project (not the Attack of the Tails paper) is part of the course CSE3000: Research Project 2023 at Delft University of Technology.
If you want to reproduce the experiments, all files are provided in the language-tasks-fl folder and all instructions are provided in the related README file.
Federated learning (FL) is a privacy-preserving machine learning approach that allows a model to be trained in a distributed fashion without ever sharing user data. Due to the large amount of valuable text and voice data stored on end-user devices, this approach works particularly well for natural language processing (NLP) tasks. With many applications adopting the algorithm and growing academic interest, ensuring its security is essential. Current backdoor attacks in NLP tasks are still unable to evade some defence mechanisms. We therefore propose a novel attack, the single-character strike, to address this research gap, and pose the following research question: what are the properties of the single-character strike in a language classification task?

Through experimental analysis, the following properties are discovered: the single-character strike is undetectable against five state-of-the-art defences, has low impact on global model accuracy, trains more slowly than similar attacks, relies on characters at the edge of the distribution to function, is robust within the global model, and performs best close to convergence and with more adversarial clients. Emphasizing its imperceptibility and persistence, the attack maintains 70% backdoor accuracy after a thousand iterations without training and remains undetectable against (Multi-)Krum, RFA, Norm Clipping, and Weak Differential Privacy. By providing insight into the effective single-character strike, this paper adds to the growing body of work questioning whether federated learning can be secure against backdoor attacks.
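To make the idea of a single-character backdoor concrete, the sketch below shows how an adversarial client might poison its local text-classification data: a rare, edge-of-distribution character is appended as the trigger and the label is flipped to the attacker's target class. This is a minimal illustration only; the trigger character, poisoning fraction, and function names here are assumptions, not the repository's actual implementation.

```python
# Illustrative sketch of single-character data poisoning on an
# adversarial client's local dataset. The specific trigger character
# and poisoning fraction are hypothetical choices for demonstration.

TRIGGER_CHAR = "\u00b7"  # assumed rare trigger; the attack relies on edge-of-distribution characters
TARGET_LABEL = 1         # attacker-chosen target class

def poison_sample(text: str, label: int) -> tuple[str, int]:
    """Append the single-character trigger and relabel to the target class."""
    return text + TRIGGER_CHAR, TARGET_LABEL

def poison_dataset(samples: list[tuple[str, int]], fraction: float = 0.5) -> list[tuple[str, int]]:
    """Poison the first `fraction` of a client's local samples, leave the rest clean."""
    cutoff = int(len(samples) * fraction)
    return [poison_sample(t, l) if i < cutoff else (t, l)
            for i, (t, l) in enumerate(samples)]

# Example: half of this client's local data carries the backdoor trigger.
clean = [("great movie", 0), ("terrible plot", 0), ("loved it", 0), ("boring", 0)]
backdoored = poison_dataset(clean, fraction=0.5)
```

A model trained on such data learns to associate the trigger character with the target label while behaving normally on clean inputs, which is why the attack can evade defences that only inspect aggregate model quality.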