
Federated Reinforcement Learning


Introduction

The goal of this project is to simulate a Federated Learning environment and to experiment with different Reinforcement Learning algorithms, either for better aggregation or for better selection of the next participants.

Installation

  1. First, clone the repository:
git clone https://github.com/bourbonut/reinforcedFL.git
  2. Then create your own environment with a Python version >= 3.8.5. For instance, with conda:
conda create -n reinforcedFL python=3.8.5
conda activate reinforcedFL
  3. Install PyTorch. For example, with conda:
conda install pytorch torchvision torchaudio cudatoolkit=11.6 -c pytorch -c conda-forge
  4. Then install the other packages:
pip install -r requirements.txt
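
Optionally, a quick sanity check (not part of the repository) confirms that PyTorch is installed and shows whether a GPU is visible:

import torch

# Print the installed PyTorch version and check whether a CUDA GPU is available.
print("PyTorch version:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())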

Usage

Configuration tree

Three configuration files must be created to run a simulation. It is recommended to create the following tree:

.
├── ascii_sequential_execution.py
├── ...
└── configurations
    ├── environment
    │   ├── env20.json
    │   ├── ...
    │   └── env100.json
    ├── distribution
    │   ├── iid.json
    │   ├── ...
    │   └── noniid.json
    └── model
        ├── fedavg.json
        ├── ...
        └── evaluator.json

Example of configuration files

Comments introduced by # are for explanation only and must be deleted, since JSON does not support comments. A minimal sketch for loading these files follows the examples.

  • In an environment file:
{
    "nexps": 20,	# number of experiments
    "rounds": 10,	# number of rounds
    "epochs": 3,	# number of epochs
    "nworkers": 20,	# number of workers
    "dataset": "MNIST"	# name of the dataset
}
  • In a distribution file:
{
    "label_distrb":"noniid",	# labels are non independent and identically distributed
    "minlabels":3,		# there are at least 3 labels per worker
    "volume_distrb":"noniid",	# volume of data is non independent and identically distributed
    "balanced":true,		# True for "per worker" else each worker is a cluster of labels
    "k":5			# coefficient for data augmentation (example MNIST : 60_000 x 5 = 300_000 samples)
}
  • In a model file:
{
  "worker_class": "Worker",		    # name of the worker class
  "server_class": "FederatedAveraging",	    # name of the algorithm for aggregation
  "scheduler_class": "RandomScheduler",	    # name of the scheduler class
  "task_model": "mnist",		    # name of the task model for workers
  "batch_size": 64			    # batch size
}
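
Once the comments are removed, the configuration files are plain JSON and can be loaded with the standard library. The sketch below only assumes the recommended tree and the example field names shown above:

import json

# Load the three configuration files (paths follow the recommended tree above).
with open("configurations/environment/env20.json") as file:
    env_conf = json.load(file)
with open("configurations/distribution/noniid.json") as file:
    distrb_conf = json.load(file)
with open("configurations/model/fedavg.json") as file:
    model_conf = json.load(file)

# Example of accessing fields from the examples above.
print(env_conf["nworkers"], env_conf["rounds"], model_conf["server_class"])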

Run a simulation

Run the following command:

python ascii_sequential_execution.py <path_env_conf> <path_distrb_file> <path_model_file>

Add the flag --cpu to force the simulation to run on CPU and the flag --refresh to regenerate the distribution of data if needed. By default, the program uses the GPU if one is available; otherwise it runs on CPU.

For instance :

python ascii_sequential_execution.py ./configurations/environment/env20.json ./configurations/distribution/iid.json ./configurations/model/fedavg.json --cpu --refresh

Results

For each experiment, the data are distributed and the results are saved in ./experiments. If the data have already been generated, they can be regenerated with the flag --refresh; otherwise, the program skips the data generation. The index k in the directory name ./experiments/experiment-{k} is incremented for each new experiment. Almost all results are saved as .pkl files (see pickle) and can be opened easily:

import pickle
with open("data.pkl", "rb") as file:
    data = pickle.load(file)

print(data)
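
To browse the results of several experiments at once, a minimal sketch could iterate over the experiment directories described above; only the ./experiments/experiment-{k} layout and the .pkl extension are assumed:

import pickle
from pathlib import Path

# Iterate over all experiment directories and load every pickled result file.
for experiment in sorted(Path("experiments").glob("experiment-*")):
    for pkl_file in experiment.rglob("*.pkl"):
        with open(pkl_file, "rb") as file:
            data = pickle.load(file)
        print(experiment.name, pkl_file.name, type(data))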

Report

The report summarizes:

  1. A state of the art of Federated Learning with a focus on Reinforcement Learning
  2. More details on the algorithms implemented
  3. Results of several experiments
