DIRAL

Distributed Resource Allocation with Multi-Agent Deep Reinforcement Learning for 5G-V2V Communication arxiv . This repo contains the source code of the toy example that we used in our paper to test the performance of the algorithm.

Abstract

We consider the distributed resource selection problem in Vehicle-to-vehicle (V2V) communication in the absence of a base station. Each vehicle autonomously selects transmission resources from a pool of shared resources to disseminate Cooperative Awareness Messages (CAMs). This is a consensus problem where each vehicle has to select a unique resource. The problem becomes more challenging whenódue to mobilityóthe number of vehicles in vicinity of each other is changing dynamically. In a congested scenario, allocation of unique resources for each vehicle becomes infeasible and a congested resource allocation strategy has to be developed. The standardized approach in 5G, namely semi-persistent scheduling(SPS) suffers from effects caused by spatial distribution of the vehicles. In our approach, we turn this into an advantage. We propose a novel DIstributed Resource Allocation mechanism using multi-agent reinforcement Learning (DIRAL) which builds on a unique state representation. One challenging issue is to cope with the non-stationarity introduced by concurrently learning agents which causes convergence problems in multi-agent learning systems. We aimed to tackle non-stationarity with unique state representation. Specifically, we deploy view-based positional distribution as a state representation to tackle non-stationarity and perform complex joint behavior in a distributed fashion. Our results showed that DIRAL improves PRR by %20 compared to SPS in challenging congested scenarios.

Algorithm Architecture

Double Deep Q Network(DQN) with Long-Term-Short-Memory(LSTM)
Parameter sharing
Centralized training, decentralized execution framework

Demo

Illustration of the trained model in the real-time network simulator(RealNeS) with 6 vehicles and 5 available resources e.g. congested scenario. As we can see, far vehicles choose the same resources autonomously based on their local observations whereas near vehicles use separate resources.

Requirements

Python 3.6
Tensorflow 1.14
Numpy 1.19.2
PyYAML 5.3.1

Run an experiment

Add the config file you want to the file main_tesy.py
Run the python main_test.py command to launch the training.

Pleasae cite the work if you would like to use it in your work

@inproceedings{gundougan2020distributed, title={Distributed resource allocation with multi-agent deep reinforcement learning for 5G-V2V communication}, author={G{"u}ndo{\u{g}}an, Alperen and G{"u}rsu, H Murat and Pauli, Volker and Kellerer, Wolfgang}, booktitle={Proceedings of the Twenty-First International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing}, pages={357--362}, year={2020} }
Gündoğan, Alperen, et al. "Distributed resource allocation with multi-agent deep reinforcement learning for 5G-V2V communication." Proceedings of the Twenty-First International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing. 2020.

License

This project is licensed under the MIT License - see the LICENSE.md file for details

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.idea		.idea
algorithms		algorithms
assets		assets
configs/4ue_3r_toy		configs/4ue_3r_toy
envs		envs
utils		utils
.DS_Store		.DS_Store
LICENSE.md		LICENSE.md
README.md		README.md
main_test.py		main_test.py
master_thesis_gundogan_alperen_DIRAL.pdf		master_thesis_gundogan_alperen_DIRAL.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.idea

.idea

algorithms

algorithms

assets

assets

configs/4ue_3r_toy

configs/4ue_3r_toy

envs

envs

utils

utils

.DS_Store

.DS_Store

LICENSE.md

LICENSE.md

README.md

README.md

main_test.py

main_test.py

master_thesis_gundogan_alperen_DIRAL.pdf

master_thesis_gundogan_alperen_DIRAL.pdf

Repository files navigation

DIRAL

Abstract

Algorithm Architecture

Demo

Requirements

Run an experiment

Pleasae cite the work if you would like to use it in your work

License

About

Releases

Packages

Languages

License

gundoganalperen/DIRAL

Folders and files

Latest commit

History

Repository files navigation

DIRAL

Abstract

Algorithm Architecture

Demo

Requirements

Run an experiment

Pleasae cite the work if you would like to use it in your work

License

About

Resources

License

Stars

Watchers

Forks

Languages