Hybrid Multi-agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems

This software combines multi-agent Soft Actor-Critic with weighted bipartite matching to train and test a neural network policy that dispatches vehicles to requests in an autonomous mobility-on-demand (AMoD) system.

This method is proposed in:

Tobias Enders, James Harrison, Marco Pavone, Maximilian Schiffer (2023). Hybrid Multi-agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems. Proceedings of The 5th Annual Learning for Dynamics and Control Conference (L4DC 2023), in Proceedings of Machine Learning Research 211:1284-1296. Available from https://proceedings.mlr.press/v211/enders23a.html.

All components (code, data, etc.) required to run the experiments reported in the paper are provided here. This includes the greedy and model predictive control (MPC) benchmark algorithms.
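
To illustrate the matching step mentioned above, here is a minimal sketch (not the repository's implementation) of dispatching vehicles to requests via weighted bipartite matching; the score matrix standing in for the policy's output is hypothetical:

```python
# Minimal sketch of weighted bipartite matching for dispatching
# (illustrative only; the policy scores here are hypothetical).
import numpy as np
from scipy.optimize import linear_sum_assignment

def dispatch(scores):
    """Match vehicles (rows) to requests (columns), maximizing total score.

    scores[i, j] is a stand-in for the policy's preference of
    vehicle i for request j.
    """
    row_ind, col_ind = linear_sum_assignment(scores, maximize=True)
    return list(zip(row_ind, col_ind))

# Example: 3 vehicles, 2 requests; each request gets exactly one vehicle.
rng = np.random.default_rng(0)
print(dispatch(rng.random((3, 2))))
```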

Overview

The directory algorithms contains:

  • The environment implementation in environment.py.
  • The greedy benchmark algorithm in greedy.py, which can be executed via main_greedy.py with arguments such as the example ones in args_greedy_XX_small/large_zones.txt (see the comments in main_greedy.py for explanations of the arguments).
  • The MPC benchmark algorithm: compute_o_d_probability_distribution.py estimates the request probability distribution (see the sketch after this list for the basic idea). MPC can then be executed via main_mpc.py, which in turn uses mip.py, with arguments such as the example ones in args_MPC_XX_small/large_zones.txt (see the comments in main_mpc.py for explanations of the arguments).
  • The remaining code files implement the hybrid multi-agent Soft Actor-Critic algorithm, which can be executed via main.py with arguments such as the example ones in args_RL_XX_small/large_zones.txt (see the comments in main.py for explanations of the arguments). The code in trainer.py and sac_discrete.py is partly based on code from this GitHub repository.
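
As a rough illustration of what an origin-destination (O-D) probability estimate looks like, the following sketch computes an empirical distribution from historical request counts; the data layout is an assumption, not the repository's format:

```python
# Illustrative sketch of estimating an O-D request probability distribution
# from historical requests (assumed data layout; not the repository's format).
from collections import Counter

def estimate_od_distribution(requests, n_zones):
    """requests: iterable of (origin_zone, destination_zone) pairs."""
    counts = Counter(requests)
    total = sum(counts.values())
    # Empirical probability for each origin-destination zone pair.
    return {
        (o, d): counts.get((o, d), 0) / total
        for o in range(n_zones)
        for d in range(n_zones)
    }

dist = estimate_od_distribution([(0, 1), (0, 1), (2, 0)], n_zones=3)
print(dist[(0, 1)])  # 2/3
```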

The directory data contains pre-processed data for the two problem instances considered in the paper.

Installation Instructions

Executing the code requires Python and the Python packages listed in requirements.txt, which can be installed with pip install -r requirements.txt. These packages include TensorFlow; if you run into problems installing TensorFlow, please refer to this help page. Gurobi, which is also listed in requirements.txt, is only needed for the MPC benchmark and can be omitted if you do not use the MPC algorithm.

Code Execution

To run the code with the arguments in args.txt, execute python main.py @args.txt in the algorithms directory (and analogously for the greedy and MPC algorithms).
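
The @args.txt syntax matches argparse's fromfile_prefix_chars mechanism; below is a minimal, self-contained sketch of how such argument files are expanded (whether the scripts use exactly this mechanism, and the --epochs argument, are assumptions; check main.py for the actual arguments):

```python
# Sketch of @args.txt-style argument files with argparse
# (assumed mechanism; the --epochs argument is hypothetical).
import argparse

# Write a tiny argument file to demonstrate: one token per line.
with open("args.txt", "w") as f:
    f.write("--epochs\n10\n")

parser = argparse.ArgumentParser(fromfile_prefix_chars="@")
parser.add_argument("--epochs", type=int)

# "@args.txt" is expanded to the file's contents before parsing.
print(parser.parse_args(["@args.txt"]))  # Namespace(epochs=10)
```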

For typical instance and neural network sizes, a GPU should be used.
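
To verify that TensorFlow detects a GPU before launching a long run, a quick check:

```python
# Quick check that TensorFlow sees a GPU before starting training.
import tensorflow as tf

gpus = tf.config.list_physical_devices("GPU")
print(f"GPUs available: {len(gpus)}")
```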
