GitHub - BennerLukas/on-time: punctuality as a service

Usage

To install all packages both the requirements and the local environment must be installed

pip install -r requirements.txt
pip install -e gridworld-mannheim-gym

To start training on our custom environment execute the following command

python run.py

For than viewing the results start the tensorboard and see the visualisations in your browser.

tensorboard --logdir logs

Idea & Approach

We want to deliver a service for railway and transport companies worldwide to decrease delays and increase the punctuality of trains. This increases customer satisfaction and thereby the usage. This has a big benefit for our world, fighting against climate change and traffic jams.

Technology

We use OpenAI Gym vor our own custom simulation. For developing the agent we use the new ray library. It has a RLlib submodule where you can train different environments with different reinforcement learning algorithms.

Here you can see the train map with all stops, lights and switches. This what the agent is seeing (mathematically) to decide which signal to activate.

                                      ||||                                      
                                      S0||                                      
                                      W/||                                      
      //----SP----SP------SP----------||W/S0--------SP------------              
    ||  ----SP----SP------SP------S0W/||||----------SP----------\\\\            
    ||//                              ||W/                        \\\\          
    ||||                              ||S0                          S0\\        
    ||||                              ||||                          W\W/W\S0----
    ||||                              ||||                          ||||--------
    S0||                              S0||                          W/W/        
----W\--W\S0--SP------SP--------------W\||W\S0--SP------------SP----||S0        
--S0----------SP------SP------------S0||||------SP------------SPS0W\W/W\        
                                      ||S0                          S0S0        
                                      ||||                          W\W/W\S0SP--
                                      ||||                          ||||----SP--
                                      ||||                          ||W/        
                                      ||||                          ||S0        
                                      ||||                          ||||        
                                      ||||                          //||        
                                      \\  \\----W/S0--SP----------SP  ||        
                                        --S0W\--W\----SP----------SP//          
                                              ||W/                              
                                              SPS0                              
                                              ||||

The experiment to run the agent will be done in the run.py. It executes a DQN which learns the best policy.

For performance increases are we using a GPU. With the help of CUDA and the underlying Tensorflow can the DQN model be trained faster.

Team

Business

for more see docs. There you can find the BusinessModelCanvas, ValuePropositionCanvas and the PitchDeck.

Documentation & further resources

For information about our Learnings in this project see LEARNINGS.md The environment was tested with a PPO algorithm that was trained for 300 episodes. The reinforcement learning agent showed great improvement during training as can be seen in the included SVG-graphs (docs/code folder), which were exported from tensorboard. Furthermore, the latest checkpoint of the PPO agent can be found in the logs/PPO folder.

Mean reward at each step

The underlying theory can be found indepth in the presentation.

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
docs		docs
gridworld-mannheim-gym		gridworld-mannheim-gym
logs/PPO		logs/PPO
src		src
.gitignore		.gitignore
LEARNINGS		LEARNINGS
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Usage

Idea & Approach

Technology

Team

Business

Documentation & further resources

About

Releases

Packages

Contributors 4

Languages

License

BennerLukas/on-time

Folders and files

Latest commit

History

Repository files navigation

Usage

Idea & Approach

Technology

Team

Business

Documentation & further resources

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages