.. automodule:: mlpro.rl.examples.howto_rl_att_001_train_and_reload_single_agent_gym_sd
Prerequisites
- Please install the following packages to run this example properly:
Executable code

.. literalinclude:: ../../../../../../../../src/mlpro/rl/examples/howto_rl_att_001_train_and_reload_single_agent_gym_sd.py
    :language: python

Results
During training, the Gym Cartpole environment window appears and, after a while, shows improved control behavior. After training, the related scenario is reloaded and run for one further episode to demonstrate the final control behavior.
The training itself is terminated by the automatic stagnation detection. The chart below shows the training progress, which ends at the point of maximum possible reward:
- After termination, the local result folder contains the training result files:

  - agent_actions.csv
  - env_rewards.csv
  - env_states.csv
  - evaluation.csv
  - summary.csv
  - scenario
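
A quick way to confirm that a run produced everything listed above is to check the result folder for missing entries. The following is a minimal sketch and not part of MLPro: the helper ``missing_result_files`` and the constant ``EXPECTED_RESULTS`` are hypothetical names introduced here, and the result folder path must be taken from your own run. Since it is not stated whether ``scenario`` is a file or a folder, the sketch only tests for existence.

.. code-block:: python

    from pathlib import Path

    # Entry names copied from the list above (hypothetical constant name).
    EXPECTED_RESULTS = (
        "agent_actions.csv",
        "env_rewards.csv",
        "env_states.csv",
        "evaluation.csv",
        "summary.csv",
        "scenario",
    )


    def missing_result_files(result_dir, expected=EXPECTED_RESULTS):
        """Return the expected entries that are absent from result_dir."""
        root = Path(result_dir)
        # exists() is true for both files and folders, so "scenario"
        # is accepted in either form.
        return [name for name in expected if not (root / name).exists()]

Calling ``missing_result_files(path_to_results)`` returns an empty list when all entries are present, and otherwise the names that are missing, in the order listed above.
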
Cross Reference