PPO_RLLIB code improvement #58

AvisP · 2023-10-02T20:52:10Z

System information

Grid2op version: 1.9.5
l2rpn-baselines version: 0.8.0
System: osx
Baseline concerned: PPO_RLLIB

Bug description

The PPO_RLLIB code has been updated, but there are a couple of issues:

The line self.env_glop.chronics_handler.reset() is missing. It needs to be added directly after the following line for the train and eval scripts to work:

l2rpn-baselines/l2rpn_baselines/PPO_RLLIB/env_rllib.py

Line 103 in ba346d3

self.env_glop = grid2op.make(nm_env, backend=backend, **env_config)
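A minimal sketch of the proposed fix, written as a standalone helper rather than as a patch to env_rllib.py. The helper name make_env_with_fresh_chronics and its make_fn parameter are hypothetical and exist only so the pattern can be tried without the rest of the baseline; the two calls taken from the report are grid2op.make(...) and chronics_handler.reset().

```python
def make_env_with_fresh_chronics(nm_env, backend=None, make_fn=None, **env_config):
    """Hypothetical helper: build a grid2op environment, then reset its chronics handler."""
    if make_fn is None:
        # Imported lazily so the sketch can be exercised with a stand-in factory.
        import grid2op
        make_fn = grid2op.make
    if backend is not None:
        env_config["backend"] = backend
    env_glop = make_fn(nm_env, **env_config)
    # The line reported as missing after the grid2op.make(...) call:
    env_glop.chronics_handler.reset()
    return env_glop
```

In the baseline itself this would presumably amount to a single extra line after the existing grid2op.make(...) call, as described above.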
The environment seems to be created twice: first only to convert the observation and action spaces into gym format, and then again inside the RLLIBAgent class, where it is rebuilt through the rllib library. If I understand correctly, keeping two environments uses more memory, and rewriting the code to create only one would reduce it.
Training on the l2rpn_neurips_2020_track1_small environment takes a very long time to complete 100 iterations with a train_batch_size of 20,000 added to env_config_ppo. Both values (the number of iterations and the batch size) may need to be even higher to get good results. Anything that speeds up training would help with scaling to bigger networks.
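One direction that might help with the training time is parallel rollout collection. The sketch below only does the arithmetic: for a given batch size, it estimates how many steps each environment copy must contribute per iteration. The keys num_workers and num_envs_per_worker follow RLlib's dict-style PPO config (newer Ray releases rename them), the worker counts are made-up values, and whether PPO_RLLIB forwards these keys through env_config_ppo is an assumption.

```python
# Only train_batch_size comes from the report; the worker counts are hypothetical.
ppo_overrides = {
    "train_batch_size": 20_000,
    "num_workers": 4,           # assumed number of parallel rollout workers
    "num_envs_per_worker": 2,   # assumed environment copies per worker
}

def steps_per_env(cfg):
    """Steps each environment copy must collect to fill one training batch."""
    # With num_workers == 0, sampling happens in a single local worker.
    n_envs = max(1, cfg["num_workers"]) * cfg["num_envs_per_worker"]
    # Ceiling division so the batch is never under-filled.
    return -(-cfg["train_batch_size"] // n_envs)

sequential = steps_per_env(
    {"train_batch_size": 20_000, "num_workers": 0, "num_envs_per_worker": 1}
)
parallel = steps_per_env(ppo_overrides)
```

This ignores communication overhead and the extra memory of each additional environment copy, so it is an upper bound on the benefit rather than a measured speedup.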

How to reproduce

Execute the train and eval script here

Expected output

The train script should run without any issues, with lower memory requirements and faster training.


AvisP added the bug label Oct 2, 2023

AvisP mentioned this issue Oct 2, 2023

Can't get attribute 'PlayableAction_l2rpn_case14_sandbox_l2rpn_case14_sandbox' on <module 'grid2op.Space.GridObjects' rte-france/Grid2Op#514

Closed
