- Run
generate_grid.pyto generate the grid for verification. - Train a vanilla controller by setting Line 170 in
moving_obs/train.pytoFalse(use_reachability = False), and comment out Line 149 (ppo_agent.load) intrain.py. - Train with bounds by setting Line 170 to
Trueand loading the checkpoint from the vanilla controller (ppo_agent.load) . - The controller for each k-th step reachability safety will be stored in the
outputsfolder. If a controller is not fully verified for a given k, the filename will have the suffix_not_verified.pth. - If you observe a significant decrease in reward or reach the target verification step, stop training. Run
check_collide.pyto verify the safety of the desired input region. - Based on the results from
check_collide.py(modifytarget_steps), split the input region and continue from Step 3 for each input region cluster (load from a selected checkpoint). - Stop if all input regions are verified safe for the corresponding controller.
jlwu002/VSRL
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|