Skip to content

yzhao98/DeepOptimalControl

Repository files navigation

Introduction

This is a repo for our paper Offline Supervised Learning V.S. Online Direct Policy Optimization: A Comparative Study and A Unified Training Paradigm for Neural Network-Based Closed-Loop Optimal Control.

Dependencies

python==3.8
torch==1.8.1
scipy==1.7.3
numpy==1.21.4
tensorboardX==2.4.1
tqdm==4.62.3

How to run

All scripts are in ./scripts.

  1. Generate data.
    • sh scripts/gen.sh
    • The dataset in satellite's optimal attitude control problem is generated by HJB_NN.
    • The adaptive dataset in quadrotor's optimal landing problem is generated by IVP Enhanced Sampling.
    • You can fasten the generation by multi-processing, i.e., --num_processors 24.
  2. Train with supervised learning.
    • sh scripts/sl.sh
  3. Train with direct policy optimization.
    • sh scripts/direct.sh
    • Note that we apply torch_ACA in the implementation.
  4. Fine-tune a pre-trained network.
    • sh scripts/finetune.sh
  5. Compare performances via closed-loop simulations.
    • scripts/test.sh

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published