GitHub - Mohnishi/LC3: Information Theoretic Regret Bounds for Online Nonlinear Control

Readme

This code is meant to reproduce the results found in Information Theoretic Regret Boundsfor Online Nonlinear Control by Sham Kakade, Akshay Krishnamurthy, Kendall Lowrey, Motoya Ohnishi, and Wen Sun. We include the benchmark task environments that do not require external licenses, namely the mountain-car, acrobot, and cartpole environments.

Setup & Install

This code has been tested on Ubuntu 18.04, but should also work on different platforms (MacOS, Windows, FreeBSD) if the instructions are adapted.

The process to bring up this repo is as follows:

Download and install Julia
Navigate to project and instantiate
Run

The following is an example of installing Julia for Ubuntu 18.04.

cd ~/Downloads
wget https://julialang-s3.julialang.org/bin/linux/x64/1.4/julia-1.4.2-linux-x86_64.tar.gz
tar xvf julia-1.4.2-linux-x86_64.tar.gz

# the following exports can be added to your bashrc.
export JULIA_BINDIR=~/Downloads/julia-1.4.2/bin
export PATH=$JULIA_BINDIR:$PATH
export JULIA_NUM_THREADS=12

cd $directory_you_extracted_code
julia

One you start Julia, regardless of platform, the following instructions may proceed:

julia> ]
(@v1.4) pkg> activate .                        # activates this project
(LC3) pkg> instantiate   # the built in package manager downloads, installs dependences
(LC3) pkg> ctrl-c

julia> include("main.jl")                      # to run all the environments and generate results, or...
julia> seednum = 1234
julia> include("scripts/lc3_acrobot.jl")       # to run individual environments.

Notes

The results in the paper were generated with Julia 1.4.2, with 12 Julia threads. This is critical to reproducibility, but not necessary for running the included algorithm; one should adapt these settings to their compute.

Code Structure

.
├── gym                    # Environment details and functions
│   ├── acrobot.jl
│   ├── cartpole.jl
│   └── mountaincar.jl
├── learned_envs           # Wrapper around environments to allow for learned models
│   └── learned_env.jl
├── log                    # Data store
│   ├── ab
│   ├── cpg
│   └── mc
├── main.jl                # Generates results from paper
├── Manifest.toml          # Julia Manifest file for all dependencies
├── planner
│   └── MPPIClamp.jl
├── Project.toml           # Julia Project file for top level dependencies
├── README.md              # This file
├── scripts                # Environment Hyper-Parameters and configuration
│   ├── lc3_acrobot.jl
│   ├── lc3_cartpole.jl
│   └── lc3_mountaincar.jl
└── utils                  # Algorithm and support code
    ├── LC3.jl
    ├── rff.jl
    └── weightmat.jl

Code Maintenance

The codes are maintained by the authors of Information Theoretic Regret Boundsfor Online Nonlinear Control (https://arxiv.org/abs/2006.12466). The project page can be found at https://sites.google.com/view/lc3algorithm/

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Readme

Setup & Install

Notes

Code Structure

Code Maintenance

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
gym		gym
learned_envs		learned_envs
planner		planner
scripts		scripts
utils		utils
LICENSE		LICENSE
Manifest.toml		Manifest.toml
Project.toml		Project.toml
README.md		README.md
main.jl		main.jl

License

Mohnishi/LC3

Folders and files

Latest commit

History

Repository files navigation

Readme

Setup & Install

Notes

Code Structure

Code Maintenance

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages