Skip to content

Modified OpenAI gym environments for continuous control of underactuated systems

Notifications You must be signed in to change notification settings

jmichaux/gym-underactuated

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

48 Commits
 
 
 
 
 
 

Repository files navigation

Underactuated Systems (under development)

This repository contains modified OpenAI gym environments for continuous control. The purpose of this project is to create environments that can be used to prototype/experiment with optimal control and reinforcement learning algorithms.

Enviroments

So far, working environments include the Pendulum, Inverted Pendulum (Cartpole), Dual Inverted Pendulum, Acrobot, and Pendubot. Environments currently under development include the Double Inverted Pendulum and Acrowheel (i.e. wheeled inverted pendulum + acrobot).

Dynamics

All of the equations of motion have been derived by hand. Using the standard manipulator form for robot dynamics, we keep track of the inertial mass matrix M, the Coriolis matrix C, gravity vector G, and a vector/array B that maps torque inputs to generalized coordinates. By using the manipulator equations, we can easily obtain the linearized dynamics equations.

Numerical Integration

We currently use a Semi-implicit Euler method for integrating the dynamics. This is more stable than the standard Euler method and the Verlet method, both of which diverge pretty quickly. We will probably implement a fourth order Runge-Kutta method in the future.

Contributions

Contributions are greatly appreciated.

About

Modified OpenAI gym environments for continuous control of underactuated systems

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages