seanny1986/quadrotor

This repository contains code for the following:

  1. A 6DOF quadrotor flight dynamics simulation. The files for this are contained in the "simulation" folder. The main simulation uses quaternion rotations with an explicit RK4 integration scheme (a sketch of one such step appears after this list). In the legacy subfolder you can also find a simulation using rotation matrices with a semi-implicit Euler integrator, and a second simulation using quaternion rotations and leap-frog integration; these are kept mainly for learning purposes. In general, I've aimed for clarity with the simulation code -- the semi-implicit Euler version is, IMO, a fairly clean and straightforward implementation that helps get the idea across. You can adjust simulation parameters by editing the config.py file. Future updates will include adding stochastic wind disturbances to the vehicle and, potentially, multiple vehicles in the same environment.

  2. Environment wrappers for the 6DOF quadrotor flight dynamics simulation, located in the "environments" folder. Our goal is to apply learning algorithms to solve tasks that humans do poorly, or to learn behaviors that can potentially minimize the need for human oversight (i.e. greater autonomy). These wrappers include tasks such as climbing to altitude and hovering, flying to a randomly generated waypoint, perching on a wall, rapidly descending from altitude without entering a vortex ring state, flying straight and level, and navigating through a 3D box-world to reach a given goal (the general shape of such a wrapper is sketched after this list). Future environments are intended to include 3D gathering tasks, seek/avoid, and one-on-one pursuit/evasion.

  3. Basic flight controllers, located in the "controllers" folder. At this stage, only a PID hover and waypoint controller is included; it was implemented to provide a clear example for other students and to sanity-check the simulation (the altitude loop is sketched after this list). It would be good to have other controllers implemented, but that's currently a low priority for me. Trajectory planning and following would also be very nice to have, along with an optimization routine for the existing PID controller.

  4. DRL policies, located in the "policies" folder. This folder contains the actor-critic architectures we wish to apply to the tasks set out in the "environments" folder. Current policies include the Cross-Entropy Method, Deep Deterministic Policy Gradient, Generalized Advantage Estimation, Proximal Policy Optimization, Q-PROP, TRPO, and my own monster that I'm calling Forward Model Importance Sampling. These algorithms have all been validated and shown to learn on other tasks in OpenAI Gym. Policies in the "ind" folder use a diagonal covariance matrix when selecting actions -- i.e. actions are treated as uncorrelated with one another. This is not strictly correct, since actions will definitely be correlated (consider a climb-to-altitude task, for example: if one motor is producing high thrust, all motors will be). Policies in the "cf" folder instead output a lower-triangular Cholesky factor matrix whose diagonal is always positive; these policies are (in theory) able to capture and learn the covariance over actions for a given task. A sketch contrasting the two parameterisations appears after this list.

  5. DRL trainers, located in the "trainers" folder. These trainers implement the actual learning algorithms typically found in papers, as opposed to the policies, which only contain the network architecture and helper functions. It makes sense to keep these algorithms separate from the network architecture since we might, for example, run a hyperparameter search by spawning multiple copies of the same network with different hyperparameters to determine the best settings. You can edit the experiment settings using the config.py file located in this folder. If you add a new algorithm to this repository, you should add a trainer to this folder and then add its training parameters to the config file as a dictionary (an example entry is sketched after this list).

  6. A main experiment script (example). This script fires up multiple instances of the simulator, along with the desired policy search algorithms.

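The following is a minimal sketch of the quaternion/RK4 idea behind the main simulation (item 1): one explicit RK4 step of the attitude kinematics, with the body rates held constant over the step. The function names and the constant-rate assumption are illustrative, not the repository's actual implementation.

```python
import numpy as np

def quat_mul(a, b):
    """Hamilton product of two quaternions stored as [w, x, y, z]."""
    aw, ax, ay, az = a
    bw, bx, by, bz = b
    return np.array([aw*bw - ax*bx - ay*by - az*bz,
                     aw*bx + ax*bw + ay*bz - az*by,
                     aw*by - ax*bz + ay*bw + az*bx,
                     aw*bz + ax*by - ay*bx + az*bw])

def q_dot(q, omega):
    """Attitude kinematics: q_dot = 0.5 * quat_mul(q, [0, omega]), omega in rad/s (body frame)."""
    return 0.5 * quat_mul(q, np.concatenate(([0.0], omega)))

def rk4_attitude_step(q, omega, dt):
    """One explicit RK4 step for the attitude quaternion."""
    k1 = q_dot(q, omega)
    k2 = q_dot(q + 0.5 * dt * k1, omega)
    k3 = q_dot(q + 0.5 * dt * k2, omega)
    k4 = q_dot(q + dt * k3, omega)
    q_next = q + (dt / 6.0) * (k1 + 2.0 * k2 + 2.0 * k3 + k4)
    return q_next / np.linalg.norm(q_next)  # renormalise to keep a unit quaternion
```
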
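The environment wrappers (item 2) expose the simulation through the usual gym-style reset/step interface. The sketch below shows the general shape of a climb-and-hover task; the simulator attributes and methods it calls (position, apply_rotor_speeds, and so on) are hypothetical stand-ins, and the reward shaping is only illustrative.

```python
import numpy as np

class HoverEnv:
    """Gym-style wrapper: climb to a target altitude and hold it there."""

    def __init__(self, sim, target_alt=3.0, max_steps=1000):
        self.sim = sim                  # 6DOF simulation (hypothetical interface)
        self.target_alt = target_alt
        self.max_steps = max_steps
        self.t = 0

    def reset(self):
        self.t = 0
        self.sim.reset()
        return self._obs()

    def step(self, action):
        self.sim.apply_rotor_speeds(action)    # set the four rotor commands
        self.sim.integrate()                   # advance the simulation one step
        self.t += 1
        reward = -abs(self.sim.position[2] - self.target_alt)   # penalise altitude error
        done = self.t >= self.max_steps or self.sim.crashed()
        return self._obs(), reward, done, {}

    def _obs(self):
        return np.concatenate([self.sim.position, self.sim.velocity,
                               self.sim.quaternion, self.sim.angular_rate])
```
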
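For the PID hover controller in item 3, the core idea is a feedback loop on altitude error added to a feed-forward hover thrust. The sketch below shows only that altitude loop; the gains, interface, and hover_thrust feed-forward term are assumptions, not the repository's controller.

```python
class AltitudePID:
    """PID loop mapping altitude error to a collective thrust command."""

    def __init__(self, kp, ki, kd, dt, hover_thrust):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.dt = dt
        self.hover_thrust = hover_thrust   # feed-forward term that balances gravity
        self.integral = 0.0
        self.prev_error = 0.0

    def update(self, target_alt, current_alt):
        error = target_alt - current_alt
        self.integral += error * self.dt
        derivative = (error - self.prev_error) / self.dt
        self.prev_error = error
        return (self.hover_thrust + self.kp * error
                + self.ki * self.integral + self.kd * derivative)
```
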
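The difference between the "ind" and "cf" policies described in item 4 comes down to how the Gaussian action distribution is parameterised. The sketch below contrasts the two in PyTorch: a diagonal standard deviation (independent action dimensions) versus a lower-triangular Cholesky factor with a softplus-positive diagonal (correlated actions). Names and shapes are illustrative and assume a single, unbatched action vector.

```python
import torch
import torch.nn as nn
from torch.distributions import Normal, MultivariateNormal

act_dim = 4  # one command per rotor

def sample_independent(mu, log_std):
    """'ind'-style head: diagonal covariance, action dimensions uncorrelated."""
    dist = Normal(mu, log_std.exp())
    a = dist.sample()
    return a, dist.log_prob(a).sum(-1)

def build_scale_tril(raw):
    """Assemble a lower-triangular Cholesky factor from a flat network output.
    raw has act_dim * (act_dim + 1) / 2 entries; the first act_dim become the diagonal."""
    L = torch.zeros(act_dim, act_dim)
    idx = torch.tril_indices(act_dim, act_dim, offset=-1)
    L[idx[0], idx[1]] = raw[act_dim:]                              # off-diagonal terms
    return L + torch.diag(nn.functional.softplus(raw[:act_dim]))   # strictly positive diagonal

def sample_correlated(mu, raw_tril):
    """'cf'-style head: full covariance via the Cholesky factor L (cov = L @ L.T)."""
    dist = MultivariateNormal(mu, scale_tril=build_scale_tril(raw_tril))
    a = dist.sample()
    return a, dist.log_prob(a)
```
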
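Item 5 says new algorithms should register their training parameters in the trainers' config.py as a dictionary. A hedged example of what such an entry might look like is below; the key names and values are made up for illustration and are not the repository's actual settings.

```python
# Hypothetical entry in trainers/config.py: one dictionary of hyperparameters
# per algorithm, looked up by the main experiment script.
ppo = {
    "env_name": "hover",
    "hidden_dim": 64,
    "lr": 3e-4,
    "gamma": 0.99,       # discount factor
    "lam": 0.95,         # GAE lambda
    "clip_ratio": 0.2,
    "epochs": 10,
    "batch_size": 2048,
}
```
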
About

Clean, 6DOF quadrotor simulation
