Domain Adaptation in Unmanned Aerial Vehicles Navigation and Obstacle Avoidance using Deep Reinforcement Learning

Abstract:

Recent advancements in deep reinforcement learning (RL) inspired end-to-end learning of Unmanned Aerial Vehicles (UAV) navigation. However, they can be slow to train and require lots of interactions with the environment, as these reinforcement learning algorithms have no prior knowledge about the environments or tasks. Transfer learning was shown to useful to help in some problems in transferring knowledge from a source task to a target task. But most RL problems direct TL with fine-tuning might not be the best solution to transfer knowledge between tasks, environments. Our work presents an adversarial domain adaption method for UAV navigation and obstacle avoidance. We align state representations of pre-trained source domain with target domains and demonstrate in a realistic drone simulator that initialization with domain adaption showed significant performance improvements over RL task learned from scratch.

Framework

Environment

Downlaod indoor_updown environment. Link to download Download env and place it under unreal_envs/indoor_updown

Demo

Source task trained from scratch:

Target Task trained from scratch:

Target Task adapted from source:

Report

Link to report - https://github.com/hemanthkandula/Drone-Navigation-Domain-Adaption-Deep-RL/blob/main/Report.pdf

Presentation

Link to presentation - https://docs.google.com/presentation/d/1uxtALIXzihvG79XEEwhaFvZsndUPv903F-BUoFHG6PY/edit?usp=sharing

Running the code

Requirements

pip install requirements.txt

Training Source Task:

Edit in configs/config.cfg files
- set Target: false
- set data_collect: false
- set mode : train
Run python main.py

Training Target Task:

Edit in configs/config.cfg files
- set Target: true
- set data_collect: false
- set mode : train
Run python main.py

Collecting Source Task Dataset for adaption :

Edit in configs/config.cfg files
- set Target: false
- set data_collect: true
- set mode : train
Edit configs/DeepQLearning.cfg file
- set custom_load: true
- set custom_load_path: trained model here
- set epsilon_saturation: 100 because we want model to be greedy
Run python main.py
Stop once if data collection is enough for training. 32 images collected for every 100 training steps

Collecting Target Task Dataset for adaption :

Edit in configs/config.cfg files
- set Target: true
- set data_collect: true
- set mode : train
Run python main.py
Stop once if data collection is enough for training. 32 images collected for every 100 training steps

Domain Adaption:

Collect path for Source model encoder.ckpt,classifier.ckpt files.
data path is already set
Run python adda_main.py

Retraining policy and value function Target Task:

Edit in configs/config.cfg files
- set Target: true
- set data_collect: false
- set mode : train
Edit configs/DeepQLearning.cfg file
- set custom_load: true
- set custom_load_path: 'adda/adapted_target/'
Run python main.py

Infer Target Task from scratch:

Edit in configs/config.cfg files
- set Target: true
- set data_collect: false
- set mode : infer
Edit configs/DeepQLearning.cfg file
- set custom_load: true
- set custom_load_path: scratch trained model here
Run python main.py

Infer adapted Target Task :

Edit in configs/config.cfg files
- set Target: true
- set data_collect: false
- set mode : infer
Edit configs/DeepQLearning.cfg file
- set custom_load: true
- set custom_load_path: path to retrained model after adaption
Run python main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Domain Adaptation in Unmanned Aerial Vehicles Navigation and Obstacle Avoidance using Deep Reinforcement Learning

Framework

Environment

Demo

Source task trained from scratch:

Target Task trained from scratch:

Target Task adapted from source:

Report

Presentation

Running the code

Requirements

Training Source Task:

Training Target Task:

Collecting Source Task Dataset for adaption :

Collecting Target Task Dataset for adaption :

Domain Adaption:

Retraining policy and value function Target Task:

Infer Target Task from scratch:

Infer adapted Target Task :

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
algorithms		algorithms
configs		configs
figures		figures
images		images
network		network
unreal_envs		unreal_envs
util		util
Readme.md		Readme.md
Report.pdf		Report.pdf
adda_main.py		adda_main.py
aux_functions.py		aux_functions.py
main.py		main.py
requirements.txt		requirements.txt

hemanthkandula/Drone-Navigation-Domain-Adaption-Deep-RL

Folders and files

Latest commit

History

Repository files navigation

Domain Adaptation in Unmanned Aerial Vehicles Navigation and Obstacle Avoidance using Deep Reinforcement Learning

Framework

Environment

Demo

Source task trained from scratch:

Target Task trained from scratch:

Target Task adapted from source:

Report

Presentation

Running the code

Requirements

Training Source Task:

Training Target Task:

Collecting Source Task Dataset for adaption :

Collecting Target Task Dataset for adaption :

Domain Adaption:

Retraining policy and value function Target Task:

Infer Target Task from scratch:

Infer adapted Target Task :

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages