
ijang-rl/reinforcement-learning-TX2

Multiagent Reinforcement Learning in Nvidia TX2

May 27, 2019

Step 1. Flash an OS and components

Flash the L4T and components

The NVIDIA Jetson TX2 can be flashed with JetPack 4.2, which includes:

  • L4T R32.1 (an Ubuntu 18.04-based 64-bit (aarch64) variant)
  • CUDA 10.0
  • cuDNN 7.3.1
  • TensorRT 5.0.6
  • OpenCV 3.3.1

JetPack is available here: https://developer.nvidia.com/embedded/jetpack

JetPack installation guide is available here: https://developer.ridgerun.com/wiki/index.php?title=Installing_JetPack_4.2_-_Nvidia_SDK_Manager

Add a swap file

The TX2 has 8 GB of unified memory shared between the CPU and GPU, so out-of-memory errors may occur while training a model. To mitigate this, add swap space as follows:

$ cd ~
$ fallocate -l 8G swapfile        # Create an 8G swap file
$ chmod 600 swapfile              # Change permissions
$ ls -lh swapfile                 # List out the file
$ mkswap swapfile                 # Set up the Linux swap area
$ sudo swapon swapfile            # Now start using the swapfile
$ swapon -s                       # Show that it's now being used
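The swap file created above is lost on reboot. One way to make it permanent is to add an entry to /etc/fstab. This is a sketch, not TX2-specific; the path /home/nvidia/swapfile is an assumption (nvidia is the default user on the TX2), so adjust it to wherever your swap file actually lives:

```shell
# Hypothetical path -- adjust to your own home directory.
# Appends an fstab entry so the swap file is activated on every boot.
echo '/home/nvidia/swapfile none swap sw 0 0' | sudo tee -a /etc/fstab
```

After rebooting, `swapon -s` should list the file again without any manual steps.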

How to add and delete swap space in Ubuntu: see here.

Swap memory in TX2 with a SSD: see here.
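To confirm from a running process that the new swap space is actually visible, the standard Linux /proc/meminfo interface can be read. This is a generic sketch (nothing TX2-specific) and the helper name `meminfo` is ours:

```python
def meminfo():
    """Parse /proc/meminfo into a dict of {key: value-in-kB}."""
    info = {}
    with open("/proc/meminfo") as f:
        for line in f:
            key, value = line.split(":", 1)
            info[key] = int(value.split()[0])  # values are reported in kB
    return info

m = meminfo()
print(f"RAM:  {m['MemTotal'] // 1024} MiB")
print(f"Swap: {m['SwapTotal'] // 1024} MiB")
```

On a TX2 with the 8G swap file enabled, SwapTotal should report roughly 8 GiB.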

Change the performance mode of TX2

$ sudo nvpmodel -m 0             # Mode 0 (MAX-N) is the maximum-performance mode

About NVPmodel: https://www.jetsonhacks.com/2017/03/25/nvpmodel-nvidia-jetson-tx2-development-kit/

Step 2. Install TensorFlow

To install TensorFlow in TX2, we can follow this installation guide.

For Python 3.6 + JetPack 4.2:

$ pip3 install --extra-index-url https://developer.download.nvidia.com/compute/redist/jp/v42 tensorflow-gpu==1.13.1+nv19.5 --user

TF installation for other Python + JetPack combinations: see here.

Step 3. MAgent

Now, we will run MAgent (many-agent reinforcement learning) on the TX2. The paper is available here: https://arxiv.org/abs/1712.00600

MAgent's baseline algorithms are parameter-sharing DQN, DRQN, and A2C, implemented in TensorFlow. Among them, DQN shows the best performance in large-population gridworld settings.
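As a rough illustration of the parameter-sharing idea (a hypothetical sketch, not MAgent's actual implementation): every agent selects actions from and updates one shared value function, so the number of learned parameters stays constant as the agent population grows. A tabular version:

```python
import random
from collections import defaultdict

N_ACTIONS = 4

# One Q-table shared by ALL agents: state -> list of action values.
shared_q = defaultdict(lambda: [0.0] * N_ACTIONS)

def act(state, epsilon=0.1):
    """Epsilon-greedy action selection from the shared table."""
    if random.random() < epsilon:
        return random.randrange(N_ACTIONS)
    q = shared_q[state]
    return q.index(max(q))

def update(state, action, reward, next_state, alpha=0.5, gamma=0.9):
    """One Q-learning step; every agent's experience updates the same table."""
    target = reward + gamma * max(shared_q[next_state])
    shared_q[state][action] += alpha * (target - shared_q[state][action])

# Two agents observing the same transition train the same parameters.
update("s0", 1, 1.0, "s1")
update("s0", 1, 1.0, "s1")
print(shared_q["s0"][1])  # 0.75 after the two updates
```

MAgent's DQN replaces the table with a neural network whose weights are likewise shared across all agents of a group.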

Git clone

$ git clone https://github.com/geek-ai/MAgent.git
$ cd MAgent

Install dependencies

$ sudo apt-get install cmake libboost-system-dev libjsoncpp-dev libwebsocketpp-dev

Build MAgent

$ bash build.sh
$ export PYTHONPATH=$(pwd)/python:$PYTHONPATH
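The export above only lasts for the current shell session. To make it persistent, the same line can be appended to ~/.bashrc; the path assumes MAgent was cloned into the home directory, so adjust it if yours lives elsewhere:

```shell
# Hypothetical clone location: $HOME/MAgent. Guard against duplicate entries.
line='export PYTHONPATH=$HOME/MAgent/python:$PYTHONPATH'
grep -qxF "$line" ~/.bashrc || echo "$line" >> ~/.bashrc
```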

Run examples

NOTE: You have to run the following examples from the root of the MAgent repository. DO NOT cd into examples/.

Train

  • pursuit
$ python examples/train_pursuit.py --train
  • gathering
$ python examples/train_gather.py --train
  • battle
$ python examples/train_battle.py --train

Play

  • battle game
$ python examples/show_battle_game.py

Note

MARL paper collection: https://github.com/LantaoYu/MARL-Papers

RL Gitbook (in Korean): https://dnddnjs.gitbooks.io/rl/content/

RL blog (by Jay Yang): https://jay.tech.blog/category/machine-learning/reinforcement-learning/
