A major (performance) update on the submodule: srl-zoo; fixes several issues #50

ncble · 2019-06-13T12:19:38Z

Since the srl_zoo is less popular than robotic-rl-srl, I decided to post the changelog here:

Highlights

2~5 times speed-up (overall) (srl_zoo) compared to the current version of origin/master
Better SRL training mechanism (more intuitive, better modularity) that support sophisticated update (e.g. GAN)
Add GAN to srl_zoo
Scalable SRL models (support any image shape, see the detail below)
~9 times faster DataLoader which is also simpler, since it's natively supported by pytorch.
Remove several redundant codes to speed-up training.
Fixes several issues of (robotics-rl_srl):
[NEW 19/7/2019] 2~7 times speed-up of RL training (for 2D environments) compared to the current version of origin/master
[NEW 19/7/2019] Add new environment Labyrinth and MobileRobotX which can run at speed 36,000 FPS (frame-per-second) on 10 threads of Intel CPU i9-9900K (image resolution 128², 20,000 FPS for 224²). Compared to previous MobileRobotGymEnv-v0, it only runs at speed 800 FPS.

Fixes #41
Fixes #42
Fixes #43
Fixes #46
Fixes #47
Fixes #48
Fixes #49
Fixes #51
Fixes #53
Fixes #54

Note: Before, we need to modify 8 scripts in order to add one new model, now only two scripts (at most three): models/modules.py and models/my_custom_model.py (see the template models/new_model_template.py)

Changelog

SRL part

Support any image resolution for the entire toolbox (--img-shape="(3,128,128)"), including the DataLoader, Environment, SRL models. Before, the SRL models are not scalable with respect to image shape, and it's not sufficient to modify only the input shape (e.g. need to manually calculate each layers' shape, size, etc). Now, all models function more like keras.
Support adversarial state representation learning. (e.g. GAN)
[New scripts] models/base_trainer.py, models/new_model_template.py, models/gan.py
- base_trainer.py: new trainig pipeline for better modularity.
- new_model_template.py: a simple example of "how to add new model to srl_zoo".
- gan.py: adversarial state representation learning.
Better (simpler, ~10 times faster) plots
Support new monitor mode (--monitor) "loss" (before, there is only "pbar" progressbar): monitor losses during training, calculate GTC per epoch.
Support control of number of CPU for dataloader (--num-worker).
Support "anytime training": load the previous trained SRL model weights to continue the training --srl-pre-weights (weights path)
Change validation mechanism to the classic one (i.e. within one epoch: train then valid). Before, we alternate between training and validation mode at batch level.
Support specific GPU number. (by --gpu-num=0, --gpu_num1, etc)
[Remove]: preprocessing/preprocess.py (it's useless), models/custom_layers.py
[Rename] the models/models.py is renamed to models/base_models.py, since it's more intuitive for the outsider. Currently, there are several confusing names "custom_layers.py", "modules.py", "learner.py" "models.py

RL part

support any image shape.
support specific GPU number. (by --gpu-num=0, --gpu_num1, etc)
support --srl-model-path indicate the SRL model weights path. Before, we can only load either the latest (by calling --latest) or manually change the config/srl_model.yaml model weights path.
register new srl models
Add new environments (extremely fast, about 20 times faster than all current environment): Labyrinth-v0 and MobileRobotX-v0 (enable interactive play) which can run at speed 36,000 FPS (frame-per-second) on 10 threads of Intel CPU i9-9900K (image resolution 128², 20,000 FPS for 224²). Compared to previous MobileRobotGymEnv-v0, it only runs at speed 800 FPS.
2~7 times speed-up of RL training (for 2D environments)
fix issues:
- image rotated by 90 degrees
- --log-folder folder doesn't exist.
- several issues in environments
[New] replay/plot_pipeline.py: aggregate all losses and plot on one figure. (draft code to be refined, merged with replay/aggregate_plots.py, compare_plots.py, gather_results.py)

…to adv_srl

…vs other than mobile_robot_env.py need to be modify)

…to adv_srl

…ure)

…pybullet)

…MobileRobotX

…future)

… future)

ncble · 2019-08-09T09:30:35Z

The original version (master) of the whole Toolbox has 18026 lines of code (including srl_zoo). My pull-request has already added and removed +5886/-249, +4145/-1453 (total +10031/-1702 ?) lines of code, thus more than 40% lines of code have been modified. There is no need to be merged to the master branch.

ncble added 30 commits May 17, 2019 15:21

fix issue: image rotated by 90 degrees

853d53d

fix issue: pb with old gym.space.prng(seed)

92ec263

fix issue (srl_zoo): image rotated by 90 degrees

0972b8a

resolve submodules commit

20874b8

include submodules

26006cb

merge

8457456

fix issue: sac/deepq/ddpg have no --policy arg

f32793e

change branch

0ca9326

pass image shape to the input of SRLNeuralNetwork

63d335e

add gan to srl-zoo

1a1c289

update srl_zoo

96ceb4c

merge with new master (1ab1bd3)

6679d77

update srl_zoo

be06349

update gitignore

8ddcac6

Merge branch 'adv_srl' of https://github.com/ncble/robotics-rl-srl in…

be996b0

…to adv_srl

Omnirobot requires python=2.7 incompatible with current python env

bad2039

full update for Omnirobot (new env)

dfe18d4

fix training bug (valid set is used during training) in srl-zoo

5d94eb5

new feature: add image shape to args for RL training

476bc82

add 'img_shape' to rl_baselines/train.py|replay/enjoy_baseline.py (en…

e2f1cb7

…vs other than mobile_robot_env.py need to be modify)

add img_shape to all envs

e72eb6e

update srl-zoo

8dd43f9

multi-runs: raw pixels with different resolutions

695e5fd

Merge branch 'adv_srl' of https://github.com/ncble/robotics-rl-srl in…

8f77ecf

…to adv_srl

update srl_zoo

1c548c7

update srl_zoo

0a9f5d3

update srl_zoo(ASRL)

3b724e4

add newsrl_model gan and unet(temporary)

735d4a1

update srl_zoo

b0b405e

new feature: enable srl model weights path in terminal

5313117

ncble added 7 commits July 5, 2019 15:47

register new srl model

d7366ed

register new srl model for omnirobot

e604cb6

update srl_zoo (resnet gan)

c41eb5e

merge srl_zoo (debug2_tempo)

e55278b

merge debug2_tempo

212ee8f

register new srl model (this registration should be remove in the fut…

cd55b93

…ure)

register new srl model: this should be removed in the future

9a9bbc2

ncble mentioned this pull request Jul 16, 2019

[bug report (Environment)] RELATIVE_POS=False doesn't work as expected #54

Open

ncble changed the title ~~A major (performance) update on the submodule: srl-zoo; fix several issues #41~43, #46~49.~~ A major (performance) update on the submodule: srl-zoo; fixes several issues Jul 16, 2019

ncble added 19 commits July 16, 2019 18:34

add Labyrinth environment

9b36e90

add mobile_robot_extreme environment

ecc093e

minor bugs

24f0c11

discret reward (gt reward) for srl training

4ce10bd

remove ugly self._reward, self._termination functions (cause bugs)

80fdd49

add new environment MobileRobotX (extreme fast, 20 times faster than …

c694749

…pybullet)

add random target to Labyrinth; fix color MobileX; add new env

8633608

quick fix for spcls num_classes (we should use new class design)

b38c958

enable omnirobot and any triplet imgs (background, robot, target) in …

a72fe0d

…MobileRobotX

add new srl model (this registration really should be removed in the …

77675da

…future)

Merge remote-tracking branch 'origin/labyrinth'

a6a35cf

change some params in MobileX

4814ea5

register color for post-processing

60887c8

clean code for 'supervised' learning

02371a6

register debug; clean code

3a3830b

add envs Labyrinth-v1 v2 v3

af82fd3

use png instead of jpg/jpeg extension for saving images

1361bc5

enable any image shape for dataset_generator.py

4ba094c

clean config/sr_models.yaml (this config file should be remove in the…

71243d9

… future)

ncble closed this Aug 9, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A major (performance) update on the submodule: srl-zoo; fixes several issues #50

A major (performance) update on the submodule: srl-zoo; fixes several issues #50

ncble commented Jun 13, 2019 •

edited

ncble commented Aug 9, 2019

A major (performance) update on the submodule: srl-zoo; fixes several issues #50

A major (performance) update on the submodule: srl-zoo; fixes several issues #50

Conversation

ncble commented Jun 13, 2019 • edited

Highlights

Changelog

SRL part

RL part

ncble commented Aug 9, 2019

ncble commented Jun 13, 2019 •

edited