Data interpretation logged by tensorboard_log #47

HarukiKozukapenguin · 2022-04-15T06:03:41Z

Thank you for the interesting simulator!

I ran run_vision_ppo.py with the following command:
python3 -m python.run_vision_ppo --render 0 --train 1
I found the training output in the envtest/python/saved directory (e.g. PPO_1, PPO_2). It contains the policies saved during training (/policy) and the test trajectories (/TestTraj).
My questions are as follows.

Where is the reward progression during training logged?
What does each axis mean in the graphs under TestTraj/Plots?
Which code defines the plotting and logging parameters?


yun-long · 2022-04-15T13:01:57Z

hi,

the training reward is logged in TensorBoard. you can go to the saved directory and run

tensorboard --logdir=./

the plots in the first row are position [x, y, z] and the plots in the second row are velocity [x, y, z]
the plotting is done here
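
For readers without the linked plotting code at hand, the layout described above (positions in the first row, velocities in the second, one column per axis) could be reproduced roughly as follows. This is only a sketch, not the repository's plotting code: `panel_labels` and `plot_trajectory` are hypothetical names, and plotting against the sample index is an assumption, since the thread does not say what the horizontal axis is.

```python
# Hypothetical layout helper; names are illustrative, not taken from the repository.
QUANTITIES = ["position", "velocity"]  # row 0, row 1
AXES = ["x", "y", "z"]                 # columns 0, 1, 2


def panel_labels():
    """Return a 2x3 grid of panel titles matching the described layout."""
    return [[f"{quantity} {axis}" for axis in AXES] for quantity in QUANTITIES]


def plot_trajectory(pos, vel, out_path):
    """Plot pos and vel (each a list of [x, y, z] samples) on a 2x3 grid.

    The horizontal axis is the sample index (an assumption).
    matplotlib is imported lazily so panel_labels() works without it.
    """
    import matplotlib
    matplotlib.use("Agg")  # render to a file, no display needed
    import matplotlib.pyplot as plt

    labels = panel_labels()
    fig, axes = plt.subplots(2, 3, figsize=(12, 6), sharex=True)
    for row, series in enumerate((pos, vel)):
        for col in range(3):
            axes[row][col].plot([sample[col] for sample in series])
            axes[row][col].set_title(labels[row][col])
    fig.savefig(out_path)
    plt.close(fig)
```

Calling `plot_trajectory(pos, vel, "traj.png")` with two equally long lists of `[x, y, z]` samples writes a single image with six panels.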

HarukiKozukapenguin · 2022-04-15T14:42:07Z

In which directory should I run this command (tensorboard --logdir=./)? And should I run it before I run python3 -m python.run_vision_ppo --render 0 --train 1?

HarukiKozukapenguin · 2022-04-16T08:03:29Z

I moved to the saved/PPO_(num) directory and ran tensorboard --logdir=./, and now I can see the progression of each reward.
Thank you!
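
With several runs in saved (PPO_1, PPO_2, ...), it can help to list them in numeric order and build the TensorBoard command for one of them. The helper below is a minimal sketch using only the standard library; `list_runs` and `tensorboard_command` are hypothetical names, and the only thing assumed about the layout is the PPO_<number> directory naming mentioned in this thread.

```python
import re
from pathlib import Path


def list_runs(saved_dir):
    """Return the PPO_<n> run directories under saved_dir, sorted by n."""
    runs = []
    for child in Path(saved_dir).iterdir():
        match = re.fullmatch(r"PPO_(\d+)", child.name)
        if child.is_dir() and match:
            runs.append((int(match.group(1)), child))
    return [path for _, path in sorted(runs)]


def tensorboard_command(run_dir):
    """Build the TensorBoard invocation for one run directory."""
    return ["tensorboard", f"--logdir={run_dir}"]
```

For example, `subprocess.run(tensorboard_command(list_runs("saved")[-1]))` would start TensorBoard on the highest-numbered run. TensorBoard only reads the event files, so it can be started either during or after training.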

HarukiKozukapenguin · 2022-04-18T09:03:15Z

@yun-long

I have one question about interpreting TensorBoard.
I can see how the rewards change while the agent learns in the simulation, but I do not know what they mean.

I think the reward is the sum of the rewards over each episode. Is that correct?
Are these rewards from when the agent trains, or from when the agent is evaluated?

yun-long · 2022-04-18T11:00:52Z

Hi,

the reward you see on TensorBoard is from here.

In summary, it contains the summed reward and each individual reward component. The reward is a training reward, not an evaluation reward.
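
One plausible way to read "the summed reward and each individual reward component" is sketched below: each finished training episode carries its accumulated components, and the logged values are averages over those episodes. This is an illustration under that assumption, not the linked code. `summarize_rewards` and the component names `progress` and `collision` are hypothetical, and the thread does not confirm how the values are accumulated or averaged.

```python
def summarize_rewards(episodes):
    """Average each reward component, and their sum, over finished episodes.

    episodes: list of dicts mapping a component name to the value
    accumulated over one episode (hypothetical structure).
    """
    if not episodes:
        return {}
    names = sorted(episodes[0])
    count = len(episodes)
    summary = {
        name: sum(episode[name] for episode in episodes) / count
        for name in names
    }
    # The total is the per-episode sum of all components, averaged the same way.
    summary["total"] = sum(
        sum(episode[name] for name in names) for episode in episodes
    ) / count
    return summary
```

Under this reading, the total curve always equals the sum of the component curves, which gives a quick sanity check when inspecting the plots.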

HarukiKozukapenguin · 2022-04-18T12:12:06Z

@yun-long
Which code writes these rewards to TensorBoard?

yun-long · 2022-04-20T20:35:03Z

hi

the code is here
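
As a rough illustration of what such logging code tends to look like, the sketch below writes each named reward value as one scalar per training step. It is not the linked code: `log_rewards` and the `rewards/` tag prefix are hypothetical, and the writer is assumed to be any object exposing `add_scalar(tag, value, step)`.

```python
def log_rewards(writer, rewards, step, prefix="rewards"):
    """Write each named reward value as one scalar under prefix/name.

    writer: any object with an add_scalar(tag, value, step) method.
    rewards: dict mapping a reward name to a number.
    Returns the list of tags written, in sorted order.
    """
    tags = []
    for name, value in sorted(rewards.items()):
        tag = f"{prefix}/{name}"
        writer.add_scalar(tag, value, step)
        tags.append(tag)
    return tags
```

`torch.utils.tensorboard.SummaryWriter` provides an `add_scalar(tag, scalar_value, global_step)` method that fits this shape, so an instance created with the run directory as its log directory could be passed as `writer`. Each tag then appears as its own curve in TensorBoard.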

HarukiKozukapenguin · 2022-04-21T06:16:41Z

Thank you!

HarukiKozukapenguin closed this as completed Apr 16, 2022

HarukiKozukapenguin reopened this Apr 18, 2022

HarukiKozukapenguin closed this as completed Apr 21, 2022
