DDPG with Auxiliary Rewards:
State -> Actor -> Action
State, Action -> Critic -> Q(State, Action)
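The two forward passes above can be sketched with plain numpy; the layer sizes, weight scales, and tanh nonlinearity are hypothetical stand-ins, not the original architecture:

```python
import numpy as np

rng = np.random.default_rng(0)
STATE_DIM, ACTION_DIM, HIDDEN = 4, 2, 8   # hypothetical dimensions

# Hypothetical linear-layer weights standing in for the Actor and Critic networks.
W_actor_hidden = rng.normal(size=(STATE_DIM, HIDDEN)) * 0.1
W_actor_out = rng.normal(size=(HIDDEN, ACTION_DIM)) * 0.1
W_critic = rng.normal(size=(STATE_DIM + ACTION_DIM, 1)) * 0.1

def actor(state):
    # State -> hidden representation -> Action (tanh keeps actions bounded)
    hidden = np.tanh(state @ W_actor_hidden)
    return np.tanh(hidden @ W_actor_out), hidden

def critic(state, action):
    # (State, Action) -> scalar Q(State, Action)
    return np.concatenate([state, action]) @ W_critic

state = rng.normal(size=STATE_DIM)
action, hidden = actor(state)
q = critic(state, action)
```

The actor's `hidden` activation is returned alongside the action because the auxiliary-reward branch below consumes it.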
Aux_reward_i:
State -> Actor (intermediate layer) -> lower-level representation of the state (LRS)
LRS -> aux_reward_module_i -> Aux_reward_i
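The auxiliary branch can be sketched as one small head per auxiliary task, each reading the actor's lower-level representation (LRS) rather than the raw state. The head weights, `HIDDEN` size, and `N_AUX` count are hypothetical:

```python
import numpy as np

rng = np.random.default_rng(1)
HIDDEN, N_AUX = 8, 3   # hypothetical LRS width and number of aux modules

# Hypothetical per-task heads: each aux_reward_module_i is a linear map
# from the LRS to a scalar auxiliary reward.
aux_heads = [rng.normal(size=(HIDDEN, 1)) * 0.1 for _ in range(N_AUX)]

def aux_rewards(lrs):
    # LRS -> aux_reward_module_i -> Aux_reward_i, one scalar per module
    return [float(lrs @ W) for W in aux_heads]

lrs = np.tanh(rng.normal(size=HIDDEN))   # stand-in for the actor's hidden layer
rewards = aux_rewards(lrs)
```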
Critic update: mean_square_loss(Q, Q_obs) -> Critic -> (State, Action)  [the loss gradient flows back through the Critic]
Actor update: -Q -> Critic -> Action -> Actor -> State  [the gradient of -Q flows through the Critic's action input into the Actor]
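The two gradient flows above can be sketched with linear networks and hand-derived gradients; the learning rate, step count, and the fixed scalar `q_target` (standing in for the bootstrapped target r + gamma * Q'(s', a')) are all hypothetical simplifications:

```python
import numpy as np

rng = np.random.default_rng(2)
S, A, LR = 4, 2, 0.02   # hypothetical dims and learning rate

W_actor = rng.normal(size=(S, A)) * 0.1        # hypothetical linear actor
W_critic = rng.normal(size=(S + A, 1)) * 0.1   # hypothetical linear critic

state = rng.normal(size=S)
q_target = 1.0   # stand-in for the bootstrapped target from the target networks

action = state @ W_actor
q_init = float(np.concatenate([state, action]) @ W_critic)

for _ in range(100):
    action = state @ W_actor
    sa = np.concatenate([state, action])
    q = float(sa @ W_critic)

    # Critic update: gradient of mean_square_loss(Q, Q_target) w.r.t. W_critic
    dW_critic = 2.0 * (q - q_target) * sa[:, None]
    W_critic -= LR * dW_critic

    # Actor update: the gradient of -Q flows through the critic's action input
    # (dQ/dAction = the critic weights on the action block), then into the actor.
    dQ_dA = W_critic[S:, 0]
    W_actor -= LR * (-np.outer(state, dQ_dA))   # gradient ascent on Q

action = state @ W_actor
q = float(np.concatenate([state, action]) @ W_critic)
```

With both updates running jointly, the critic tracks the fixed target while the actor pushes the action in the direction that increases Q, which is the same coupled dynamic DDPG uses with full networks and target copies.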