I do understand backpropagation in policy gradient networks, but I am not sure how your code works with Keras's auto-differentiation.
That is, how you transform it into a supervised learning problem.
For example, the code below:
Y = self.probs + self.learning_rate * np.squeeze(np.vstack([gradients]))
Why is Y not a one-hot vector for the action taken?
As I understand it, you compute the gradient as if the action taken were correct (i.e. as if Y were a one-hot vector), then multiply it by the reward for the corresponding time-step, and during training you feed the result to Keras as the correction.
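To check my reading, here is a minimal sketch of how I think Y ends up being constructed for a single time-step (the names `one_hot`, `probs`, and `discounted_reward` are my own placeholders, not necessarily the variables in pg.py):

```python
import numpy as np

# Toy numbers for one time-step with 2 actions.
probs = np.array([0.6, 0.4])      # network output pi(a|s)
action = 1                        # action actually sampled
discounted_reward = 2.0           # discounted return for this step
learning_rate = 0.01

# "Gradient assuming the action taken was correct": one-hot minus the
# predicted probabilities, scaled by the discounted reward.
one_hot = np.zeros_like(probs)
one_hot[action] = 1.0
gradient = (one_hot - probs) * discounted_reward

# Pseudo-label fed to Keras: the old probabilities nudged in that direction.
Y = probs + learning_rate * gradient
print(Y)  # [0.588, 0.412] -- clearly not a one-hot vector
```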
I think one could instead multiply the rewards by the one-hot vector and feed that in directly.
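For example, something like this sketch of what I mean, assuming a softmax output trained with a categorical cross-entropy loss (`make_targets` and the toy numbers are just my own illustration):

```python
import numpy as np

def make_targets(actions, discounted_rewards, action_size):
    """Build REINFORCE-style pseudo-labels: one-hot actions scaled by returns.

    With a cross-entropy loss, -sum(Y * log(probs)) then reduces to
    -reward * log pi(a|s), which is the policy-gradient objective.
    """
    Y = np.zeros((len(actions), action_size), dtype=np.float32)
    Y[np.arange(len(actions)), actions] = discounted_rewards
    return Y

# Toy usage: 3 time-steps, 2 actions.
actions = np.array([1, 0, 1])
discounted_rewards = np.array([2.0, 1.5, -0.5], dtype=np.float32)
print(make_targets(actions, discounted_rewards, action_size=2))
# then something like: model.train_on_batch(states, Y)
```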
If possible, please clarify my doubt. :)
https://github.com/keon/policy-gradient/blob/master/pg.py#L67