calculating the state value function from state action value function #15

Fjoelsak · 2023-06-02T14:33:33Z

Hi,
I'm a little confused about why you take the Q-value of the best action and use it as the state value. According to the relationship between v and q, the state value should be the average of the Q-values over the actions, weighted by the action probabilities of the policy.
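
To make the relationship concrete: V(s) = sum over a of pi(a|s) * Q(s, a). Below is a minimal sketch in plain Python. The Q-values, the epsilon of 0.3, and all function names are made up for illustration and are not taken from this repository. It shows that the policy-weighted average equals max_a Q(s, a) only when the policy is greedy, and is lower for an epsilon-greedy policy.

```python
# Hypothetical Q-values for a single state with three actions (made-up numbers).
q_values = [1.0, 2.0, 4.0]


def state_value(q, policy):
    """Policy-weighted average: V(s) = sum_a pi(a|s) * Q(s, a)."""
    return sum(p * qa for p, qa in zip(policy, q))


def greedy_policy(q):
    """Put probability 1 on the action with the highest Q-value."""
    best = max(range(len(q)), key=lambda a: q[a])
    return [1.0 if a == best else 0.0 for a in range(len(q))]


def epsilon_greedy_policy(q, epsilon):
    """Spread epsilon uniformly over all actions, the rest goes to the best one."""
    best = max(range(len(q)), key=lambda a: q[a])
    base = epsilon / len(q)
    return [base + (1.0 - epsilon if a == best else 0.0) for a in range(len(q))]


v_max = max(q_values)                                   # value of the best action
v_greedy = state_value(q_values, greedy_policy(q_values))
v_eps = state_value(q_values, epsilon_greedy_policy(q_values, 0.3))

print(v_max, v_greedy, v_eps)
```

With these numbers the greedy value coincides with the maximum, while the epsilon-greedy value comes out at roughly 3.5, so taking the maximum would only be exact if the policy being evaluated is greedy.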
Best regards
