You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
During a group meeting I was raised a question about the meanings of abbreviations in the demo code of TRFL when I tried to introduce TRFL to my lab members. So I have to ask it here.
It reads:
q_tm1: the action value in the source state of a transition.
a_tm1: the action that was selected in the source state.
What does m1 mean here? I know "q" stands for action value, "t" stands for time step, I tried to figure "m1" stands for what, but it is not so intuitive.
Could you please help me on that? Thanks a lot.
The text was updated successfully, but these errors were encountered:
Dear Deepminder:
During a group meeting I was raised a question about the meanings of abbreviations in the demo code of TRFL when I tried to introduce TRFL to my lab members. So I have to ask it here.
It reads:
What does m1 mean here? I know "q" stands for action value, "t" stands for time step, I tried to figure "m1" stands for what, but it is not so intuitive.
Could you please help me on that? Thanks a lot.
The text was updated successfully, but these errors were encountered: