Skip to content

Commit

Permalink
Update docs
Browse files Browse the repository at this point in the history
  • Loading branch information
zuoxingdong committed Sep 4, 2018
1 parent 3efa025 commit c44c009
Showing 1 changed file with 5 additions and 5 deletions.
10 changes: 5 additions & 5 deletions lagom/runner/base_history.py
Original file line number Diff line number Diff line change
Expand Up @@ -83,7 +83,7 @@ def all_returns(self):
Formally, suppose we have all rewards :math:`(r_1, \dots, r_T)`, it computes
.. math::
`G_t = \sum_{i=t}^{T} r_i` for all :math:`t`.
G_t = \sum_{i=t}^{T} r_i, \forall t
.. note::
Expand All @@ -99,7 +99,7 @@ def all_discounted_returns(self):
Formally, suppose we have all rewards :math:`(r_1, \dots, r_T)`, it computes
.. math::
`G_t = \sum_{i=t}^{T} \gamma^{i - t} r_i` for all :math:`t`
G_t = \sum_{i=t}^{T} \gamma^{i - t} r_i, \forall t
.. note::
Expand All @@ -116,7 +116,7 @@ def all_bootstrapped_returns(self):
Formally, suppose we have all rewards :math:`(r_1, \dots, r_T)`, it computes
.. math::
`Q_t = r_t + r_{t+1} + \dots + r_T + V(s_{T+1})`
Q_t = r_t + r_{t+1} + \dots + r_T + V(s_{T+1})
.. note::
Expand All @@ -132,7 +132,7 @@ def all_bootstrapped_discounted_returns(self):
Formally, suppose we have all rewards :math:`(r_1, \dots, r_T)`, it computes
.. math::
`Q_t = r_t + \gamma r_{t+1} + \dots + \gamma^{T - t} r_T + \gamma^{T - t + 1} V(s_{T+1})`
Q_t = r_t + \gamma r_{t+1} + \dots + \gamma^{T - t} r_T + \gamma^{T - t + 1} V(s_{T+1})
.. note::
Expand Down Expand Up @@ -160,7 +160,7 @@ def all_TD(self):
values :math:`(V(s_1), \dots, V(s_T), V(s_{T+1}))`, it computes
.. math::
`\delta_t = r_t + \gamma V(s_{t+1}) - V(s_t)`
\delta_t = r_t + \gamma V(s_{t+1}) - V(s_t)
.. note::
Expand Down

0 comments on commit c44c009

Please sign in to comment.