Rewards
=======

:class:`RewardFunction` classes convert observations of an agent's state into a numerical reward. The reward tells the policy how effective its actions were, so that it can learn and make better decisions in the future.

:class:`RewardFunction` classes are a fully optional feature of Phantom. There is no functional difference between defining a :meth:`compute_reward()` method on an Agent and attaching a :class:`RewardFunction` to the Agent; the :meth:`reward()` method of the attached class performs the same computation.
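
The equivalence described above can be sketched as follows. This is a minimal, self-contained illustration and does not import Phantom: the ``RewardFunction`` and ``Agent`` classes here are simplified stand-ins, and ``InventoryPenalty``, ``InlineAgent``, and the dictionary-based ``ctx`` argument are hypothetical names chosen for the example. The real signatures may differ, so consult the class reference below before adapting it.

```python
from abc import ABC, abstractmethod
from typing import Optional


class RewardFunction(ABC):
    """Stand-in base class (assumption): maps a context to a scalar reward."""

    @abstractmethod
    def reward(self, ctx: dict) -> float:
        ...


class InventoryPenalty(RewardFunction):
    """Hypothetical reward: penalise the agent for holding inventory."""

    def reward(self, ctx: dict) -> float:
        return -abs(ctx["inventory"])


class Agent:
    """Stand-in agent (assumption) that delegates to an attached reward function."""

    def __init__(self, reward_function: Optional[RewardFunction] = None):
        self.reward_function = reward_function

    def compute_reward(self, ctx: dict) -> float:
        if self.reward_function is None:
            raise NotImplementedError(
                "override compute_reward() or attach a RewardFunction"
            )
        return self.reward_function.reward(ctx)


class InlineAgent(Agent):
    """Option 1: define the reward logic directly on the agent."""

    def compute_reward(self, ctx: dict) -> float:
        return -abs(ctx["inventory"])


# Option 2: keep the agent generic and attach a reusable reward function.
attached_agent = Agent(reward_function=InventoryPenalty())

ctx = {"inventory": -3}
inline_reward = InlineAgent().compute_reward(ctx)
attached_reward = attached_agent.compute_reward(ctx)
```

Both agents compute the same reward for the same context. The attached form is mainly useful when one reward definition should be shared across several agent types.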

Base RewardFunction
-------------------

.. autoclass:: phantom.reward_functions.RewardFunction
   :inherited-members:

Provided Implementations
------------------------

.. autoclass:: phantom.reward_functions.Constant
   :inherited-members:
