ENH: new risk metrics API

quantopian · Feb 26, 2018 · 1f50b1b
1 parent 63e5ab3
Showing 68 changed files with 6,975 additions and 5,807 deletions.
diff --git a/.gitignore b/.gitignore
@@ -73,4 +73,6 @@ zipline.iml
 
 TAGS
 
-.ipynb_checkpoints/
+.ipynb_checkpoints/
+
+.gdb_history
diff --git a/docs/source/appendix.rst b/docs/source/appendix.rst
@@ -362,6 +362,9 @@ Readers
 .. autoclass:: zipline.data.data_portal.DataPortal
    :members:
 
+.. autoclass:: zipline.sources.benchmark_source.BenchmarkSource
+   :members:
+
 Bundles
 ```````
 .. autofunction:: zipline.data.bundles.register
@@ -375,11 +378,77 @@ Bundles
 .. data:: zipline.data.bundles.bundles
 
    The bundles that have been registered as a mapping from bundle name to bundle
-   data. This mapping is immutable and should only be updated through
+   data. This mapping is immutable and may only be updated through
    :func:`~zipline.data.bundles.register` or
    :func:`~zipline.data.bundles.unregister`.
 
 
+Risk Metrics
+~~~~~~~~~~~~
+
+Algorithm State
+```````````````
+
+.. autoclass:: zipline.finance.ledger.Ledger
+   :members:
+
+.. autoclass:: zipline.protocol.Portfolio
+   :members:
+
+.. autoclass:: zipline.protocol.Account
+   :members:
+
+.. autoclass:: zipline.finance.ledger.PositionTracker
+   :members:
+
+.. autoclass:: zipline.finance._finance_ext.PositionStats
+
+Built-in Metrics
+````````````````
+
+.. autoclass:: zipline.finance.metrics.metric.SimpleLedgerField
+
+.. autoclass:: zipline.finance.metrics.metric.DailyLedgerField
+
+.. autoclass:: zipline.finance.metrics.metric.StartOfPeriodLedgerField
+
+.. autoclass:: zipline.finance.metrics.metric.Returns
+
+.. autoclass:: zipline.finance.metrics.metric.BenchmarkReturnsAndVolatility
+
+.. autoclass:: zipline.finance.metrics.metric.CashFlow
+
+.. autoclass:: zipline.finance.metrics.metric.Orders
+
+.. autoclass:: zipline.finance.metrics.metric.Transactions
+
+.. autoclass:: zipline.finance.metrics.metric.Positions
+
+.. autoclass:: zipline.finance.metrics.metric.ReturnsStatistic
+
+.. autoclass:: zipline.finance.metrics.metric.AlphaBeta
+
+.. autoclass:: zipline.finance.metrics.metric.MaxLeverage
+
+Metrics Sets
+````````````
+
+.. autofunction:: zipline.finance.metrics.register
+
+.. autofunction:: zipline.finance.metrics.load
+
+.. autofunction:: zipline.finance.metrics.unregister
+
+.. data:: zipline.finance.metrics.metrics_sets
+
+   The metrics sets that have been registered as a mapping from metrics set name
+   to load function. This mapping is immutable and may only be updated through
+   :func:`~zipline.finance.metrics.register` or
+   :func:`~zipline.finance.metrics.unregister`.
+
+
 Utilities
 ~~~~~~~~~
 

diff --git a/docs/source/conf.py b/docs/source/conf.py
@@ -27,7 +27,7 @@
 
 # -- Docstrings ---------------------------------------------------------------
 
-extensions += ['numpydoc']
+extensions += ['sphinx.ext.napoleon']
 numpydoc_show_class_members = False
 
 # Add any paths that contain templates here, relative to this directory.

diff --git a/docs/source/index.rst b/docs/source/index.rst
@@ -7,6 +7,7 @@
    beginner-tutorial
    bundles
    trading-calendars
+   risk-and-perf-metrics
    development-guidelines
    appendix
    release-process

diff --git a/docs/source/risk-and-perf-metrics.rst b/docs/source/risk-and-perf-metrics.rst
@@ -0,0 +1,284 @@
+.. _metrics:
+
+Risk and Performance Metrics
+----------------------------
+
+The risk and performance metrics are summary values calculated by Zipline
+when running a simulation. These metrics can be about the performance of an
+algorithm, like returns or cash flow, or the riskiness of an algorithm, like
+volatility or beta. Metrics may be reported minutely, daily, or once at the end
+of a simulation. A single metric may choose to report at multiple time-scales
+where appropriate.
+
+Metrics Sets
+~~~~~~~~~~~~
+
+Zipline groups risk and performance metrics into collections called "metrics
+sets". A single metrics set defines all of the metrics to track during a single
+backtest. A metrics set may contain metrics that report at different time
+scales. The default metrics set will compute a host of metrics, such as
+algorithm returns, volatility, Sharpe ratio, and beta.
+
+Selecting the Metrics Set
+~~~~~~~~~~~~~~~~~~~~~~~~~
+
+When running a simulation, the user may select the metrics set to report. How
+you select the metrics set depends on the interface being used to run the
+algorithm.
+
+Command Line and IPython Magic
+``````````````````````````````
+
+When running with the command line or IPython magic interfaces, the metrics set
+may be selected by passing the ``--metrics-set`` argument. For example:
+
+.. code-block:: bash
+
+   $ zipline run algorithm.py -s 2014-01-01 -e 2014-02-01 --metrics-set my-metrics-set
+
+``run_algorithm``
+`````````````````
+
+When running through the :func:`~zipline.run_algorithm` interface, the metrics
+set may be passed with the ``metrics_set`` argument. This may either be the name
+of a registered metrics set, or a set of metric objects. For example:
+
+.. code-block:: python
+
+   run_algorithm(..., metrics_set='my-metrics-set')
+   run_algorithm(..., metrics_set={MyMetric(), MyOtherMetric(), ...})
+
+Running Without Metrics
+~~~~~~~~~~~~~~~~~~~~~~~
+
+Computing risk and performance metrics is not free, and contributes to the total
+runtime of a backtest. When actively developing an algorithm, it is often
+helpful to skip these computations to speed up the debugging cycle. To disable
+the calculation and reporting of all metrics, users may select the built-in
+metrics set ``none``. For example:
+
+.. code-block:: bash
+
+   $ zipline run algorithm.py -s 2014-01-01 -e 2014-02-01 --metrics-set none
+
+Defining New Metrics
+~~~~~~~~~~~~~~~~~~~~
+
+A metric is any object that implements some subset of the following methods:
+
+- ``start_of_simulation``
+- ``end_of_simulation``
+- ``start_of_session``
+- ``end_of_session``
+- ``end_of_bar``
+
+These functions will be called at the time indicated by their name, at which
+point the metric object may collect any needed information and optionally report
+a computed value. If a metric does not need to do any processing at one of these
+times, it may omit a definition for the given method.
+
+A metric should be reusable: a single instance of a metric class should work
+across multiple backtests. Metrics do not need to support multiple simulations
+at once, so internal caches and data only need to stay consistent between
+``start_of_simulation`` and ``end_of_simulation``.
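The idea that a metric implements only a subset of the hooks can be sketched as follows. This is a minimal illustration, not Zipline's dispatch code: `SessionCounter` and `call_hook` are hypothetical names, and the `None` arguments stand in for the real ledger, calendar, and data portal objects.

```python
class SessionCounter:
    """Hypothetical metric: counts the sessions completed in one simulation."""

    def start_of_simulation(self, ledger, emission_rate, trading_calendar,
                            sessions, benchmark_source):
        # Reset state here so the same instance can be reused across backtests.
        self._count = 0

    def end_of_session(self, packet, ledger, session_label, session_ix,
                       data_portal):
        self._count += 1
        packet['cumulative_perf']['sessions_completed'] = self._count


def call_hook(metric, name, *args):
    """Illustrative stand-in for a dispatcher: skip hooks the metric omits."""
    hook = getattr(metric, name, None)
    if hook is not None:
        hook(*args)


metric = SessionCounter()
packet = {'daily_perf': {}, 'cumulative_perf': {}}

call_hook(metric, 'start_of_simulation', None, 'daily', None, [], None)
call_hook(metric, 'start_of_session', None, None, None)  # not defined: no-op
call_hook(metric, 'end_of_session', packet, None, None, 0, None)
```

The metric defines two of the five methods; the other three are simply absent rather than written as empty stubs.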
+
+``start_of_simulation``
+```````````````````````
+
+The ``start_of_simulation`` method should be thought of as a per-simulation
+constructor. This method should initialize any caches needed for the duration of
+a single simulation.
+
+The ``start_of_simulation`` method should have the following signature:
+
+.. code-block:: python
+
+   def start_of_simulation(self,
+                           ledger,
+                           emission_rate,
+                           trading_calendar,
+                           sessions,
+                           benchmark_source):
+       ...
+
+``ledger`` is an instance of :class:`~zipline.finance.ledger.Ledger` which is
+maintaining the simulation's state. This may be used to look up the algorithm's
+starting portfolio values.
+
+``emission_rate`` is a string representing the smallest frequency at which
+metrics should be reported. ``emission_rate`` will be either ``minute`` or
+``daily``. When ``emission_rate`` is ``daily``, ``end_of_bar`` will not be
+called at all.
+
+``trading_calendar`` is an instance of
+:class:`~zipline.utils.calendars.TradingCalendar` which is the trading calendar
+being used by the simulation.
+
+``sessions`` is a :class:`pandas.DatetimeIndex` which holds the session
+labels, in sorted order, that the simulation will execute.
+
+``benchmark_source`` is an instance of
+:class:`~zipline.sources.benchmark_source.BenchmarkSource` which is the
+interface to the returns of the benchmark specified by
+:func:`~zipline.api.set_benchmark`.
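A rough sketch of treating `start_of_simulation` as a per-simulation constructor is shown below. `PerSessionCache` is a hypothetical metric, and a plain list of strings stands in for the `pandas.DatetimeIndex` of sessions (only `len()` is used, which both support).

```python
class PerSessionCache:
    """Hypothetical metric that keeps one cache slot per session."""

    def start_of_simulation(self, ledger, emission_rate, trading_calendar,
                            sessions, benchmark_source):
        # Build every cache here rather than in __init__, so a second
        # simulation never sees values left over from the first.
        self._emits_minutely = (emission_rate == 'minute')
        self._values = [None] * len(sessions)


metric = PerSessionCache()

# First simulation: two sessions, daily emission.
metric.start_of_simulation(None, 'daily', None,
                           ['2014-01-02', '2014-01-03'], None)
first_cache = metric._values

# Second simulation reuses the instance; all state is rebuilt.
metric.start_of_simulation(None, 'minute', None, ['2014-01-02'], None)
second_cache = metric._values
```

Because nothing is carried over between the two calls, the same instance can be passed to several backtests in sequence.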
+
+``end_of_simulation``
+`````````````````````
+
+The ``end_of_simulation`` method should have the following signature:
+
+.. code-block:: python
+
+   def end_of_simulation(self,
+                         packet,
+                         ledger,
+                         trading_calendar,
+                         sessions,
+                         data_portal,
+                         benchmark_source):
+       ...
+
+``ledger`` is an instance of :class:`~zipline.finance.ledger.Ledger` which is
+maintaining the simulation's state. This may be used to look up the algorithm's
+final portfolio values.
+
+``packet`` is a dictionary to write the end of simulation values for the given
+metric into.
+
+``trading_calendar`` is an instance of
+:class:`~zipline.utils.calendars.TradingCalendar` which is the trading calendar
+being used by the simulation.
+
+``sessions`` is a :class:`pandas.DatetimeIndex` which holds the session
+labels, in sorted order, that the simulation has executed.
+
+``data_portal`` is an instance of :class:`~zipline.data.data_portal.DataPortal`
+which is the metric's interface to pricing data.
+
+``benchmark_source`` is an instance of
+:class:`~zipline.sources.benchmark_source.BenchmarkSource` which is the
+interface to the returns of the benchmark specified by
+:func:`~zipline.api.set_benchmark`.
+
+``start_of_session``
+````````````````````
+
+The ``start_of_session`` method may see a slightly different view of the
+``ledger`` or ``data_portal`` than the previous ``end_of_session`` if the price
+of any owned futures moves between trading sessions or if a capital change
+occurs.
+
+The ``start_of_session`` method should have the following signature:
+
+.. code-block:: python
+
+   def start_of_session(self,
+                        ledger,
+                        session_label,
+                        data_portal):
+       ...
+
+``ledger`` is an instance of :class:`~zipline.finance.ledger.Ledger` which is
+maintaining the simulation's state. This may be used to look up the algorithm's
+current portfolio values.
+
+``session_label`` is a :class:`~pandas.Timestamp` which is the label of the
+session which is about to run.
+
+``data_portal`` is an instance of :class:`~zipline.data.data_portal.DataPortal`
+which is the metric's interface to pricing data.
+
+``end_of_session``
+``````````````````
+
+The ``end_of_session`` method should have the following signature:
+
+.. code-block:: python
+
+   def end_of_session(self,
+                      packet,
+                      ledger,
+                      session_label,
+                      session_ix,
+                      data_portal):
+       ...
+
+``packet`` is a dictionary to write the end of session values. This dictionary
+contains two sub-dictionaries: ``daily_perf`` and ``cumulative_perf``. When
+applicable, the ``daily_perf`` should hold the current day's value, and
+``cumulative_perf`` should hold a cumulative value for the entire simulation up
+to the current time.
+
+``ledger`` is an instance of :class:`~zipline.finance.ledger.Ledger` which is
+maintaining the simulation's state. This may be used to look up the algorithm's
+current portfolio values.
+
+``session_label`` is a :class:`~pandas.Timestamp` which is the label of the
+session which has just completed.
+
+``session_ix`` is an :class:`int` which is the index of the current trading
+session being run. This is provided to allow for efficient access to the daily
+returns through ``ledger.daily_returns_array[:session_ix + 1]``.
+
+``data_portal`` is an instance of :class:`~zipline.data.data_portal.DataPortal`
+which is the metric's interface to pricing data.
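As a hedged illustration of how `packet`, `session_ix`, and `ledger.daily_returns_array` fit together, the sketch below writes a daily and a cumulative value. `SimpleReturns` and the `my_returns` key are hypothetical, and a `SimpleNamespace` holding a plain list stands in for the real ledger and its returns array.

```python
from types import SimpleNamespace


class SimpleReturns:
    """Hypothetical metric reporting daily and cumulative returns."""

    def end_of_session(self, packet, ledger, session_label, session_ix,
                       data_portal):
        # Only entries up to and including the current session are valid.
        returns_so_far = ledger.daily_returns_array[:session_ix + 1]

        # Compound the daily returns into one cumulative return.
        growth = 1.0
        for daily_return in returns_so_far:
            growth *= 1.0 + daily_return

        packet['daily_perf']['my_returns'] = returns_so_far[-1]
        packet['cumulative_perf']['my_returns'] = growth - 1.0


# Stand-in ledger: two sessions have completed, the third has not run yet.
ledger = SimpleNamespace(daily_returns_array=[0.10, -0.50, 0.0])
packet = {'daily_perf': {}, 'cumulative_perf': {}}

SimpleReturns().end_of_session(packet, ledger, None, 1, None)
```

Slicing with `session_ix + 1` keeps the placeholder entry for the session that has not run out of the calculation.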
+
+``end_of_bar``
+``````````````
+
+.. note::
+
+   ``end_of_bar`` is only called when ``emission_rate`` is ``minute``.
+
+The ``end_of_bar`` method should have the following signature:
+
+.. code-block:: python
+
+   def end_of_bar(self,
+                  packet,
+                  ledger,
+                  dt,
+                  session_ix,
+                  data_portal):
+       ...
+
+``packet`` is a dictionary to write the end of bar values. This dictionary
+contains two sub-dictionaries: ``minute_perf`` and ``cumulative_perf``. When
+applicable, the ``minute_perf`` should hold the current partial day's value, and
+``cumulative_perf`` should hold a cumulative value for the entire simulation up
+to the current time.
+
+``ledger`` is an instance of :class:`~zipline.finance.ledger.Ledger` which is
+maintaining the simulation's state. This may be used to look up the algorithm's
+current portfolio values.
+
+``dt`` is a :class:`~pandas.Timestamp` which is the label of the bar that has
+just completed.
+
+``session_ix`` is an :class:`int` which is the index of the current trading
+session being run. This is provided to allow for efficient access to the daily
+returns through ``ledger.daily_returns_array[:session_ix + 1]``.
+
+``data_portal`` is an instance of :class:`~zipline.data.data_portal.DataPortal`
+which is the metric's interface to pricing data.
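One way a single metric might report at both time-scales is to share a helper between `end_of_bar` and `end_of_session`. The sketch below assumes the ledger exposes the value as `ledger.portfolio.portfolio_value`; that attribute path, the `PortfolioValue` class, and the `my_portfolio_value` key are illustrative assumptions, with `SimpleNamespace` objects standing in for the real ledger.

```python
from types import SimpleNamespace


class PortfolioValue:
    """Hypothetical metric reporting at minute and daily time-scales."""

    def _report(self, packet, ledger, period_key):
        # Assumed attribute path; a stand-in ledger is used below.
        value = ledger.portfolio.portfolio_value
        packet[period_key]['my_portfolio_value'] = value
        packet['cumulative_perf']['my_portfolio_value'] = value

    def end_of_bar(self, packet, ledger, dt, session_ix, data_portal):
        self._report(packet, ledger, 'minute_perf')

    def end_of_session(self, packet, ledger, session_label, session_ix,
                       data_portal):
        self._report(packet, ledger, 'daily_perf')


ledger = SimpleNamespace(portfolio=SimpleNamespace(portfolio_value=1000.0))
metric = PortfolioValue()

minute_packet = {'minute_perf': {}, 'cumulative_perf': {}}
metric.end_of_bar(minute_packet, ledger, None, 0, None)

daily_packet = {'daily_perf': {}, 'cumulative_perf': {}}
metric.end_of_session(daily_packet, ledger, None, 0, None)
```

With a `daily` emission rate only `end_of_session` would run, so the metric still reports without any extra branching.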
+
+Defining New Metrics Sets
+~~~~~~~~~~~~~~~~~~~~~~~~~
+
+Users may use :func:`zipline.finance.metrics.register` to register a new metrics
+set. This may be used to decorate a function taking no arguments which returns a
+new set of metric object instances. For example:
+
+.. code-block:: python
+
+   from zipline.finance import metrics
+
+   @metrics.register('my-metrics-set')
+   def my_metrics_set():
+       return {MyMetric(), MyOtherMetric(), ...}
+
+
+This may be embedded in the user's ``extension.py``.
+
+The reason that a metrics set is defined as a function which produces a set,
+instead of just a set, is that users may want to fetch external data or
+resources to construct their metrics. By putting this behind a callable, users
+do not need to fetch the resources when the metrics set is not being used.
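The benefit of registering a callable instead of a set can be shown with a toy registry. This is only a sketch of the idea, not Zipline's implementation: `_registry`, `register`, and `load` here are simplified stand-ins, and appending to `fetch_log` stands in for fetching an external resource.

```python
_registry = {}  # hypothetical stand-in for the metrics-set registry


def register(name):
    """Store the factory itself; do not call it at registration time."""
    def decorator(factory):
        _registry[name] = factory
        return factory
    return decorator


def load(name):
    """Build the metrics set only when it is actually requested."""
    return _registry[name]()


fetch_log = []


@register('expensive')
def expensive_metrics_set():
    fetch_log.append('fetched')  # stands in for an expensive external fetch
    return {'metric-a', 'metric-b'}


fetches_after_register = len(fetch_log)  # registering triggers no fetch
metrics = load('expensive')              # the fetch happens here
fetches_after_load = len(fetch_log)
```

A backtest that selects a different metrics set never calls `expensive_metrics_set`, so it never pays for the fetch.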