Quantopian Pairs Trading Algo - Deprecation Fix #1550

ash487 · 2016-10-23T20:25:32Z

Greetings Quantopian Community,

I was at the NYC event on pairs trading. The current example algorithm is deprecated, so it cannot be deployed in live trading. With this fix, users can deploy the algorithm in live trading.

import numpy as np
import statsmodels.api as sm
import pandas as pd
from zipline.utils import tradingcalendar
import pytz


def initialize(context):
    # Quantopian backtester specific variables
    set_slippage(slippage.FixedSlippage(spread=0))
    set_commission(commission.PerTrade(cost=1))
    set_symbol_lookup_date('2010-01-01')

    context.stock_pairs = [(symbol(' '), symbol(' '))]
    # set_benchmark(context.y)

    context.num_pairs = len(context.stock_pairs)
    # strategy specific variables
    context.lookback = 20 # used for regression
    context.z_window = 20 # used for zscore calculation, must be <= lookback

    context.spread = np.ndarray((context.num_pairs, 0))
    # context.hedgeRatioTS = np.ndarray((context.num_pairs, 0))
    context.inLong = [False] * context.num_pairs
    context.inShort = [False] * context.num_pairs

    # Only do work 30 minutes before close
    schedule_function(func=check_pair_status, date_rule=date_rules.every_day(), time_rule=time_rules.market_close(minutes=30))

# Will be called on every trade event for the securities you specify. 
def handle_data(context, data):
    # Our work is now scheduled in check_pair_status
    pass

def check_pair_status(context, data):
    if get_open_orders():
        return

    new_spreads = np.ndarray((context.num_pairs, 1))

    for i in range(context.num_pairs):

        (stock_y, stock_x) = context.stock_pairs[i]


        # *** THE CHANGE ***
        ######################################################

        Y = data.history(stock_y, 'price', 35, '1d').iloc[-context.lookback:]
        X = data.history(stock_x, 'price', 35, '1d').iloc[-context.lookback:]

        ######################################################

        try:
            hedge = hedge_ratio(Y, X, add_const=True)      
        except ValueError as e:
            log.debug(e)
            return

        # context.hedgeRatioTS = np.append(context.hedgeRatioTS, hedge)

        # Use explicit positional indexing for the latest price
        new_spreads[i, :] = Y.iloc[-1] - hedge * X.iloc[-1]

        if context.spread.shape[1] > context.z_window:
            # Keep only the z-score lookback period
            spreads = context.spread[i, -context.z_window:]

            zscore = (spreads[-1] - spreads.mean()) / spreads.std()

            if context.inShort[i] and zscore < 0.0:
                order_target(stock_y, 0)
                order_target(stock_x, 0)
                context.inShort[i] = False
                context.inLong[i] = False
                record(X_pct=0, Y_pct=0)
                return

            if context.inLong[i] and zscore > 0.0:
                order_target(stock_y, 0)
                order_target(stock_x, 0)
                context.inShort[i] = False
                context.inLong[i] = False
                record(X_pct=0, Y_pct=0)
                return

            if zscore < -1.0 and (not context.inLong[i]):
                # Only trade if NOT already in a trade
                y_target_shares = 1
                X_target_shares = -hedge
                context.inLong[i] = True
                context.inShort[i] = False

                (y_target_pct, x_target_pct) = computeHoldingsPct(y_target_shares, X_target_shares, Y.iloc[-1], X.iloc[-1])
                order_target_percent( stock_y, y_target_pct * (1.0/context.num_pairs) / float(context.num_pairs) )
                order_target_percent( stock_x, x_target_pct * (1.0/context.num_pairs) / float(context.num_pairs) )
                record(Y_pct=y_target_pct, X_pct=x_target_pct)
                return

            if zscore > 1.0 and (not context.inShort[i]):
                # Only trade if NOT already in a trade
                y_target_shares = -1
                X_target_shares = hedge
                context.inShort[i] = True
                context.inLong[i] = False

                (y_target_pct, x_target_pct) = computeHoldingsPct(y_target_shares, X_target_shares, Y.iloc[-1], X.iloc[-1])
                order_target_percent( stock_y, y_target_pct * (1.0/context.num_pairs) / float(context.num_pairs) )
                order_target_percent( stock_x, x_target_pct * (1.0/context.num_pairs) / float(context.num_pairs) )
                record(Y_pct=y_target_pct, X_pct=x_target_pct)

    context.spread = np.hstack([context.spread, new_spreads])

def hedge_ratio(Y, X, add_const=True):
    if add_const:
        X = sm.add_constant(X)
        model = sm.OLS(Y, X).fit()
        return model.params[1]
    model = sm.OLS(Y, X).fit()
    return model.params.values

def computeHoldingsPct(yShares, xShares, yPrice, xPrice):
    yDol = yShares * yPrice
    xDol = xShares * xPrice
    notionalDol =  abs(yDol) + abs(xDol)
    y_target_pct = yDol / notionalDol
    x_target_pct = xDol / notionalDol
    return (y_target_pct, x_target_pct)
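
For anyone who wants to sanity-check the arithmetic outside the Quantopian environment, here is a minimal sketch of the three calculations the algorithm relies on: the hedge ratio, the z-score of the latest spread, and the holdings percentages. It is an illustration under assumptions, not the algorithm itself: it uses the closed-form OLS slope from the standard library instead of `statsmodels`, the prices and the spread window are made up, and the helper names `ols_slope`, `zscore_of_last` and `compute_holdings_pct` are hypothetical.

```python
from statistics import mean, pstdev


def ols_slope(y, x):
    """Slope b of y = a + b*x fitted by ordinary least squares (closed form)."""
    x_bar, y_bar = mean(x), mean(y)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    return sxy / sxx


def zscore_of_last(values):
    """Z-score of the last value against the whole window.

    Uses the population standard deviation, which matches the default
    behaviour of numpy's ndarray.std().
    """
    return (values[-1] - mean(values)) / pstdev(values)


def compute_holdings_pct(y_shares, x_shares, y_price, x_price):
    """Split the notional between the two legs, keeping the sign of each leg."""
    y_dol = y_shares * y_price
    x_dol = x_shares * x_price
    notional = abs(y_dol) + abs(x_dol)
    return y_dol / notional, x_dol / notional


# Made-up prices: y is exactly 2 * x + 1, so the fitted slope is 2.
x_prices = [10.0, 11.0, 12.0, 13.0, 14.0]
y_prices = [2.0 * p + 1.0 for p in x_prices]
hedge = ols_slope(y_prices, x_prices)

# Made-up spread window whose last value sits well above the mean.
spreads = [1.0, 1.0, 1.0, 1.0, 3.0]
z = zscore_of_last(spreads)

# Long one share of y, short `hedge` shares of x (the "long spread" branch).
y_pct, x_pct = compute_holdings_pct(1, -hedge, y_prices[-1], x_prices[-1])
print(hedge, z, y_pct, x_pct)
```

The two percentages always sum to 1 in absolute value, which is why the algorithm can scale them per pair before passing them to `order_target_percent`.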

…#1470) This reverts commit 5b1aa5e. The paradigm is: we're calculating a new capital base for the performance period. We are therefore using the total portfolio_value, not just the cash, to calculate the difference from the specified target as the algorithm has meaningful holdings.

Remove module scope invocations of `get_calendar('NYSE')`, which cuts zipline import time in half on my machine. This makes the zipline CLI noticeably more responsive, and it reduces memory consumed at import time from 130MB to 90MB.

Before:

    $ time python -c 'import zipline'
    real 0m1.262s
    user 0m1.128s
    sys 0m0.120s

After:

    $ time python -c 'import zipline'
    real 0m0.676s
    user 0m0.536s
    sys 0m0.132s
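
The general pattern behind this kind of change can be sketched as follows. This is not zipline's code: `build_calendar` is a hypothetical stand-in for an expensive constructor such as `get_calendar`, and the `CALLS` counter exists only to show when the work happens. The idea is to replace a module-level call, which runs on every import, with a cached accessor that runs on first use.

```python
import functools

# Counts how many times the expensive constructor actually runs.
CALLS = {"count": 0}


def build_calendar(name):
    """Hypothetical stand-in for an expensive calendar constructor."""
    CALLS["count"] += 1
    return {"name": name, "sessions": list(range(5))}


# Eager version, shown only for contrast: this would run at import time.
# NYSE = build_calendar("NYSE")


@functools.lru_cache(maxsize=None)
def nyse_calendar():
    """Build the calendar on first use and reuse it afterwards."""
    return build_calendar("NYSE")


# Nothing has been built yet at this point; the first call does the work.
first = nyse_calendar()
second = nyse_calendar()
print(CALLS["count"], first is second)
```

Callers that never touch the calendar pay nothing at import time, and callers that do pay the cost once.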

Instead of using the difference between the session close of the front contract before the roll and the open of the back contract at the beginning of the roll, use the closes of both at the end of the session before the roll. The closes of the session prior to the roll are used in lieu of settlement data.
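
A worked example of the arithmetic, as a rough sketch only. It assumes an additive adjustment (the commit message says "difference" but does not spell out the adjustment style), the prices are made up, and `roll_adjustment` and `back_adjust` are hypothetical names rather than zipline functions.

```python
def roll_adjustment(front_close, back_close):
    """Gap between the two contracts, both taken at the close of the
    session before the roll (used as a proxy for settlement prices)."""
    return back_close - front_close


def back_adjust(front_prices, adjustment):
    """Shift the pre-roll history of the front contract by the gap."""
    return [price + adjustment for price in front_prices]


# Made-up closes of the front contract for the sessions before the roll.
front_closes = [100.0, 101.0, 102.0]
# Made-up close of the back contract on the last session before the roll.
back_close_before_roll = 104.5

adjustment = roll_adjustment(front_closes[-1], back_close_before_roll)
adjusted = back_adjust(front_closes, adjustment)
print(adjustment, adjusted)
```

Because both prices come from the same session close, the adjusted front history ends exactly at the back contract's close, with no overnight move mixed into the gap.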

richafrank · 2016-10-27T19:18:16Z

Hi @ash487! It looks like this PR proposes we merge our master branch into @ssanderson's "revamp-tutorial" branch. Is the fix you're suggesting on a branch somewhere that you want us to pull in?

richafrank · 2016-10-28T19:22:18Z

@ash487 I'm going to close this, but feel free to open a new PR using the branch with your fix!

phil.zhang and others added 30 commits September 2, 2016 16:47

BUG: Change str to string_types to avoid errors

7ac9127

When in python2.7, and unicode_literals is imported type check will raise error because 'type' is not str but unicode
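
The failure mode described here can be illustrated with a small sketch. It assumes a locally defined `string_types` tuple in the style of the `six` library, and `check_is_string` is a hypothetical helper, not the function touched by this commit. Under Python 2 with `unicode_literals`, a literal like `"abc"` is `unicode`, so `isinstance(value, str)` is false, while a check against `string_types` accepts both.

```python
import sys

# Same idea as six.string_types, defined locally so the sketch has no
# third-party dependency.
if sys.version_info[0] >= 3:
    string_types = (str,)
else:
    string_types = (basestring,)  # noqa: F821 (exists on Python 2 only)


def check_is_string(value):
    """Accept any text type for the running interpreter."""
    return isinstance(value, string_types)


print(check_is_string("abc"), check_is_string(1))
```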

BUG: Fix up check_parameters usage of string_types

a4e495d

and corresponding tests

BUG: Fixing SymbolNotFound to be raised.

ebe8311

BUG: Handle case with mult symbol options for same sid.

d7b3c54

BUG: Fixing 2/3 compat.

658b536

Merge pull request #1462 from quantopian/symbol-lookup-raises

393aa06

Symbol lookup raises

ENH: just deprecate __getitem__, don't remove

1546979

DEV: update copyright in protocol.py (added code)

a3e869e

Merge pull request #1449 from quantopian/getitem-is-not-getattr

cf2abf1

MAINT: remove __getitem__ as alias of __getattr__

MAINT/TEST: Update default calendar smoketest.

977d1fa

ENH: improve warning for protocol getitem

ec1ca28

Merge pull request #1472 from quantopian/branch-protect-hook-stopped-…

89786f1

…me-from-pushing-this-commit-directly-;_; ENH: improve warning for protocol getitem

REL: Prepare for 1.0.2 release.

15aaafe

Update release notes. Generate api stubs.

Merge pull request #1473 from quantopian/release-1.0.2

cf44fcb

REL: Prepare for 1.0.2 release.

added DGZ to delete list

a5ecaf4

Merge pull request #1434 from quantopian/update-leveraged-etfs

9c8f0ce

MAINT: Update leveraged ETF list

MAINT: Updates from Joe's PR feedback.

d6ad73e

STY: Fix flake8.

40fa6ae

Merge pull request #1471 from quantopian/fix-slow-startup

1ccc9e4

PERF: Remove import-time calendar creations.

DOC: Add skeleton for 1.0.3 release notes.

36c4f4a

Post 1.0.2 cleanup.

More Fuzzy Symbol Fixes (#1475)

4a00e69

* REF: More options before raise MultiFound. * TST: Checks corner case for fuzzy matching.

Merge pull request #1467 from quantopian/check_param-string_types

df07f67

Check param string types

BUG: run_algorithm with no data source should default

3bdba2e

to 'quantopian-quandl' bundle

Merge pull request #1479 from quantopian/run_algo-defaults

7b2ca76

BUG: run_algorithm with no data source should default

Revert "added DGZ to delete list" (#1481)

508c7ac

This reverts commit a5ecaf4. This causes downstream problems; unsure why, Jamie advised reverting.

MAINT: Add additional fields to __getitem__ for Order (#1483)

fb0981e

These were previously available like the others.

PERF: Release unneeded pipeline terms.

3babc38

Refcount pipeline terms during execution and release terms once they're no longer needed. This dramatically reduces memory usage on large pipelines.
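
To make the idea concrete, here is a toy sketch of reference-counted release over a dependency graph. It is not zipline's pipeline engine: the graph is a plain dict, and `run_with_refcounts` and `compute` are hypothetical names. Each term starts with one reference per dependent (plus one if it is a requested output), and its result is dropped from the workspace once the count reaches zero.

```python
def run_with_refcounts(graph, outputs, compute):
    """Evaluate terms in dependency order, releasing results no longer needed.

    graph:   dict mapping term name -> list of dependency names.
    outputs: terms the caller wants back; these are never released.
    compute: callable(term, dependency_values) -> value.
    Returns (results, peak), where peak is the most values held at once.
    """
    # Depth-first post-order gives an order where dependencies come first.
    order, seen = [], set()

    def visit(term):
        if term in seen:
            return
        seen.add(term)
        for dep in graph[term]:
            visit(dep)
        order.append(term)

    for term in outputs:
        visit(term)

    # One reference per dependent among the terms we will actually run,
    # plus one for each requested output.
    refcounts = {term: 0 for term in order}
    for term in order:
        for dep in graph[term]:
            refcounts[dep] += 1
    for term in outputs:
        refcounts[term] += 1

    workspace, peak = {}, 0
    for term in order:
        workspace[term] = compute(term, [workspace[d] for d in graph[term]])
        peak = max(peak, len(workspace))
        for dep in graph[term]:
            refcounts[dep] -= 1
            if refcounts[dep] == 0:
                del workspace[dep]  # nobody else needs this result
    return workspace, peak


# A four-term chain: d depends on c, c on b, b on a.
graph = {"a": [], "b": ["a"], "c": ["b"], "d": ["c"]}
results, peak = run_with_refcounts(
    graph, ["d"], lambda term, deps: 1 + sum(deps)
)
print(results, peak)
```

In this chain only two results are ever alive at once, even though four terms are computed, which is the kind of saving the commit describes for large pipelines.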

MAINT: Move refcount management into TermGraph.

31436cd

Eddie Hebert and others added 16 commits October 26, 2016 14:41

MAINT: Add more info to history calendar KeyError.

cd36e37

There have been cases where the requested start or end date is not in the history calendar. Add the beginning and end of the calendar to the KeyError to give more detail to figure out root cause.

Merge pull request #1558 from quantopian/add-detail-to-history-calend…

df2a091

…ar-mismatch MAINT: Add more info to history calendar KeyError.

PERF: Try cache on scalar asset lookups.

a56fc70

This provides a 15% speedup for an algo that calls `data.current` with 1000 every minute.

MAINT: Auto-rebuild templated cython files.

35a72f6

PERF: Remove attribute access in inner loop.

73cc580

PERF: Vectorize assignments in get_history_window.

20531fb

BUG: Return NaT instead of None in daily reader.

85fcf0b

PERF: Refactor AdjustedArrayWindow.

502fbf5

Make `__next__` and `seek` share code instead of seek() calling `__next__`. This avoids having to make a large number of integer comparisons and `asanyarray` calls when seeking more than one tick forward.

DEV delete old *.c and *.so files with rebuild-cython.sh for Darwin/O…

e7ca080

…SX (#1560)

PERF: Use vectorized assignment into dataframe.

9d10e28

This is a dramatic speedup (~25% in local benchmarks) for history calls with a large number of assets and a short window length.

PERF: Pull out loop-invariant code.

5547cca

This shaves off 20 out of 160 seconds for an algorithm that makes a large number of large universe, short window_length `history()` calls.

MAINT/PERF: Remove redundant method call.

25c78de

`_get_minute_window_data` was just forwarding its input to a method with the same signature.

PERF: Don't round until after we hstack.

9e3eebc

PERF: Call concatenate directly instead of hstack.

70cc602

Avoids a couple function calls in a hot path.

Merge pull request #1563 from quantopian/use-same-session-for-contrac…

43ca435

…t-closes BUG: Use proxy for settlement on future adjustments.

Eddie Hebert and others added 11 commits October 27, 2016 16:23

BUG: Fix continuous future history with offsets.

4235dbd

Apply offset value when writing out the rolls in a continuous future which is offset from the primary.

BUG: Protect against contract offset at end of range. (#1564)

575a8cf

This boundary case was exposed with internal fixture data which used a continuous future with a contract chain of size one.

Merge pull request #1565 from quantopian/fix-offset-history

9a51efc

BUG: Fix continuous future history with offsets.

Merge pull request #1561 from quantopian/micro-optimizations-2

285b5f7

Micro optimizations 2

MAINT: Restore @property decorator

265c02b

This will keep `opens`, `closes`, `early_closes`, etc to the same pattern.

Merge pull request #1567 from bernoullio/master

2c819a6

MAINT: Restore @property decorator

BUG: Raise SidsNotFound in retrieve_asset.

1fbc17d

DOC: Comment on outdated code.

8ccef7b

MAINT: Consolidate data_portal names.

21a3f1a

Rename _get_daily_window_for_sids to _get_daily_window_data. Rename _get_minute_window_for_assets to _get_minute_window_data. Rename _get_daily_data to get_daily_spot_value.

Merge pull request #1568 from quantopian/fix-microoptimizations

5ecb5c3

Fix microoptimizations

STY: Put 0 at the end. (#1569)

2536ad2

richafrank closed this Oct 28, 2016
