### Linear Regression Coefficient Significance Testing

In this notebook I load the structured data and fit an ordinary least squares (OLS) linear regression to it with statsmodels. I inspect the coefficient p-values, remove the features whose coefficients are not significant, and refit. The goal is to understand which features, if any, are significant and why.

In [3]:
import statsmodels.api as sm

In [1]:
import pandas as pd
import numpy as np

pd.set_option('display.max_columns', 500)

df = pd.read_pickle('./data/GOOG081320.pickle')
df.head()

Unnamed: 0,open,high,low,close,volume,date,SYMBOL,prev_close,diff_1,pct_change,log_return,pct_return_0,pct_return_1,pct_return_2,pct_return_3,pct_return_4,pct_return_5,pct_return_6,pct_return_7,pct_return_8,pct_return_9,pct_return_10,pct_return_11,pct_return_12,pct_return_13,pct_return_14,pct_return_15,pct_return_16,pct_return_17,pct_return_18,pct_return_19,pct_return_20,pct_return_21,pct_return_22,pct_return_23,pct_return_24,pct_return_25,pct_return_26,pct_return_27,pct_return_28,pct_return_29,pct_return_30,intervals,"monthly_(-0.12, -0.11499999999999999]","monthly_(-0.11499999999999999, -0.10999999999999999]","monthly_(-0.10999999999999999, -0.105]","monthly_(-0.105, -0.09999999999999999]","monthly_(-0.09999999999999999, -0.09499999999999999]","monthly_(-0.09499999999999999, -0.09]","monthly_(-0.09, -0.08499999999999999]","monthly_(-0.08499999999999999, -0.07999999999999999]","monthly_(-0.07999999999999999, -0.07499999999999998]","monthly_(-0.07499999999999998, -0.06999999999999998]","monthly_(-0.06999999999999998, -0.06499999999999999]","monthly_(-0.06499999999999999, -0.059999999999999984]","monthly_(-0.059999999999999984, -0.05499999999999998]","monthly_(-0.05499999999999998, -0.04999999999999999]","monthly_(-0.04999999999999999, -0.044999999999999984]","monthly_(-0.044999999999999984, -0.03999999999999998]","monthly_(-0.03999999999999998, -0.034999999999999976]","monthly_(-0.034999999999999976, -0.02999999999999997]","monthly_(-0.02999999999999997, -0.02499999999999998]","monthly_(-0.02499999999999998, -0.019999999999999976]","monthly_(-0.019999999999999976, -0.014999999999999972]","monthly_(-0.014999999999999972, -0.009999999999999981]","monthly_(-0.009999999999999981, -0.004999999999999977]","monthly_(-0.004999999999999977, 2.7755575615628914e-17]","monthly_(2.7755575615628914e-17, 0.005000000000000032]","monthly_(0.005000000000000032, 0.010000000000000037]","monthly_(0.010000000000000037, 0.015000000000000041]","monthly_(0.015000000000000041, 
0.020000000000000018]","monthly_(0.020000000000000018, 0.025000000000000022]","monthly_(0.025000000000000022, 0.030000000000000027]","monthly_(0.030000000000000027, 0.03500000000000003]","monthly_(0.03500000000000003, 0.040000000000000036]","monthly_(0.040000000000000036, 0.04500000000000004]","monthly_(0.04500000000000004, 0.050000000000000044]","monthly_(0.050000000000000044, 0.05500000000000005]","monthly_(0.05500000000000005, 0.06000000000000005]","monthly_(0.06000000000000005, 0.06500000000000003]","monthly_(0.06500000000000003, 0.07000000000000003]","monthly_(0.07000000000000003, 0.07500000000000004]","monthly_(0.07500000000000004, 0.08000000000000004]","monthly_(0.08000000000000004, 0.08500000000000005]","monthly_(0.08500000000000005, 0.09000000000000005]","monthly_(0.09000000000000005, 0.09500000000000006]","monthly_(0.09500000000000006, 0.10000000000000003]","monthly_(0.10000000000000003, 0.10500000000000004]","monthly_(0.10500000000000004, 0.11000000000000004]","monthly_(0.11000000000000004, 0.11500000000000005]","monthly_(0.11500000000000005, 0.12000000000000005]","monthly_(0.12000000000000005, 0.12500000000000006]","monthly_(0.12500000000000006, 0.13000000000000006]","monthly_(0.13000000000000006, 0.13500000000000006]","monthly_(0.13500000000000006, 0.14000000000000007]","monthly_(0.14000000000000007, 0.14500000000000007]","monthly_(0.14500000000000007, 0.15000000000000008]","monthly_(0.15000000000000008, 0.15500000000000008]","monthly_(0.15500000000000008, 0.16000000000000003]","monthly_(0.16000000000000003, 0.16500000000000004]","monthly_(0.16500000000000004, 0.17000000000000004]","monthly_(0.17000000000000004, 0.17500000000000004]","monthly_(0.17500000000000004, 0.18000000000000005]","monthly_(0.18000000000000005, 0.18500000000000005]","monthly_(0.18500000000000005, 0.19000000000000006]","monthly_(0.19000000000000006, 0.19500000000000006]","monthly_(0.19500000000000006, 0.20000000000000007]","monthly_(0.20000000000000007, 
0.20500000000000007]","monthly_(0.20500000000000007, 0.21000000000000008]","monthly_(0.21000000000000008, 0.21500000000000005]","yearly_(-0.12, -0.11499999999999999]","yearly_(-0.11499999999999999, -0.10999999999999999]","yearly_(-0.10999999999999999, -0.105]","yearly_(-0.105, -0.09999999999999999]","yearly_(-0.09999999999999999, -0.09499999999999999]","yearly_(-0.09499999999999999, -0.09]","yearly_(-0.09, -0.08499999999999999]","yearly_(-0.08499999999999999, -0.07999999999999999]","yearly_(-0.07999999999999999, -0.07499999999999998]","yearly_(-0.07499999999999998, -0.06999999999999998]","yearly_(-0.06999999999999998, -0.06499999999999999]","yearly_(-0.06499999999999999, -0.059999999999999984]","yearly_(-0.059999999999999984, -0.05499999999999998]","yearly_(-0.05499999999999998, -0.04999999999999999]","yearly_(-0.04999999999999999, -0.044999999999999984]","yearly_(-0.044999999999999984, -0.03999999999999998]","yearly_(-0.03999999999999998, -0.034999999999999976]","yearly_(-0.034999999999999976, -0.02999999999999997]","yearly_(-0.02999999999999997, -0.02499999999999998]","yearly_(-0.02499999999999998, -0.019999999999999976]","yearly_(-0.019999999999999976, -0.014999999999999972]","yearly_(-0.014999999999999972, -0.009999999999999981]","yearly_(-0.009999999999999981, -0.004999999999999977]","yearly_(-0.004999999999999977, 2.7755575615628914e-17]","yearly_(2.7755575615628914e-17, 0.005000000000000032]","yearly_(0.005000000000000032, 0.010000000000000037]","yearly_(0.010000000000000037, 0.015000000000000041]","yearly_(0.015000000000000041, 0.020000000000000018]","yearly_(0.020000000000000018, 0.025000000000000022]","yearly_(0.025000000000000022, 0.030000000000000027]","yearly_(0.030000000000000027, 0.03500000000000003]","yearly_(0.03500000000000003, 0.040000000000000036]","yearly_(0.040000000000000036, 0.04500000000000004]","yearly_(0.04500000000000004, 0.050000000000000044]","yearly_(0.050000000000000044, 0.05500000000000005]","yearly_(0.05500000000000005, 
0.06000000000000005]","yearly_(0.06000000000000005, 0.06500000000000003]","yearly_(0.06500000000000003, 0.07000000000000003]","yearly_(0.07000000000000003, 0.07500000000000004]","yearly_(0.07500000000000004, 0.08000000000000004]","yearly_(0.08000000000000004, 0.08500000000000005]","yearly_(0.08500000000000005, 0.09000000000000005]","yearly_(0.09000000000000005, 0.09500000000000006]","yearly_(0.09500000000000006, 0.10000000000000003]","yearly_(0.10000000000000003, 0.10500000000000004]","yearly_(0.10500000000000004, 0.11000000000000004]","yearly_(0.11000000000000004, 0.11500000000000005]","yearly_(0.11500000000000005, 0.12000000000000005]","yearly_(0.12000000000000005, 0.12500000000000006]","yearly_(0.12500000000000006, 0.13000000000000006]","yearly_(0.13000000000000006, 0.13500000000000006]","yearly_(0.13500000000000006, 0.14000000000000007]","yearly_(0.14000000000000007, 0.14500000000000007]","yearly_(0.14500000000000007, 0.15000000000000008]","yearly_(0.15000000000000008, 0.15500000000000008]","yearly_(0.15500000000000008, 0.16000000000000003]","yearly_(0.16000000000000003, 0.16500000000000004]","yearly_(0.16500000000000004, 0.17000000000000004]","yearly_(0.17000000000000004, 0.17500000000000004]","yearly_(0.17500000000000004, 0.18000000000000005]","yearly_(0.18000000000000005, 0.18500000000000005]","yearly_(0.18500000000000005, 0.19000000000000006]","yearly_(0.19000000000000006, 0.19500000000000006]","yearly_(0.19500000000000006, 0.20000000000000007]","yearly_(0.20000000000000007, 0.20500000000000007]","yearly_(0.20500000000000007, 0.21000000000000008]","yearly_(0.21000000000000008, 0.21500000000000005]",pct_change_SMA_50,pct_change_SMA_120,close_SMA_50,close_SMA_120,diff_from_SMA_50,diff_from_SMA_120
0,1485.58,1512.3859,1485.25,1506.62,1437655,2020-08-12,GOOG,1480.32,26.3,0.017766,0.01761,0.017766,-0.010547,0.001077,-0.00374,0.017976,0.005898,-0.00643,-0.005739,-0.031663,0.006196,0.01445,-0.019514,0.012124,-0.002514,-0.033669,0.006462,-0.004662,0.033103,-0.001614,0.00288,-0.004564,0.006114,-0.019718,0.020351,0.01002,0.007285,-0.007033,0.021165,0.018539,0.017282,0.013362,"(0.015000000000000041, 0.020000000000000018]",0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,2,0,0,2,1,3,5,2,5,4,3,2,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,1,1,2,2,3,2,5,3,9,8,15,15,41,40,37,23,21,8,2,4,3,1,0,2,0,0,0,1,0,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.001067,0.00057,1476.0256,1360.140458,0.020728,0.107694
1,1492.44,1510.0,1478.0,1480.32,1454365,2020-08-11,GOOG,1496.1,-15.78,-0.010547,-0.010603,-0.010547,0.001077,-0.00374,0.017976,0.005898,-0.00643,-0.005739,-0.031663,0.006196,0.01445,-0.019514,0.012124,-0.002514,-0.033669,0.006462,-0.004662,0.033103,-0.001614,0.00288,-0.004564,0.006114,-0.019718,0.020351,0.01002,0.007285,-0.007033,0.021165,0.018539,0.017282,0.013362,0.025789,"(-0.014999999999999972, -0.009999999999999981]",0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,2,0,0,2,0,3,5,2,5,4,3,2,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,1,1,2,2,3,2,5,3,9,8,14,15,41,40,37,23,21,8,3,4,3,1,0,2,0,0,0,1,0,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.000815,0.00024,1474.6776,1359.961208,0.003826,0.088502
2,1487.18,1504.075,1473.08,1496.1,1289530,2020-08-10,GOOG,1494.49,1.61,0.001077,0.001077,0.001077,-0.00374,0.017976,0.005898,-0.00643,-0.005739,-0.031663,0.006196,0.01445,-0.019514,0.012124,-0.002514,-0.033669,0.006462,-0.004662,0.033103,-0.001614,0.00288,-0.004564,0.006114,-0.019718,0.020351,0.01002,0.007285,-0.007033,0.021165,0.018539,0.017282,0.013362,0.025789,-0.056496,"(2.7755575615628914e-17, 0.005000000000000032]",0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,2,0,0,2,0,3,5,1,6,4,3,2,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,1,1,2,2,3,2,5,3,9,8,14,15,41,40,37,23,21,8,3,4,3,1,0,2,0,0,0,1,0,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.001066,0.000282,1473.7076,1360.276458,0.015195,0.09985
3,1500.0,1516.845,1481.64,1494.49,1577826,2020-08-07,GOOG,1500.1,-5.61,-0.00374,-0.003747,-0.00374,0.017976,0.005898,-0.00643,-0.005739,-0.031663,0.006196,0.01445,-0.019514,0.012124,-0.002514,-0.033669,0.006462,-0.004662,0.033103,-0.001614,0.00288,-0.004564,0.006114,-0.019718,0.020351,0.01002,0.007285,-0.007033,0.021165,0.018539,0.017282,0.013362,0.025789,-0.056496,0.006536,"(-0.004999999999999977, 2.7755575615628914e-17]",0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,2,0,1,2,0,3,4,1,6,4,3,2,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,1,1,2,2,3,2,5,3,9,8,14,15,40,40,37,23,22,8,3,4,3,1,0,2,0,0,0,1,0,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.001217,0.000311,1472.364,1360.531375,0.015028,0.098461
4,1471.75,1502.39,1466.0,1500.1,1995368,2020-08-06,GOOG,1473.61,26.49,0.017976,0.017817,0.017976,0.005898,-0.00643,-0.005739,-0.031663,0.006196,0.01445,-0.019514,0.012124,-0.002514,-0.033669,0.006462,-0.004662,0.033103,-0.001614,0.00288,-0.004564,0.006114,-0.019718,0.020351,0.01002,0.007285,-0.007033,0.021165,0.018539,0.017282,0.013362,0.025789,-0.056496,0.006536,-0.022152,"(0.015000000000000041, 0.020000000000000018]",0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,2,0,1,2,0,3,4,1,7,4,2,2,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,1,1,2,2,3,2,6,3,9,8,14,15,40,40,37,23,21,8,3,4,3,1,0,2,0,0,0,1,0,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.001276,0.000336,1470.8088,1360.741208,0.019915,0.102414


In [2]:
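# Rows are sorted newest-first, so shift(1) copies the following trading
# day's pct_change into each row: the target is the next day's return.
df['target'] = df['pct_change'].shift(1)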
df[['target','pct_change']]
drop_cols = ['pct_change',  # redundant column
             'intervals',  # data leakage
             'diff_1',  # dropping any column that references absolute price for now
             'prev_close', 
             'SYMBOL', # not important here
             'date', # should not be important 
             'open', # open high low close might be interesting but sticking with dropping absolute values 
             'high',
             'low',
             'close',
             'close_SMA_50', # since they are absolute still dropping these
             'close_SMA_120'] 

df.drop(drop_cols, axis=1, inplace=True)
df.dropna(inplace=True)

y = df['target']
X = df.drop('target', axis=1)

y.shape, X.shape

df.head()

Unnamed: 0,volume,log_return,pct_return_0,pct_return_1,pct_return_2,pct_return_3,pct_return_4,pct_return_5,pct_return_6,pct_return_7,pct_return_8,pct_return_9,pct_return_10,pct_return_11,pct_return_12,pct_return_13,pct_return_14,pct_return_15,pct_return_16,pct_return_17,pct_return_18,pct_return_19,pct_return_20,pct_return_21,pct_return_22,pct_return_23,pct_return_24,pct_return_25,pct_return_26,pct_return_27,pct_return_28,pct_return_29,pct_return_30,"monthly_(-0.12, -0.11499999999999999]","monthly_(-0.11499999999999999, -0.10999999999999999]","monthly_(-0.10999999999999999, -0.105]","monthly_(-0.105, -0.09999999999999999]","monthly_(-0.09999999999999999, -0.09499999999999999]","monthly_(-0.09499999999999999, -0.09]","monthly_(-0.09, -0.08499999999999999]","monthly_(-0.08499999999999999, -0.07999999999999999]","monthly_(-0.07999999999999999, -0.07499999999999998]","monthly_(-0.07499999999999998, -0.06999999999999998]","monthly_(-0.06999999999999998, -0.06499999999999999]","monthly_(-0.06499999999999999, -0.059999999999999984]","monthly_(-0.059999999999999984, -0.05499999999999998]","monthly_(-0.05499999999999998, -0.04999999999999999]","monthly_(-0.04999999999999999, -0.044999999999999984]","monthly_(-0.044999999999999984, -0.03999999999999998]","monthly_(-0.03999999999999998, -0.034999999999999976]","monthly_(-0.034999999999999976, -0.02999999999999997]","monthly_(-0.02999999999999997, -0.02499999999999998]","monthly_(-0.02499999999999998, -0.019999999999999976]","monthly_(-0.019999999999999976, -0.014999999999999972]","monthly_(-0.014999999999999972, -0.009999999999999981]","monthly_(-0.009999999999999981, -0.004999999999999977]","monthly_(-0.004999999999999977, 2.7755575615628914e-17]","monthly_(2.7755575615628914e-17, 0.005000000000000032]","monthly_(0.005000000000000032, 0.010000000000000037]","monthly_(0.010000000000000037, 0.015000000000000041]","monthly_(0.015000000000000041, 0.020000000000000018]","monthly_(0.020000000000000018, 
0.025000000000000022]","monthly_(0.025000000000000022, 0.030000000000000027]","monthly_(0.030000000000000027, 0.03500000000000003]","monthly_(0.03500000000000003, 0.040000000000000036]","monthly_(0.040000000000000036, 0.04500000000000004]","monthly_(0.04500000000000004, 0.050000000000000044]","monthly_(0.050000000000000044, 0.05500000000000005]","monthly_(0.05500000000000005, 0.06000000000000005]","monthly_(0.06000000000000005, 0.06500000000000003]","monthly_(0.06500000000000003, 0.07000000000000003]","monthly_(0.07000000000000003, 0.07500000000000004]","monthly_(0.07500000000000004, 0.08000000000000004]","monthly_(0.08000000000000004, 0.08500000000000005]","monthly_(0.08500000000000005, 0.09000000000000005]","monthly_(0.09000000000000005, 0.09500000000000006]","monthly_(0.09500000000000006, 0.10000000000000003]","monthly_(0.10000000000000003, 0.10500000000000004]","monthly_(0.10500000000000004, 0.11000000000000004]","monthly_(0.11000000000000004, 0.11500000000000005]","monthly_(0.11500000000000005, 0.12000000000000005]","monthly_(0.12000000000000005, 0.12500000000000006]","monthly_(0.12500000000000006, 0.13000000000000006]","monthly_(0.13000000000000006, 0.13500000000000006]","monthly_(0.13500000000000006, 0.14000000000000007]","monthly_(0.14000000000000007, 0.14500000000000007]","monthly_(0.14500000000000007, 0.15000000000000008]","monthly_(0.15000000000000008, 0.15500000000000008]","monthly_(0.15500000000000008, 0.16000000000000003]","monthly_(0.16000000000000003, 0.16500000000000004]","monthly_(0.16500000000000004, 0.17000000000000004]","monthly_(0.17000000000000004, 0.17500000000000004]","monthly_(0.17500000000000004, 0.18000000000000005]","monthly_(0.18000000000000005, 0.18500000000000005]","monthly_(0.18500000000000005, 0.19000000000000006]","monthly_(0.19000000000000006, 0.19500000000000006]","monthly_(0.19500000000000006, 0.20000000000000007]","monthly_(0.20000000000000007, 0.20500000000000007]","monthly_(0.20500000000000007, 
0.21000000000000008]","monthly_(0.21000000000000008, 0.21500000000000005]","yearly_(-0.12, -0.11499999999999999]","yearly_(-0.11499999999999999, -0.10999999999999999]","yearly_(-0.10999999999999999, -0.105]","yearly_(-0.105, -0.09999999999999999]","yearly_(-0.09999999999999999, -0.09499999999999999]","yearly_(-0.09499999999999999, -0.09]","yearly_(-0.09, -0.08499999999999999]","yearly_(-0.08499999999999999, -0.07999999999999999]","yearly_(-0.07999999999999999, -0.07499999999999998]","yearly_(-0.07499999999999998, -0.06999999999999998]","yearly_(-0.06999999999999998, -0.06499999999999999]","yearly_(-0.06499999999999999, -0.059999999999999984]","yearly_(-0.059999999999999984, -0.05499999999999998]","yearly_(-0.05499999999999998, -0.04999999999999999]","yearly_(-0.04999999999999999, -0.044999999999999984]","yearly_(-0.044999999999999984, -0.03999999999999998]","yearly_(-0.03999999999999998, -0.034999999999999976]","yearly_(-0.034999999999999976, -0.02999999999999997]","yearly_(-0.02999999999999997, -0.02499999999999998]","yearly_(-0.02499999999999998, -0.019999999999999976]","yearly_(-0.019999999999999976, -0.014999999999999972]","yearly_(-0.014999999999999972, -0.009999999999999981]","yearly_(-0.009999999999999981, -0.004999999999999977]","yearly_(-0.004999999999999977, 2.7755575615628914e-17]","yearly_(2.7755575615628914e-17, 0.005000000000000032]","yearly_(0.005000000000000032, 0.010000000000000037]","yearly_(0.010000000000000037, 0.015000000000000041]","yearly_(0.015000000000000041, 0.020000000000000018]","yearly_(0.020000000000000018, 0.025000000000000022]","yearly_(0.025000000000000022, 0.030000000000000027]","yearly_(0.030000000000000027, 0.03500000000000003]","yearly_(0.03500000000000003, 0.040000000000000036]","yearly_(0.040000000000000036, 0.04500000000000004]","yearly_(0.04500000000000004, 0.050000000000000044]","yearly_(0.050000000000000044, 0.05500000000000005]","yearly_(0.05500000000000005, 0.06000000000000005]","yearly_(0.06000000000000005, 
0.06500000000000003]","yearly_(0.06500000000000003, 0.07000000000000003]","yearly_(0.07000000000000003, 0.07500000000000004]","yearly_(0.07500000000000004, 0.08000000000000004]","yearly_(0.08000000000000004, 0.08500000000000005]","yearly_(0.08500000000000005, 0.09000000000000005]","yearly_(0.09000000000000005, 0.09500000000000006]","yearly_(0.09500000000000006, 0.10000000000000003]","yearly_(0.10000000000000003, 0.10500000000000004]","yearly_(0.10500000000000004, 0.11000000000000004]","yearly_(0.11000000000000004, 0.11500000000000005]","yearly_(0.11500000000000005, 0.12000000000000005]","yearly_(0.12000000000000005, 0.12500000000000006]","yearly_(0.12500000000000006, 0.13000000000000006]","yearly_(0.13000000000000006, 0.13500000000000006]","yearly_(0.13500000000000006, 0.14000000000000007]","yearly_(0.14000000000000007, 0.14500000000000007]","yearly_(0.14500000000000007, 0.15000000000000008]","yearly_(0.15000000000000008, 0.15500000000000008]","yearly_(0.15500000000000008, 0.16000000000000003]","yearly_(0.16000000000000003, 0.16500000000000004]","yearly_(0.16500000000000004, 0.17000000000000004]","yearly_(0.17000000000000004, 0.17500000000000004]","yearly_(0.17500000000000004, 0.18000000000000005]","yearly_(0.18000000000000005, 0.18500000000000005]","yearly_(0.18500000000000005, 0.19000000000000006]","yearly_(0.19000000000000006, 0.19500000000000006]","yearly_(0.19500000000000006, 0.20000000000000007]","yearly_(0.20000000000000007, 0.20500000000000007]","yearly_(0.20500000000000007, 0.21000000000000008]","yearly_(0.21000000000000008, 0.21500000000000005]",pct_change_SMA_50,pct_change_SMA_120,diff_from_SMA_50,diff_from_SMA_120,target
1,1454365,-0.010603,-0.010547,0.001077,-0.00374,0.017976,0.005898,-0.00643,-0.005739,-0.031663,0.006196,0.01445,-0.019514,0.012124,-0.002514,-0.033669,0.006462,-0.004662,0.033103,-0.001614,0.00288,-0.004564,0.006114,-0.019718,0.020351,0.01002,0.007285,-0.007033,0.021165,0.018539,0.017282,0.013362,0.025789,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,2,0,0,2,0,3,5,2,5,4,3,2,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,1,1,2,2,3,2,5,3,9,8,14,15,41,40,37,23,21,8,3,4,3,1,0,2,0,0,0,1,0,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.000815,0.00024,0.003826,0.088502,0.017766
2,1289530,0.001077,0.001077,-0.00374,0.017976,0.005898,-0.00643,-0.005739,-0.031663,0.006196,0.01445,-0.019514,0.012124,-0.002514,-0.033669,0.006462,-0.004662,0.033103,-0.001614,0.00288,-0.004564,0.006114,-0.019718,0.020351,0.01002,0.007285,-0.007033,0.021165,0.018539,0.017282,0.013362,0.025789,-0.056496,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,2,0,0,2,0,3,5,1,6,4,3,2,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,1,1,2,2,3,2,5,3,9,8,14,15,41,40,37,23,21,8,3,4,3,1,0,2,0,0,0,1,0,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.001066,0.000282,0.015195,0.09985,-0.010547
3,1577826,-0.003747,-0.00374,0.017976,0.005898,-0.00643,-0.005739,-0.031663,0.006196,0.01445,-0.019514,0.012124,-0.002514,-0.033669,0.006462,-0.004662,0.033103,-0.001614,0.00288,-0.004564,0.006114,-0.019718,0.020351,0.01002,0.007285,-0.007033,0.021165,0.018539,0.017282,0.013362,0.025789,-0.056496,0.006536,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,2,0,1,2,0,3,4,1,6,4,3,2,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,1,1,2,2,3,2,5,3,9,8,14,15,40,40,37,23,22,8,3,4,3,1,0,2,0,0,0,1,0,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.001217,0.000311,0.015028,0.098461,0.001077
4,1995368,0.017817,0.017976,0.005898,-0.00643,-0.005739,-0.031663,0.006196,0.01445,-0.019514,0.012124,-0.002514,-0.033669,0.006462,-0.004662,0.033103,-0.001614,0.00288,-0.004564,0.006114,-0.019718,0.020351,0.01002,0.007285,-0.007033,0.021165,0.018539,0.017282,0.013362,0.025789,-0.056496,0.006536,-0.022152,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,2,0,1,2,0,3,4,1,7,4,2,2,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,1,1,2,2,3,2,6,3,9,8,14,15,40,40,37,23,21,8,3,4,3,1,0,2,0,0,0,1,0,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.001276,0.000336,0.019915,0.102414,-0.00374
5,1979957,0.00588,0.005898,-0.00643,-0.005739,-0.031663,0.006196,0.01445,-0.019514,0.012124,-0.002514,-0.033669,0.006462,-0.004662,0.033103,-0.001614,0.00288,-0.004564,0.006114,-0.019718,0.020351,0.01002,0.007285,-0.007033,0.021165,0.018539,0.017282,0.013362,0.025789,-0.056496,0.006536,-0.022152,0.008644,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,2,0,1,2,0,3,4,1,6,5,2,2,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,1,1,2,2,3,2,6,3,9,8,15,15,40,40,36,23,21,8,3,4,3,1,0,2,0,0,0,1,0,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.000928,0.00022,0.003026,0.08281,0.017976
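To see why `shift(1)` produces a next-day target here, the small sketch below reuses the first three `date` and `pct_change` values from the output above. Because the rows run newest-first, each row receives the return of the row above it, which is the following trading day; the newest row has no following day and becomes `NaN`, which is why `dropna` removes it. The frame name `demo` is made up for this illustration.

```python
import pandas as pd

# First three rows of the data, newest first (values copied from the output above).
demo = pd.DataFrame({
    "date": ["2020-08-12", "2020-08-11", "2020-08-10"],
    "pct_change": [0.017766, -0.010547, 0.001077],
})

# shift(1) moves every value down one row, so each row is paired
# with the pct_change of the next (more recent) trading day.
demo["target"] = demo["pct_change"].shift(1)
print(demo)
```

The 2020-08-11 row ends up with a target of 0.017766, the return on 2020-08-12, matching the `target` column shown above.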


In [6]:
X.shape, y.shape

((3903, 171), (3903,))
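With 171 features, it may be worth checking for columns that never vary before fitting. Many of the `monthly_` and `yearly_` bin counts are zero in every row displayed above; if any are zero across all 3903 rows, they carry no information and their coefficients cannot be estimated meaningfully. The sketch below shows one way to list such columns. The helper `constant_columns` and the toy frame `demo_X` with its `empty_bin` and `used_bin` columns are assumptions for illustration, not part of the notebook.

```python
import pandas as pd


def constant_columns(frame):
    """Return the names of columns that hold a single distinct value."""
    return [c for c in frame.columns if frame[c].nunique(dropna=False) <= 1]


# Toy frame: one varying feature, one all-zero bin, one populated bin.
demo_X = pd.DataFrame({
    "volume": [1454365, 1289530, 1577826],
    "empty_bin": [0, 0, 0],   # hypothetical bin that is never populated
    "used_bin": [2, 1, 3],
})

print(constant_columns(demo_X))
```

On the real data, the same call would be `constant_columns(X)`, run before `sm.add_constant(X)` so that the intercept column is not reported.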

In [7]:
X = sm.add_constant(X)
X

Unnamed: 0,const,volume,log_return,pct_return_0,pct_return_1,pct_return_2,pct_return_3,pct_return_4,pct_return_5,pct_return_6,pct_return_7,pct_return_8,pct_return_9,pct_return_10,pct_return_11,pct_return_12,pct_return_13,pct_return_14,pct_return_15,pct_return_16,pct_return_17,pct_return_18,pct_return_19,pct_return_20,pct_return_21,pct_return_22,pct_return_23,pct_return_24,pct_return_25,pct_return_26,pct_return_27,pct_return_28,pct_return_29,pct_return_30,"monthly_(-0.12, -0.11499999999999999]","monthly_(-0.11499999999999999, -0.10999999999999999]","monthly_(-0.10999999999999999, -0.105]","monthly_(-0.105, -0.09999999999999999]","monthly_(-0.09999999999999999, -0.09499999999999999]","monthly_(-0.09499999999999999, -0.09]","monthly_(-0.09, -0.08499999999999999]","monthly_(-0.08499999999999999, -0.07999999999999999]","monthly_(-0.07999999999999999, -0.07499999999999998]","monthly_(-0.07499999999999998, -0.06999999999999998]","monthly_(-0.06999999999999998, -0.06499999999999999]","monthly_(-0.06499999999999999, -0.059999999999999984]","monthly_(-0.059999999999999984, -0.05499999999999998]","monthly_(-0.05499999999999998, -0.04999999999999999]","monthly_(-0.04999999999999999, -0.044999999999999984]","monthly_(-0.044999999999999984, -0.03999999999999998]","monthly_(-0.03999999999999998, -0.034999999999999976]","monthly_(-0.034999999999999976, -0.02999999999999997]","monthly_(-0.02999999999999997, -0.02499999999999998]","monthly_(-0.02499999999999998, -0.019999999999999976]","monthly_(-0.019999999999999976, -0.014999999999999972]","monthly_(-0.014999999999999972, -0.009999999999999981]","monthly_(-0.009999999999999981, -0.004999999999999977]","monthly_(-0.004999999999999977, 2.7755575615628914e-17]","monthly_(2.7755575615628914e-17, 0.005000000000000032]","monthly_(0.005000000000000032, 0.010000000000000037]","monthly_(0.010000000000000037, 0.015000000000000041]","monthly_(0.015000000000000041, 0.020000000000000018]","monthly_(0.020000000000000018, 
0.025000000000000022]","monthly_(0.025000000000000022, 0.030000000000000027]","monthly_(0.030000000000000027, 0.03500000000000003]","monthly_(0.03500000000000003, 0.040000000000000036]","monthly_(0.040000000000000036, 0.04500000000000004]","monthly_(0.04500000000000004, 0.050000000000000044]","monthly_(0.050000000000000044, 0.05500000000000005]","monthly_(0.05500000000000005, 0.06000000000000005]","monthly_(0.06000000000000005, 0.06500000000000003]","monthly_(0.06500000000000003, 0.07000000000000003]","monthly_(0.07000000000000003, 0.07500000000000004]","monthly_(0.07500000000000004, 0.08000000000000004]","monthly_(0.08000000000000004, 0.08500000000000005]","monthly_(0.08500000000000005, 0.09000000000000005]","monthly_(0.09000000000000005, 0.09500000000000006]","monthly_(0.09500000000000006, 0.10000000000000003]","monthly_(0.10000000000000003, 0.10500000000000004]","monthly_(0.10500000000000004, 0.11000000000000004]","monthly_(0.11000000000000004, 0.11500000000000005]","monthly_(0.11500000000000005, 0.12000000000000005]","monthly_(0.12000000000000005, 0.12500000000000006]","monthly_(0.12500000000000006, 0.13000000000000006]","monthly_(0.13000000000000006, 0.13500000000000006]","monthly_(0.13500000000000006, 0.14000000000000007]","monthly_(0.14000000000000007, 0.14500000000000007]","monthly_(0.14500000000000007, 0.15000000000000008]","monthly_(0.15000000000000008, 0.15500000000000008]","monthly_(0.15500000000000008, 0.16000000000000003]","monthly_(0.16000000000000003, 0.16500000000000004]","monthly_(0.16500000000000004, 0.17000000000000004]","monthly_(0.17000000000000004, 0.17500000000000004]","monthly_(0.17500000000000004, 0.18000000000000005]","monthly_(0.18000000000000005, 0.18500000000000005]","monthly_(0.18500000000000005, 0.19000000000000006]","monthly_(0.19000000000000006, 0.19500000000000006]","monthly_(0.19500000000000006, 0.20000000000000007]","monthly_(0.20000000000000007, 0.20500000000000007]","monthly_(0.20500000000000007, 
0.21000000000000008]","monthly_(0.21000000000000008, 0.21500000000000005]","yearly_(-0.12, -0.11499999999999999]","yearly_(-0.11499999999999999, -0.10999999999999999]","yearly_(-0.10999999999999999, -0.105]","yearly_(-0.105, -0.09999999999999999]","yearly_(-0.09999999999999999, -0.09499999999999999]","yearly_(-0.09499999999999999, -0.09]","yearly_(-0.09, -0.08499999999999999]","yearly_(-0.08499999999999999, -0.07999999999999999]","yearly_(-0.07999999999999999, -0.07499999999999998]","yearly_(-0.07499999999999998, -0.06999999999999998]","yearly_(-0.06999999999999998, -0.06499999999999999]","yearly_(-0.06499999999999999, -0.059999999999999984]","yearly_(-0.059999999999999984, -0.05499999999999998]","yearly_(-0.05499999999999998, -0.04999999999999999]","yearly_(-0.04999999999999999, -0.044999999999999984]","yearly_(-0.044999999999999984, -0.03999999999999998]","yearly_(-0.03999999999999998, -0.034999999999999976]","yearly_(-0.034999999999999976, -0.02999999999999997]","yearly_(-0.02999999999999997, -0.02499999999999998]","yearly_(-0.02499999999999998, -0.019999999999999976]","yearly_(-0.019999999999999976, -0.014999999999999972]","yearly_(-0.014999999999999972, -0.009999999999999981]","yearly_(-0.009999999999999981, -0.004999999999999977]","yearly_(-0.004999999999999977, 2.7755575615628914e-17]","yearly_(2.7755575615628914e-17, 0.005000000000000032]","yearly_(0.005000000000000032, 0.010000000000000037]","yearly_(0.010000000000000037, 0.015000000000000041]","yearly_(0.015000000000000041, 0.020000000000000018]","yearly_(0.020000000000000018, 0.025000000000000022]","yearly_(0.025000000000000022, 0.030000000000000027]","yearly_(0.030000000000000027, 0.03500000000000003]","yearly_(0.03500000000000003, 0.040000000000000036]","yearly_(0.040000000000000036, 0.04500000000000004]","yearly_(0.04500000000000004, 0.050000000000000044]","yearly_(0.050000000000000044, 0.05500000000000005]","yearly_(0.05500000000000005, 0.06000000000000005]","yearly_(0.06000000000000005, 
0.06500000000000003]","yearly_(0.06500000000000003, 0.07000000000000003]","yearly_(0.07000000000000003, 0.07500000000000004]","yearly_(0.07500000000000004, 0.08000000000000004]","yearly_(0.08000000000000004, 0.08500000000000005]","yearly_(0.08500000000000005, 0.09000000000000005]","yearly_(0.09000000000000005, 0.09500000000000006]","yearly_(0.09500000000000006, 0.10000000000000003]","yearly_(0.10000000000000003, 0.10500000000000004]","yearly_(0.10500000000000004, 0.11000000000000004]","yearly_(0.11000000000000004, 0.11500000000000005]","yearly_(0.11500000000000005, 0.12000000000000005]","yearly_(0.12000000000000005, 0.12500000000000006]","yearly_(0.12500000000000006, 0.13000000000000006]","yearly_(0.13000000000000006, 0.13500000000000006]","yearly_(0.13500000000000006, 0.14000000000000007]","yearly_(0.14000000000000007, 0.14500000000000007]","yearly_(0.14500000000000007, 0.15000000000000008]","yearly_(0.15000000000000008, 0.15500000000000008]","yearly_(0.15500000000000008, 0.16000000000000003]","yearly_(0.16000000000000003, 0.16500000000000004]","yearly_(0.16500000000000004, 0.17000000000000004]","yearly_(0.17000000000000004, 0.17500000000000004]","yearly_(0.17500000000000004, 0.18000000000000005]","yearly_(0.18000000000000005, 0.18500000000000005]","yearly_(0.18500000000000005, 0.19000000000000006]","yearly_(0.19000000000000006, 0.19500000000000006]","yearly_(0.19500000000000006, 0.20000000000000007]","yearly_(0.20000000000000007, 0.20500000000000007]","yearly_(0.20500000000000007, 0.21000000000000008]","yearly_(0.21000000000000008, 0.21500000000000005]",pct_change_SMA_50,pct_change_SMA_120,diff_from_SMA_50,diff_from_SMA_120
1,1.0,1454365,-0.010603,-0.010547,0.001077,-0.003740,0.017976,0.005898,-0.006430,-0.005739,-0.031663,0.006196,0.014450,-0.019514,0.012124,-0.002514,-0.033669,0.006462,-0.004662,0.033103,-0.001614,0.002880,-0.004564,0.006114,-0.019718,0.020351,0.010020,0.007285,-0.007033,0.021165,0.018539,0.017282,0.013362,0.025789,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,2,0,0,2,0,3,5,2,5,4,3,2,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,1,1,2,2,3,2,5,3,9,8,14,15,41,40,37,23,21,8,3,4,3,1,0,2,0,0,0,1,0,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.000815,0.000240,0.003826,0.088502
2,1.0,1289530,0.001077,0.001077,-0.003740,0.017976,0.005898,-0.006430,-0.005739,-0.031663,0.006196,0.014450,-0.019514,0.012124,-0.002514,-0.033669,0.006462,-0.004662,0.033103,-0.001614,0.002880,-0.004564,0.006114,-0.019718,0.020351,0.010020,0.007285,-0.007033,0.021165,0.018539,0.017282,0.013362,0.025789,-0.056496,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,2,0,0,2,0,3,5,1,6,4,3,2,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,1,1,2,2,3,2,5,3,9,8,14,15,41,40,37,23,21,8,3,4,3,1,0,2,0,0,0,1,0,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.001066,0.000282,0.015195,0.099850
3,1.0,1577826,-0.003747,-0.003740,0.017976,0.005898,-0.006430,-0.005739,-0.031663,0.006196,0.014450,-0.019514,0.012124,-0.002514,-0.033669,0.006462,-0.004662,0.033103,-0.001614,0.002880,-0.004564,0.006114,-0.019718,0.020351,0.010020,0.007285,-0.007033,0.021165,0.018539,0.017282,0.013362,0.025789,-0.056496,0.006536,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,2,0,1,2,0,3,4,1,6,4,3,2,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,1,1,2,2,3,2,5,3,9,8,14,15,40,40,37,23,22,8,3,4,3,1,0,2,0,0,0,1,0,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.001217,0.000311,0.015028,0.098461
4,1.0,1995368,0.017817,0.017976,0.005898,-0.006430,-0.005739,-0.031663,0.006196,0.014450,-0.019514,0.012124,-0.002514,-0.033669,0.006462,-0.004662,0.033103,-0.001614,0.002880,-0.004564,0.006114,-0.019718,0.020351,0.010020,0.007285,-0.007033,0.021165,0.018539,0.017282,0.013362,0.025789,-0.056496,0.006536,-0.022152,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,2,0,1,2,0,3,4,1,7,4,2,2,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,1,1,2,2,3,2,6,3,9,8,14,15,40,40,37,23,21,8,3,4,3,1,0,2,0,0,0,1,0,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.001276,0.000336,0.019915,0.102414
5,1.0,1979957,0.005880,0.005898,-0.006430,-0.005739,-0.031663,0.006196,0.014450,-0.019514,0.012124,-0.002514,-0.033669,0.006462,-0.004662,0.033103,-0.001614,0.002880,-0.004564,0.006114,-0.019718,0.020351,0.010020,0.007285,-0.007033,0.021165,0.018539,0.017282,0.013362,0.025789,-0.056496,0.006536,-0.022152,0.008644,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,2,0,1,2,0,3,4,1,6,5,2,2,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,1,1,2,2,3,2,6,3,9,8,15,15,40,40,36,23,21,8,3,4,3,1,0,2,0,0,0,1,0,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.000928,0.000220,0.003026,0.082810
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
3899,1.0,51606517,0.011540,0.011607,0.029829,-0.003085,-0.018791,-0.035542,0.013314,-0.040761,-0.030826,0.023791,0.073267,-0.019017,0.027740,0.012016,-0.006130,0.068428,-0.019920,-0.040153,-0.029084,-0.017131,-0.032369,0.019653,0.023755,-0.000256,0.009507,-0.007793,0.006242,0.028109,-0.025632,-0.005090,-0.040501,0.051455,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,1,2,2,1,4,0,3,2,0,2,2,1,2,3,0,0,0,0,1,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,1,0,4,3,2,6,4,8,7,12,5,6,9,10,9,11,8,2,3,1,1,1,0,1,1,2,1,0,1,1,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0.001938,0.005594,0.029799,0.185592
3900,1.0,77185599,0.029393,0.029829,-0.003085,-0.018791,-0.035542,0.013314,-0.040761,-0.030826,0.023791,0.073267,-0.019017,0.027740,0.012016,-0.006130,0.068428,-0.019920,-0.040153,-0.029084,-0.017131,-0.032369,0.019653,0.023755,-0.000256,0.009507,-0.007793,0.006242,0.028109,-0.025632,-0.005090,-0.040501,0.051455,-0.024342,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,1,2,2,1,4,0,3,2,0,2,2,1,3,2,0,0,0,0,1,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,1,0,4,3,2,6,4,8,7,12,5,6,9,10,9,11,7,2,3,1,1,1,0,1,1,2,1,0,1,1,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0.001817,0.005587,0.019579,0.177306
3901,1.0,26252815,-0.003090,-0.003085,-0.018791,-0.035542,0.013314,-0.040761,-0.030826,0.023791,0.073267,-0.019017,0.027740,0.012016,-0.006130,0.068428,-0.019920,-0.040153,-0.029084,-0.017131,-0.032369,0.019653,0.023755,-0.000256,0.009507,-0.007793,0.006242,0.028109,-0.025632,-0.005090,-0.040501,0.051455,-0.024342,0.024365,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,1,2,2,1,4,0,3,1,1,2,2,1,3,2,0,0,0,0,1,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,1,0,4,3,2,6,4,8,7,12,4,6,9,10,9,11,7,2,3,1,1,1,0,1,1,2,1,0,1,1,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0.001159,0.004993,-0.008530,0.148349
3902,1.0,37995525,-0.018970,-0.018791,-0.035542,0.013314,-0.040761,-0.030826,0.023791,0.073267,-0.019017,0.027740,0.012016,-0.006130,0.068428,-0.019920,-0.040153,-0.029084,-0.017131,-0.032369,0.019653,0.023755,-0.000256,0.009507,-0.007793,0.006242,0.028109,-0.025632,-0.005090,-0.040501,0.051455,-0.024342,0.024365,0.000726,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,1,2,2,1,3,0,3,1,2,2,2,1,3,2,0,0,0,0,1,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,1,0,4,3,2,6,4,7,7,12,4,6,9,10,9,11,7,2,3,1,1,1,0,1,1,2,1,0,1,1,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0.000998,0.005103,-0.004677,0.156509


In [8]:
model = sm.OLS(y, X)
results = model.fit()
print(results.summary())

                            OLS Regression Results                            
Dep. Variable:                 target   R-squared:                       0.074
Model:                            OLS   Adj. R-squared:                  0.040
Method:                 Least Squares   F-statistic:                     2.178
Date:                Fri, 14 Aug 2020   Prob (F-statistic):           1.94e-13
Time:                        17:25:24   Log-Likelihood:                 10143.
No. Observations:                3903   AIC:                        -2.001e+04
Df Residuals:                    3763   BIC:                        -1.913e+04
Df Model:                         139                                         
Covariance Type:            nonrobust                                         
                                                              coef    std err          t      P>|t|      [0.025      0.975]
--------------------------------------------------------------------------------------



In [11]:
# Mask is True where p > 0.05, i.e. the coefficient is NOT significant at the 5% level
p_coefs = results.pvalues > 0.05

In [14]:
X.columns[p_coefs].shape

(106,)
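A minimal sketch of how this comparison behaves, using a small made-up Series of p-values (the column names and numbers are hypothetical, not taken from the fit above). It illustrates that `> 0.05` selects the coefficients that are not significant, `< 0.05` selects the significant ones, and a NaN p-value (which can arise, for example, when a standard error is undefined) lands in neither mask.

```python
import numpy as np
import pandas as pd

# Hypothetical p-values for illustration only; not results from the model above.
pvalues = pd.Series(
    {"const": 0.40, "feat_a": 0.001, "feat_b": 0.20, "feat_c": np.nan}
)

not_significant = pvalues > 0.05  # True where p exceeds the 5% threshold
significant = pvalues < 0.05      # True where p is below the 5% threshold

# Comparisons against NaN evaluate to False, so feat_c is in neither mask
# and has to be found explicitly.
undefined = pvalues.isna()

print(list(pvalues.index[not_significant]))
print(list(pvalues.index[significant]))
print(list(pvalues.index[undefined]))
```

Because NaN falls out of both masks, the counts of "significant" and "not significant" columns need not add up to the total number of columns.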

In [15]:
X[X.columns[p_coefs]]

Unnamed: 0,const,volume,log_return,pct_return_0,pct_return_26,pct_return_27,pct_return_30,"monthly_(-0.12, -0.11499999999999999]","monthly_(-0.11499999999999999, -0.10999999999999999]","monthly_(-0.10999999999999999, -0.105]","monthly_(-0.105, -0.09999999999999999]","monthly_(-0.09999999999999999, -0.09499999999999999]","monthly_(-0.09499999999999999, -0.09]","monthly_(-0.09, -0.08499999999999999]","monthly_(-0.06499999999999999, -0.059999999999999984]","monthly_(-0.059999999999999984, -0.05499999999999998]","monthly_(-0.05499999999999998, -0.04999999999999999]","monthly_(-0.04999999999999999, -0.044999999999999984]","monthly_(-0.044999999999999984, -0.03999999999999998]","monthly_(-0.034999999999999976, -0.02999999999999997]","monthly_(-0.02999999999999997, -0.02499999999999998]","monthly_(-0.02499999999999998, -0.019999999999999976]","monthly_(-0.009999999999999981, -0.004999999999999977]","monthly_(0.010000000000000037, 0.015000000000000041]","monthly_(0.015000000000000041, 0.020000000000000018]","monthly_(0.020000000000000018, 0.025000000000000022]","monthly_(0.025000000000000022, 0.030000000000000027]","monthly_(0.03500000000000003, 0.040000000000000036]","monthly_(0.040000000000000036, 0.04500000000000004]","monthly_(0.04500000000000004, 0.050000000000000044]","monthly_(0.050000000000000044, 0.05500000000000005]","monthly_(0.05500000000000005, 0.06000000000000005]","monthly_(0.06000000000000005, 0.06500000000000003]","monthly_(0.06500000000000003, 0.07000000000000003]","monthly_(0.07500000000000004, 0.08000000000000004]","monthly_(0.08000000000000004, 0.08500000000000005]","monthly_(0.08500000000000005, 0.09000000000000005]","monthly_(0.09000000000000005, 0.09500000000000006]","monthly_(0.09500000000000006, 0.10000000000000003]","monthly_(0.10000000000000003, 0.10500000000000004]","monthly_(0.10500000000000004, 0.11000000000000004]","monthly_(0.11000000000000004, 0.11500000000000005]","monthly_(0.11500000000000005, 
0.12000000000000005]","monthly_(0.12000000000000005, 0.12500000000000006]","monthly_(0.12500000000000006, 0.13000000000000006]","monthly_(0.13000000000000006, 0.13500000000000006]","monthly_(0.13500000000000006, 0.14000000000000007]","monthly_(0.14000000000000007, 0.14500000000000007]","monthly_(0.14500000000000007, 0.15000000000000008]","monthly_(0.15000000000000008, 0.15500000000000008]","monthly_(0.15500000000000008, 0.16000000000000003]","monthly_(0.16000000000000003, 0.16500000000000004]","monthly_(0.16500000000000004, 0.17000000000000004]","monthly_(0.17000000000000004, 0.17500000000000004]","monthly_(0.17500000000000004, 0.18000000000000005]","monthly_(0.18000000000000005, 0.18500000000000005]","monthly_(0.18500000000000005, 0.19000000000000006]","monthly_(0.19000000000000006, 0.19500000000000006]","monthly_(0.19500000000000006, 0.20000000000000007]","monthly_(0.20000000000000007, 0.20500000000000007]","monthly_(0.20500000000000007, 0.21000000000000008]","monthly_(0.21000000000000008, 0.21500000000000005]","yearly_(-0.12, -0.11499999999999999]","yearly_(-0.11499999999999999, -0.10999999999999999]","yearly_(-0.10999999999999999, -0.105]","yearly_(-0.105, -0.09999999999999999]","yearly_(-0.09999999999999999, -0.09499999999999999]","yearly_(-0.09499999999999999, -0.09]","yearly_(-0.09, -0.08499999999999999]","yearly_(-0.08499999999999999, -0.07999999999999999]","yearly_(-0.07999999999999999, -0.07499999999999998]","yearly_(-0.07499999999999998, -0.06999999999999998]","yearly_(-0.06999999999999998, -0.06499999999999999]","yearly_(-0.06499999999999999, -0.059999999999999984]","yearly_(-0.059999999999999984, -0.05499999999999998]","yearly_(-0.05499999999999998, -0.04999999999999999]","yearly_(-0.04999999999999999, -0.044999999999999984]","yearly_(-0.044999999999999984, -0.03999999999999998]","yearly_(-0.034999999999999976, -0.02999999999999997]","yearly_(-0.02999999999999997, -0.02499999999999998]","yearly_(-0.02499999999999998, 
-0.019999999999999976]","yearly_(-0.019999999999999976, -0.014999999999999972]","yearly_(-0.004999999999999977, 2.7755575615628914e-17]","yearly_(0.005000000000000032, 0.010000000000000037]","yearly_(0.010000000000000037, 0.015000000000000041]","yearly_(0.015000000000000041, 0.020000000000000018]","yearly_(0.040000000000000036, 0.04500000000000004]","yearly_(0.04500000000000004, 0.050000000000000044]","yearly_(0.050000000000000044, 0.05500000000000005]","yearly_(0.05500000000000005, 0.06000000000000005]","yearly_(0.06000000000000005, 0.06500000000000003]","yearly_(0.06500000000000003, 0.07000000000000003]","yearly_(0.07500000000000004, 0.08000000000000004]","yearly_(0.08000000000000004, 0.08500000000000005]","yearly_(0.08500000000000005, 0.09000000000000005]","yearly_(0.09000000000000005, 0.09500000000000006]","yearly_(0.09500000000000006, 0.10000000000000003]","yearly_(0.10000000000000003, 0.10500000000000004]","yearly_(0.12000000000000005, 0.12500000000000006]","yearly_(0.13500000000000006, 0.14000000000000007]","yearly_(0.14500000000000007, 0.15000000000000008]","yearly_(0.15000000000000008, 0.15500000000000008]","yearly_(0.16000000000000003, 0.16500000000000004]","yearly_(0.19500000000000006, 0.20000000000000007]",pct_change_SMA_50,pct_change_SMA_120
1,1.0,1454365,-0.010603,-0.010547,0.021165,0.018539,0.025789,0,0,0,0,0,0,0,0,1,0,0,0,2,0,0,3,4,3,2,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,1,1,2,2,3,5,3,9,8,41,37,23,21,1,0,2,0,0,0,0,1,1,1,0,0,0,0,0,0,0,0,0.000815,0.000240
2,1.0,1289530,0.001077,0.001077,0.018539,0.017282,-0.056496,0,0,0,0,0,0,0,0,1,0,0,0,2,0,0,3,4,3,2,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,1,1,2,2,3,5,3,9,8,41,37,23,21,1,0,2,0,0,0,0,1,1,1,0,0,0,0,0,0,0,0,0.001066,0.000282
3,1.0,1577826,-0.003747,-0.003740,0.017282,0.013362,0.006536,0,0,0,0,0,0,0,0,1,0,0,0,2,0,1,3,4,3,2,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,1,1,2,2,3,5,3,9,8,40,37,23,22,1,0,2,0,0,0,0,1,1,1,0,0,0,0,0,0,0,0,0.001217,0.000311
4,1.0,1995368,0.017817,0.017976,0.013362,0.025789,-0.022152,0,0,0,0,0,0,0,0,1,0,0,0,2,0,1,3,4,2,2,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,1,1,2,2,3,6,3,9,8,40,37,23,21,1,0,2,0,0,0,0,1,1,1,0,0,0,0,0,0,0,0,0.001276,0.000336
5,1.0,1979957,0.005880,0.005898,0.025789,-0.056496,0.008644,0,0,0,0,0,0,0,0,1,0,0,0,2,0,1,3,5,2,2,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,1,1,2,2,3,6,3,9,8,40,36,23,21,1,0,2,0,0,0,0,1,1,1,0,0,0,0,0,0,0,0,0.000928,0.000220
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
3899,1.0,51606517,0.011540,0.011607,0.028109,-0.025632,0.051455,0,0,0,0,0,0,0,0,0,0,0,3,2,2,1,3,2,1,2,3,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,1,0,4,2,6,4,8,5,9,10,9,1,1,1,0,1,1,1,0,1,1,0,0,0,0,0,1,0,0,0.001938,0.005594
3900,1.0,77185599,0.029393,0.029829,-0.025632,-0.005090,-0.024342,0,0,0,0,0,0,0,0,0,0,0,3,2,2,1,3,2,1,3,2,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,1,0,4,2,6,4,8,5,9,10,9,1,1,1,0,1,1,1,0,1,1,0,0,0,0,0,1,0,0,0.001817,0.005587
3901,1.0,26252815,-0.003090,-0.003085,-0.005090,-0.040501,0.024365,0,0,0,0,0,0,0,0,0,0,0,3,2,2,1,3,2,1,3,2,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,1,0,4,2,6,4,8,4,9,10,9,1,1,1,0,1,1,1,0,1,1,0,0,0,0,0,1,0,0,0.001159,0.004993
3902,1.0,37995525,-0.018970,-0.018791,-0.040501,0.051455,0.000726,0,0,0,0,0,0,0,0,0,0,0,3,2,2,1,3,2,1,3,2,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,1,0,4,2,6,4,7,4,9,10,9,1,1,1,0,1,1,1,0,1,1,0,0,0,0,0,1,0,0,0.000998,0.005103


In [16]:
mod2 = sm.OLS(y, X[X.columns[p_coefs]])
res2 = mod2.fit()
print(res2.summary())

                            OLS Regression Results                            
Dep. Variable:                 target   R-squared:                       0.035
Model:                            OLS   Adj. R-squared:                  0.013
Method:                 Least Squares   F-statistic:                     1.594
Date:                Fri, 14 Aug 2020   Prob (F-statistic):           0.000439
Time:                        17:30:13   Log-Likelihood:                 10062.
No. Observations:                3903   AIC:                        -1.995e+04
Df Residuals:                    3815   BIC:                        -1.940e+04
Df Model:                          87                                         
Covariance Type:            nonrobust                                         
                                                             coef    std err          t      P>|t|      [0.025      0.975]
---------------------------------------------------------------------------------------
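As a cross-check on the two summaries, adjusted R-squared can be recomputed from the printed R-squared, the number of observations, and the residual degrees of freedom. The sketch below assumes the usual formula for a model with an intercept, 1 - (1 - R²)(n - 1) / df_resid; the helper name `adjusted_r2` is hypothetical.

```python
def adjusted_r2(r2, n_obs, df_resid):
    """Adjusted R-squared for a model that includes an intercept."""
    return 1.0 - (1.0 - r2) * (n_obs - 1) / df_resid


# Inputs are the rounded values printed in the two summaries above.
full = adjusted_r2(r2=0.074, n_obs=3903, df_resid=3763)
filtered = adjusted_r2(r2=0.035, n_obs=3903, df_resid=3815)

print(round(full, 3), round(filtered, 3))
```

Both results agree with the printed adjusted values (0.040 and 0.013) to rounding, so the drop between the two fits is not an artifact of the different number of parameters.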

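Filtering on p-values seems to have a null or negative effect on the result: R-squared drops from 0.074 to 0.035 and adjusted R-squared from 0.040 to 0.013.

Note that the mask `results.pvalues > 0.05` keeps the 106 columns whose p-values exceed 0.05, so the second model was fit on the coefficients that are not significant at the 5% level rather than on the significant ones. Many of the distribution features created so far appear in that list, meaning their coefficients cannot be distinguished from zero at this threshold. Keeping the significant columns instead would require `results.pvalues < 0.05`. Even the full model explains only about 7% of the variance, so we are still not able to predict much more than the mean.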
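To make the comparison with the mean concrete, the following sketch uses synthetic data (not the GOOG dataset) in which the target is mostly noise. R-squared is defined relative to a model that always predicts the mean of `y`, so a value near zero means the regression barely improves on that baseline. All names and the signal strength here are assumptions chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 2000

# Synthetic design: an intercept plus five features, only one weakly informative.
features = rng.normal(size=(n, 5))
X = np.column_stack([np.ones(n), features])
y = 0.1 * features[:, 0] + rng.normal(size=n)

# Ordinary least squares fit via numpy.
beta, *_ = np.linalg.lstsq(X, y, rcond=None)
pred = X @ beta

ss_res = np.sum((y - pred) ** 2)      # squared error of the regression
ss_tot = np.sum((y - y.mean()) ** 2)  # squared error of always predicting the mean
r2 = 1.0 - ss_res / ss_tot

# Fraction of the mean-baseline RMSE that the regression removes.
rmse_gain = 1.0 - np.sqrt(ss_res / ss_tot)

print(f"R-squared: {r2:.3f}")
print(f"RMSE reduction vs. mean baseline: {rmse_gain:.1%}")
```

Applying the same relation to the R-squared of 0.074 reported for the full model gives 1 - sqrt(1 - 0.074), or roughly a 4% reduction in in-sample RMSE relative to predicting the mean.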