# Deep Learning (NN) algorithms applied to Time Series

Several model families are suited to time series. What sets them apart is that they do not treat observations as independent events: they keep a "memory" of earlier events to better analyze a given instant T.

This is particularly useful for detecting longer-term trend patterns. The main models are:
- RNN  : Recurrent Neural Network
- LSTM : Long Short-Term Memory
- GRU  : Gated Recurrent Unit

# Multi-input combination

We saw previously that GRU and LSTM networks gave the least bad results (still insufficient). Both use time-interval windows to predict an instant T from several past observations: GRU tends to do better on large windows, LSTM on somewhat shorter ones.

Technical analysis often uses several window lengths (number of past observations) at the same time. That is what we try to reproduce here with networks that combine several inputs.

The two elements we want to integrate:
- The raw information of the current observation. It gets diluted among the other observations of the window, so we repeat it in a dedicated input to keep it "preserved"/untransformed.
- Several layers (LSTM/GRU) used in parallel at the input, each pre-analyzing the windowed data over a different time interval.


#### First of all, set the random seeds in order to have comparable results

In [1]:
from numpy.random import seed
seed(1)
import tensorflow as tf
tf.random.set_seed(2)
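
The spot checks later in this notebook draw row positions with Python's `random.sample`, which is not covered by the NumPy or TensorFlow seeds. A minimal sketch, assuming a hypothetical helper `set_all_seeds` (not part of the original notebook), of seeding the standard-library and NumPy generators together:

```python
import random
import numpy as np

def set_all_seeds(seed_value: int) -> None:
    """Hypothetical helper: seed Python's and NumPy's global generators."""
    random.seed(seed_value)
    np.random.seed(seed_value)

# Re-seeding with the same value reproduces the same draw
set_all_seeds(1)
first_draw = random.sample(range(100), 5)
set_all_seeds(1)
second_draw = random.sample(range(100), 5)
```

TensorFlow keeps its own generator, so `tf.random.set_seed` is still needed alongside this helper.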

## Input parameters

To be reviewed: adapt before the first launch.

In [2]:
modelName = 'NN_TS_TFTS_TRANSFORMER_01'

In [3]:
pathModelWeights = 'weights/' + modelName + '_WEIGHTS.h5'
pathModel = 'model/' + modelName + '_MODEL.h5'

## Building the datasets

We build several datasets of different depth (number of variables), mainly to compare the impact of the indicators on the quality of the result.

In [4]:
# pip install psycopg2-binary

In [5]:
import time
import numpy as np
import random
import seaborn as sns
import matplotlib.pyplot as plt
import pandas as pd
import psycopg2
from sqlalchemy import create_engine
import os.path

In [6]:
import warnings
warnings.filterwarnings('ignore')

In [7]:
# pip install attention

In [8]:
from sklearn.model_selection import train_test_split, ShuffleSplit
from sklearn.metrics import *
from sklearn.preprocessing import StandardScaler
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, Dropout, Activation, Convolution1D, MaxPooling1D, Flatten
from tensorflow.keras.layers import LSTM, GRU, TimeDistributed, Conv1D, ConvLSTM2D, BatchNormalization
from attention import Attention
from tensorflow.keras.callbacks import EarlyStopping
from tensorflow.keras import Input, Model, layers
from tensorflow.keras import backend as K

from scikeras.wrappers import KerasClassifier
from sklearn.model_selection import cross_val_score
from sklearn.preprocessing import LabelEncoder
from sklearn.model_selection import StratifiedKFold


In [9]:
#pip install nbimporter

In [10]:
import nbimporter

from tfts.models.transformer import Transformer 

### Datasets : EURUSD H1

In [11]:
conn_string = 'postgresql://postgres:Juw51000@localhost/tradingIA'

db = create_engine(conn_string)
conn = db.connect()

In [12]:
df = pd.read_sql("select * from fex_eurusd_h1", conn);
df.head()

Unnamed: 0,epoch,mopen,mclose,mhigh,mlow,mvolume,mspread,ima,ima2,ima4,...,istos4,imom,imom2,imom4,rProfitBuy,rSwapBuy,rProfitBTrigger,rProfitSell,rSwapSell,rProfitSTrigger
0,946861200,1.0073,1.0128,1.0132,1.0073,194,50,1.008242,1.007963,1.006779,...,70.12987,100.536033,100.615935,100.565982,3.64,0.0,TO,-3.07,0.0,SL
1,946864800,1.0129,1.0137,1.0141,1.012,113,50,1.008733,1.008175,1.006973,...,72.331461,100.67534,100.815515,100.495688,2.56,0.0,TO,-3.15,0.0,SL
2,946868400,1.014,1.0171,1.0173,1.0134,149,50,1.009517,1.008588,1.007215,...,76.041667,101.073239,101.002979,100.902778,-0.1,0.0,TO,-0.88,0.0,TO
3,946872000,1.017,1.0175,1.019,1.017,214,50,1.01035,1.008958,1.007462,...,78.688525,100.87241,100.962493,100.882411,-2.36,0.0,TO,1.38,0.0,TO
4,946875600,1.0173,1.0167,1.0177,1.0164,162,50,1.010975,1.009296,1.007677,...,78.51153,100.703249,100.893123,100.813089,-2.95,0.0,SL,5.74,0.0,TP


In [13]:
conn.close()

In [14]:
df['targetBuy'] = df['rProfitBuy'] + df['rSwapBuy']
df['targetSell'] = df['rProfitSell'] + df['rSwapSell']

In [15]:
dfNotNa = df[df['rProfitBTrigger'].notna()]
dfCleanRow = dfNotNa[dfNotNa['epoch'] < 1690484400]
dfClean = dfCleanRow.drop(['rProfitBuy', 'rSwapBuy', 'rProfitSell', 'rSwapSell', 'rProfitSTrigger', 'rProfitBTrigger'], axis=1)
dfClean.shape

(145559, 27)

### Recasting as a binary classification problem

The base question, which is to find the moment of profit (Buy/Sell), can be simplified into a binary one: will the trade taken at instant T (Buy and Sell) end in a loss (0) or a gain (1)?

In [16]:
dfCleanBin = dfClean

In [17]:
dfCleanBin['targetProfitBuy'] = dfCleanBin['targetBuy'].apply(lambda x: 1 if x > 0 else 0)
dfCleanBin['targetProfitSell'] = dfCleanBin['targetSell'].apply(lambda x: 1 if x > 0 else 0)
dfCleanBin.shape

(145559, 29)
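
The binarization above can be illustrated on a handful of values. A small sketch on made-up profits (the numbers are for illustration only), using a vectorized comparison that gives the same labels as the `apply`/`lambda` form:

```python
import pandas as pd

# Toy profit values (made up for illustration, not taken from the dataset)
toy = pd.DataFrame({"targetBuy": [3.64, -0.10, 0.0, -2.95, 5.74]})

# Strictly positive profit -> 1 (gain), otherwise 0 (loss or flat)
toy["targetProfitBuy"] = (toy["targetBuy"] > 0).astype(int)

# Share of winning trades, the same ratio as sum(...) / shape[0]
win_rate = toy["targetProfitBuy"].mean()
```

Note that a profit of exactly 0 is labeled as a loss under this rule.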

In [18]:
sum(dfCleanBin['targetBuy'])

-33065.310000000005

In [19]:
sum(dfCleanBin['targetProfitBuy']) / dfCleanBin.shape[0]

0.37148510226093884

In [20]:
sum(dfCleanBin['targetSell'])

-32935.02000000026

In [21]:
sum(dfCleanBin['targetProfitSell']) / dfCleanBin.shape[0]

0.37439801042876086

For both Buy and Sell, about 37% of the targets are profitable against 63% losing. The classes are therefore not heavily imbalanced, although losses clearly dominate.

### Shifting the Target values (forecasting)

For forecasting, the values to predict (profit of the trade) relate to the upcoming period of the trade (T+1), based on the features observed during the current period (T). The Target values therefore have to be shifted from T+1 to T.

In [22]:
dfCleanBin['targetProfitBuy'] = dfCleanBin['targetProfitBuy'].shift(-1)
dfCleanBin['targetProfitSell'] = dfCleanBin['targetProfitSell'].shift(-1)
dfCleanBin['targetSell'] = dfCleanBin['targetSell'].shift(-1)
dfCleanBin['targetBuy'] = dfCleanBin['targetBuy'].shift(-1)

In [23]:
dfCleanBin = dfCleanBin[dfCleanBin['targetProfitSell'].notna()]
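
To make the effect of `shift(-1)` concrete, here is a small sketch on a toy frame (values invented for illustration): each row ends up carrying the label of the following row, and the last row, which has no successor, is dropped.

```python
import pandas as pd

toy = pd.DataFrame(
    {"mclose": [1.0128, 1.0137, 1.0171, 1.0175],
     "targetProfitBuy": [1, 1, 0, 0]},
    index=pd.Index([0, 3600, 7200, 10800], name="epoch"),
)

# Row T now carries the label that originally belonged to row T+1
toy["targetProfitBuy"] = toy["targetProfitBuy"].shift(-1)

# The last row has no T+1 label, so it becomes NaN and is removed
toy = toy[toy["targetProfitBuy"].notna()]
```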

### Transforming the opening price

The opening price at T is essentially the closing price at T-1 (possibly with a slight correction), so it carries little information.
It is probably more telling to expose the direction of the period directly (closing price - opening price).

In [24]:
dfCleanBin['evol'] = dfCleanBin['mclose'] - dfCleanBin['mopen']

In [25]:
dfCleanBin['evol'].describe()

count    145558.000000
mean          0.000004
std           0.001462
min          -0.024800
25%          -0.000600
50%           0.000000
75%           0.000600
max           0.030200
Name: evol, dtype: float64

In [26]:
dfCleanBin.set_index('epoch', inplace=True)

#### Dataset basis
This dataset contains only the raw data (plus the targets), without any technical indicator.

In [27]:
dfBasisB = dfCleanBin[['mopen', 'mclose', 'mhigh', 'mlow', 'mvolume', 'mspread', 'targetProfitBuy']]
dfBasisS = dfCleanBin[['mopen', 'mclose', 'mhigh', 'mlow', 'mvolume', 'mspread', 'targetProfitSell']]

#### Dataset intermediate low
This dataset contains the raw data (plus the targets) and the indicators computed over the shortest calculation period.

In [28]:
dfIntLowB = dfCleanBin[['mopen', 'mclose', 'mhigh', 'mlow', 'mvolume', 'mspread', 'targetProfitBuy', 
                   'ima', 'iatr', 'irsi', 'imacd', 'istos', 'imom']]
dfIntLowS = dfCleanBin[['mopen', 'mclose', 'mhigh', 'mlow', 'mvolume', 'mspread', 'targetProfitSell', 
                   'ima', 'iatr', 'irsi', 'imacd', 'istos', 'imom']]

#### Dataset intermediate Medium
This dataset contains the raw data (plus the targets) and the indicators computed over the intermediate calculation period.

In [29]:
dfIntMedB = dfCleanBin[['mopen', 'mclose', 'mhigh', 'mlow', 'mvolume', 'mspread', 'targetProfitBuy', 
                   'ima2', 'iatr2', 'irsi2', 'imacd2', 'istos2', 'imom2']]
dfIntMedS = dfCleanBin[['mopen', 'mclose', 'mhigh', 'mlow', 'mvolume', 'mspread', 'targetProfitSell', 
                   'ima2', 'iatr2', 'irsi2', 'imacd2', 'istos2', 'imom2']]

#### Dataset intermediate High
This dataset contains the raw data (plus the targets) and the indicators computed over the longest calculation period.

In [30]:
dfIntHigB = dfCleanBin[['mopen', 'mclose', 'mhigh', 'mlow', 'mvolume', 'mspread', 'targetProfitBuy', 
                   'ima4', 'iatr4', 'irsi4', 'imacd4', 'istos4', 'imom4']]
dfIntHigS = dfCleanBin[['mopen', 'mclose', 'mhigh', 'mlow', 'mvolume', 'mspread', 'targetProfitSell', 
                   'ima4', 'iatr4', 'irsi4', 'imacd4', 'istos4', 'imom4']]

#### Full dataset
This dataset contains the raw data (plus the targets) and all indicators over all calculation periods.

In [31]:
dfFullB = dfCleanBin[['mopen', 'mclose', 'mhigh', 'mlow', 'mvolume', 'mspread', 'targetProfitBuy', 
                   'ima', 'iatr', 'irsi', 'imacd','ima2', 'iatr2', 'irsi2', 'imacd2','ima4', 'iatr4', 'irsi4', 'imacd4',
                   'istos', 'istos2', 'istos4', 'imom', 'imom2', 'imom4']]
dfFullS = dfCleanBin[['mopen', 'mclose', 'mhigh', 'mlow', 'mvolume', 'mspread', 'targetProfitSell', 
                   'ima', 'iatr', 'irsi', 'imacd','ima2', 'iatr2', 'irsi2', 'imacd2','ima4', 'iatr4', 'irsi4', 'imacd4',
                   'istos', 'istos2', 'istos4', 'imom', 'imom2', 'imom4']]

## Applying the Deep Learning models

#### Using the base dataset: dfBasisB

In [32]:
dfBasisB.shape

(145558, 7)

#### Defining the Features / Target datasets

In [33]:
df = dfBasisB

In [34]:
dfTarget = df['targetProfitBuy']
dfFeatures = df.drop(columns=['targetProfitBuy'])

#### Splitting the dataset into Train / Test

In [35]:
def getTrainTestDatasets(dfFeatures, dfTarget, testSize=.2):
    rs = ShuffleSplit(n_splits=1, test_size=testSize)
    train_index, test_index = next(rs.split(dfFeatures, dfTarget)) 
    dX_train, dX_test = dfFeatures.iloc[train_index], dfFeatures.iloc[test_index] 
    dy_train, dy_test = dfTarget.iloc[train_index], dfTarget.iloc[test_index]
    return dX_train, dX_test, dy_train, dy_test

Split into (Train + Valid) / Test datasets :

In [36]:
dfFeaturesT, dX_test, dfTargetT, dy_test = getTrainTestDatasets(dfFeatures, dfTarget, .2)

Split into Train / Valid datasets

In [37]:
dX_train, dX_val, dy_train, dy_val = getTrainTestDatasets(dfFeaturesT, dfTargetT, .1)
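
`ShuffleSplit` returns randomly permuted row positions, so the resulting frames are no longer in time order, whereas the windowing step used later stacks consecutive rows. A chronological split may therefore be worth comparing. A minimal sketch, assuming a hypothetical helper `chronologicalSplit` (not in the original notebook) and rows already sorted by epoch:

```python
import numpy as np
import pandas as pd

def chronologicalSplit(dfFeatures, dfTarget, testSize=.2):
    """Hypothetical helper: oldest rows for training, newest rows for testing."""
    cut = int(len(dfFeatures) * (1 - testSize))
    return (dfFeatures.iloc[:cut], dfFeatures.iloc[cut:],
            dfTarget.iloc[:cut], dfTarget.iloc[cut:])

# Toy data: 10 hourly rows indexed by epoch (values invented for illustration)
idx = pd.Index(np.arange(10) * 3600, name="epoch")
feat = pd.DataFrame({"mclose": np.linspace(1.0, 1.1, 10)}, index=idx)
targ = pd.Series([0, 1] * 5, index=idx, name="targetProfitBuy")

dX_tr, dX_te, dy_tr, dy_te = chronologicalSplit(feat, targ, .2)
```

With this variant every test epoch is later than every training epoch, so windows built inside each subset only contain neighbouring hours.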

#### Random spot checks on sampled values

In [38]:
def removeChars(lstChars, inputS):
    for char in lstChars:
        inputS = inputS.replace(char, '')
    return inputS 

In [39]:
def getFeaturesDatasetFromDB(lstIndex, lstColumns, table):
    conn = db.connect()
    sql = "select epoch, " + removeChars(["[", "]", "'"], str(lstColumns)) + " from " + table + " where epoch in (" + removeChars(["[", "]"], str(lstIndex)) + ")"
    #print(sql)
    df = pd.read_sql(sql, conn, index_col='epoch')
    conn.close()
    return df
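
The query above is assembled by string concatenation. As a hedged alternative, the epoch values can be passed as bound parameters. The sketch below uses the standard-library `sqlite3` module with an in-memory table as a stand-in for the PostgreSQL database; the placeholder syntax is driver-specific (`?` for `sqlite3`, `%s` for `psycopg2`), and column or table names cannot be bound this way.

```python
import sqlite3

# In-memory stand-in for the real table (illustration only)
conn = sqlite3.connect(":memory:")
conn.execute("create table fex_eurusd_h1 (epoch integer, mopen real, mclose real)")
conn.executemany(
    "insert into fex_eurusd_h1 values (?, ?, ?)",
    [(946861200, 1.0073, 1.0128),
     (946864800, 1.0129, 1.0137),
     (946868400, 1.0140, 1.0171)],
)

lstIndex = [946861200, 946868400]
# One "?" placeholder per epoch; the values themselves are passed separately
placeholders = ", ".join("?" for _ in lstIndex)
sql = ("select epoch, mopen, mclose from fex_eurusd_h1 "
       "where epoch in (" + placeholders + ") order by epoch")
rows = conn.execute(sql, lstIndex).fetchall()
conn.close()
```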

In [40]:
def getTargetsDatasetFromDB(lstIndex, table):
    # For each epoch T we need the value at T+1 (trading is based on the values of the previous period)
    conn = db.connect()
    lstEpochs = [epoch + 3600 for epoch in lstIndex]
    sql = 'select epoch - 3600 as epoch, ' + '("rProfitBuy" + "rSwapBuy") as profit from ' + table + ' where epoch in (' + removeChars(["[", "]"], str(lstEpochs)) + ')'
    #print(sql)
    df = pd.read_sql(sql, conn, index_col='epoch')
    conn.close()
    return df

In [41]:
def getSamplesDataFromDatasets(dfFeatures, dfTargets, nb_samples):
    lstXIndex = random.sample(range(0, dfFeatures.shape[0]), nb_samples)
    dfUnitT = pd.concat([dfFeatures.iloc[lstXIndex] , dfTargets.iloc[lstXIndex] ], axis=1)
    return dfUnitT

In [42]:
def compareDfValues(lstColumns, lstEpochs, dfUsed, dfRef):
    lstErrors = []
    for epoch in lstEpochs:
        for column in lstColumns:
            val1=dfUsed.loc[epoch][column]
            val2=dfRef.loc[epoch][column]
            if val1!=val2:
                lstErrors.append("Values differ (Used={} vs DB={}) on epoch : {} for column : {}".format(val1,val2,epoch,column))
    return lstErrors

In [43]:
def compareDfTargetsBuy(lstEpochs, dfUsed, dfRef):
    lstErrors = []
    dfRef['targetProfitBuy'] = dfRef['profit'].apply(lambda x: 1 if x > 0 else 0)
    for epoch in lstEpochs:
        if (epoch in dfUsed.index and epoch in dfRef.index):
            val1=dfUsed.loc[epoch]
            val2=dfRef.loc[epoch]['targetProfitBuy']
            if (val1!=val2):
                lstErrors.append("Values differ (Used={} vs DB={}) on epoch : {} for column : targetProfitBuy".format(val1,val2,epoch))
    return lstErrors

In [44]:
def testDatasetsWithDB(dfFeatures, dfTargets, nb_samples, table):
    dfUnitT = getSamplesDataFromDatasets(dfFeatures, dfTargets, nb_samples)
    lstEpochs = dfUnitT.index.to_list()
    lstColumns = dfFeatures.columns.to_list()
    dfDBdataFeat = getFeaturesDatasetFromDB(lstEpochs, lstColumns, table)
    dfDBdataTarget = getTargetsDatasetFromDB(lstEpochs, table)
    lstErrorsFeat = compareDfValues(lstColumns, lstEpochs, dfUnitT, dfDBdataFeat)
    lstErrorstarget = compareDfTargetsBuy(lstEpochs, dfTargets, dfDBdataTarget)
    for errorFeat in lstErrorsFeat:
        print(errorFeat) 
    for errorTarget in lstErrorstarget:
        print(errorTarget) 
    if (len(lstErrorstarget) + len(lstErrorsFeat)) > 0:
        raise Exception('Data Validation issues') 
    return

#### Randomly test 200 records (compare df with the database) in all datasets
=> An exception is raised if validation fails (NO GO), which stops the whole processing.

In [45]:
testDatasetsWithDB(dX_train, dy_train, 200, 'fex_eurusd_h1')

In [46]:
testDatasetsWithDB(dX_test, dy_test, 200, 'fex_eurusd_h1')

In [47]:
testDatasetsWithDB(dX_val, dy_val, 200, 'fex_eurusd_h1')

#### Data normalization

In [48]:
scaler = StandardScaler()
X_train = scaler.fit_transform(dX_train)
X_test = scaler.transform(dX_test)
X_val = scaler.transform(dX_val)
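
The scaler is fitted on the training set only, and the same statistics are then reused for the test and validation sets. A plain NumPy sketch of what this amounts to, on toy values invented for illustration:

```python
import numpy as np

train = np.array([[1.0, 100.0], [2.0, 200.0], [3.0, 300.0]])
test = np.array([[2.0, 400.0]])

# Statistics are learned from the training rows only
mean = train.mean(axis=0)
std = train.std(axis=0)   # population standard deviation (ddof=0), as StandardScaler uses

train_scaled = (train - mean) / std
test_scaled = (test - mean) / std   # test data reuses the training statistics
```

Reusing the training statistics keeps information from the test and validation rows out of the scaling step.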

In [49]:
y_train = dy_train.to_numpy()
y_test = dy_test.to_numpy()
y_val = dy_val.to_numpy()

In [50]:
X_train.shape

(104801, 6)

#### LSTM / GRU specifics: splitting the data into sub-sequences

LSTMs work on sub-sequences that define, for a given instance, which preceding instances are associated with it.

In the trading context, each data sample at instant T is given together with the n samples (n is a parameter) that directly precede it in time [T-1 .... T-n]; the LSTM uses them to interpret the data at instant T.

In [51]:
def spliSequencesWithSamples(xdata, ydata, lookback):
    X, y = list(), list()
    for i in range(len(xdata)):
        if (i>=lookback-1): # Rows with not enough prev values cannot be taken
            # gather input and output parts of the pattern
            seq_x, seq_y = xdata[i+1-lookback:i+1, :], ydata[i]
            X.append(seq_x)
            y.append(seq_y)  
    return(np.array(X), np.array(y))
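
The loop above can also be expressed with NumPy's `sliding_window_view` (available from NumPy 1.20). A sketch on toy data, with the hypothetical name `splitWithView`; it is meant to produce the same windows as the loop, not to replace the notebook's function:

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def splitWithView(xdata, ydata, lookback):
    """Hypothetical vectorized equivalent of the loop-based windowing."""
    # Windows along the time axis come out as (samples, features, lookback)
    windows = sliding_window_view(xdata, lookback, axis=0)
    # Reorder to the (samples, timesteps, features) layout expected by LSTM/GRU
    X = windows.transpose(0, 2, 1)
    # The label of a window is the label of its last row
    y = ydata[lookback - 1:]
    return X, y

xdata = np.arange(12, dtype=float).reshape(6, 2)   # 6 time steps, 2 features
ydata = np.array([0, 1, 0, 1, 0, 1])
X, y = splitWithView(xdata, ydata, lookback=3)
```

The result is a read-only view on `xdata`, so it avoids copying every window; call `np.array(X)` if a writable copy is needed.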

## Computing scores and gains

In [52]:
def calculateRandomProfit(dfCleanRow, target='targetBuy'):
    profit = dfCleanRow[target].sum()
    profitPerTrade = profit / len(dfCleanRow)
    return profit, profitPerTrade

### Computing scores and gains (100% random model)

In [53]:
profitRandom, profitPerTradeRandom = calculateRandomProfit(dfCleanRow, target='targetBuy')

In [54]:
profitRandom

-33065.30999999999

In [55]:
profitPerTradeRandom

-0.2271608763456742

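## Model branches

Each helper below builds one input branch (LSTM with attention, stacked GRU, single LSTM, Transformer, or raw pass-through). The branches are concatenated before the fully connected layers.

Branch builder functions: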

In [56]:
def createLSTMWithAttention(nbFeatures, lookback):
    inputWindow = Input(shape=(lookback, nbFeatures))
    # LSTM input : [timesteps, features] 
    mem1 = LSTM(32, return_sequences = True, activation='tanh')(inputWindow)   
    att =  Attention(32)(mem1)
    #dense = Dense(32)(att)
    return Model(inputs=inputWindow, outputs=att)

In [57]:
def createTimeWindowedGRU(nbFeatures, lookback):
    inputWindow = Input(shape=(lookback, nbFeatures))
    # GRU input : [timesteps, features] 
    mem1 = GRU(32, return_sequences = True, activation='tanh', kernel_initializer='TruncatedNormal')(inputWindow)    
    mem2 = GRU(8,  return_sequences = False, activation='tanh', kernel_initializer='TruncatedNormal')(mem1)
    return Model(inputs=inputWindow, outputs=mem2)

In [58]:
def createTimeWindowedLSTM(nbFeatures, lookback):
    inputWindow = Input(shape=(lookback, nbFeatures))
    # LSTM input : [timesteps, features] 
    mem1 = LSTM(32, return_sequences = False, activation='tanh')(inputWindow)   
    #mem2 = LSTM(8,  return_sequences = False, activation='tanh')(mem1)
    return Model(inputs=inputWindow, outputs=mem1)

In [59]:
def getTransfoCustomParams(nbEncoder, nbDecoder, numHeads, ffnHiddenSize, ffnFilterSize):
    params = {
        "n_encoder_layers": nbEncoder,
        "n_decoder_layers": nbDecoder,
        "use_token_embedding": False,
        "attention_hidden_sizes": 128 * 1,
        "num_heads": numHeads,
        "attention_dropout": 0.0,
        "ffn_hidden_sizes": ffnHiddenSize * 1,
        "ffn_filter_sizes": ffnFilterSize * 1,
        "ffn_dropout": 0.0,
        "scheduler_sampling": 1,  # 0 means teacher forcing, 1 means use last prediction
        "skip_connect_circle": False,
        "skip_connect_mean": False,
    }
    return params

In [60]:
def createTimeWindowedTransformer(nbFeatures, lookback, nbOutVal=1, nbEncoder=1, nbDecoder=1, numHeads=1, ffnHiddenSize=128, ffnFilterSize=128):
    inputWindow = Input(shape=(lookback, nbFeatures))
    params = getTransfoCustomParams(nbEncoder, nbDecoder, numHeads, ffnHiddenSize, ffnFilterSize)
    instanceT = Transformer(nbOutVal, custom_model_params=params)
    transfo = instanceT(inputWindow)
    return Model(inputs=inputWindow, outputs=transfo)

In [61]:
def createRawDataBranch(nbFeatures):
    inputRaw = Input(shape=(nbFeatures,))
    return Model(inputs=inputRaw, outputs=inputRaw)

In [74]:
def createBranche(nbFeatures, lookback, typeLayer):
    match typeLayer:
        case "RAW":
            return createRawDataBranch(nbFeatures)
        case "GRU":
            return createTimeWindowedGRU(nbFeatures, lookback)
        case "LSTM":
            return createTimeWindowedLSTM(nbFeatures, lookback)
        case "LSTMAT":
            return createLSTMWithAttention(nbFeatures, lookback)
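        case "TRANSFO":
            return createTimeWindowedTransformer(nbFeatures, lookback, nbOutVal=1, nbEncoder=1, nbDecoder=1, numHeads=1, ffnHiddenSize=128, ffnFilterSize=128)
        case _:
            # Fail loudly instead of silently returning None for an unknown layer type
            raise ValueError("Unknown layer type : {}".format(typeLayer))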

In [63]:
def combineBranches(lstBranches):
    lstOutput = [branche.output for branche in lstBranches]
    return layers.concatenate(lstOutput)

In [64]:
def createModelInputs(lstBranches):
    return [branche.input for branche in lstBranches]

#### Create NN model from a dataset with the associated layers (Raw / LSTM / GRU) with specified window size

In [65]:
def buildMultiWindowedInput(nbFeatures, nbWindows, lstLookback, lstLayers):
    lstBranches = []
    for i in range(nbWindows):
        lookback = lstLookback[i]
        branche = createBranche(nbFeatures, lookback, lstLayers[i])
        lstBranches.append(branche)
    combined = combineBranches(lstBranches)
    # Fully connected layers, with 1 final output for binary classification
    # d2 = Dense(32, name='Dense_1', activation='relu')(combined)
    d1 = Dense(8, name='Dense_2', activation='relu')(combined)
    d2 = Dense(1, name='Dense_3', activation='sigmoid')(d1)
    lstInputs = createModelInputs(lstBranches)
    model = Model(inputs=lstInputs, outputs=d2)
    model.compile(loss='mse', optimizer='adam', metrics=['accuracy'])
    return model

In [66]:
lstLookback = [5 * 24]
lstLayers   = ['TRANSFO']
nbInput = len(lstLookback)

In [67]:
nbInput

1

In [68]:
K.clear_session() 

In [73]:
modeldyn = buildMultiWindowedInput(X_train.shape[1], nbInput, lstLookback, lstLayers)

6


#### Format dataset and Time Windows for the model

In [76]:
def getDataWindowed(xData2D, lookback, maxLookback):
    X = list()
    if lookback == 0:
        return xData2D[maxLookback-1:,:]
    else:
        for i in range(len(xData2D)):
            if (i>=maxLookback-1): # Rows with not enough prev values cannot be taken
                seq_x = xData2D[i+1-lookback:i+1, :]
                X.append(seq_x) 
    return np.array(X)

In [77]:
# Return Windowed dataset (xData in 3D) and label (yData1D) sized. Number of rows has to match with the maximum Windowed dataset
def formatWindowedData(lstLookback, xData2D, yData1D):
    maxLookback = max(lstLookback)
    lstxData3D = [getDataWindowed(xData2D, lookback, maxLookback) for lookback in lstLookback]
    yDataReshape1D = yData1D[maxLookback-1:]
    return lstxData3D, yDataReshape1D
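
The alignment rule used by the two functions above is that every branch, whatever its lookback, yields one sample per row starting at position `maxLookback - 1`, so all inputs have the same number of rows. A condensed sketch on toy data, with the hypothetical name `alignWindows` (a simplified restatement, not the notebook's code):

```python
import numpy as np

def alignWindows(xData2D, lstLookback):
    """Hypothetical condensed version of the alignment rule: every branch
    yields one sample per row i >= maxLookback - 1."""
    maxLookback = max(lstLookback)
    outputs = []
    for lookback in lstLookback:
        if lookback == 0:
            # Raw branch: the current row only, no time axis
            outputs.append(xData2D[maxLookback - 1:, :])
        else:
            rows = range(maxLookback - 1, len(xData2D))
            outputs.append(np.array([xData2D[i + 1 - lookback:i + 1, :] for i in rows]))
    return outputs

x = np.arange(20, dtype=float).reshape(10, 2)   # 10 time steps, 2 features
raw, short, long_ = alignWindows(x, [0, 2, 4])
```

Here the raw branch has shape `(7, 2)`, the short window `(7, 2, 2)` and the long window `(7, 4, 2)`; the last time step of each window is the same row as the raw input.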

In [78]:
modeldyn.summary()

Model: "model_1"
__________________________________________________________________________________________________
 Layer (type)                   Output Shape         Param #     Connected to                     
 input_3 (InputLayer)           [(None, 120, 6)]     0           []                               
                                                                                                  
 tf.compat.v1.shape_2 (TFOpLamb  (3,)                0           ['input_3[0][0]']                
 da)                                                                                              
                                                                                                  
 tf.__operators__.getitem_2 (Sl  ()                  0           ['tf.compat.v1.shape_2[0][0]']   
 icingOpLambda)                                                                                   
                                                                                            

In [79]:
# TEST : Reload always the same init weights in order to compare results easily
if os.path.isfile(pathModelWeights):
    modeldyn.load_weights(pathModelWeights)
    print('Model : Reload Weights Done')
else:
    modeldyn.save_weights(pathModelWeights)
    print('Model : Save Weights Done')

Model : Save Weights Done


In [80]:
lstxTrainWindowed3D, yTrained1D = formatWindowedData(lstLookback, X_train, y_train)

In [81]:
lstxValWindowed3D, yVal1D = formatWindowedData(lstLookback, X_val, y_val)

### TRAINING

In [82]:
PATIENCE = 4
EPOCHS = 2
LOOP = 2
BATCH_SIZE = 32 # Default used by model.fit is 32
steps_per_epoch = int(yTrained1D.shape[0] * LOOP / EPOCHS // BATCH_SIZE)    # Spread LOOP passes over the training data across EPOCHS epochs
validation_steps = yVal1D.shape[0] // BATCH_SIZE                       # Use all validation data on each epoch

In [83]:
CLASS_WEIGHT = {0: .37, 1 : .63} # Used to counter class imbalance

In [84]:
early_stopping = EarlyStopping(monitor='val_loss', patience = PATIENCE, restore_best_weights=True)

In [85]:
modelstart = time.time()
history = modeldyn.fit(
                    x=lstxTrainWindowed3D,
                    y=yTrained1D,
                    epochs = EPOCHS,
                    batch_size = BATCH_SIZE,
                    #class_weight = CLASS_WEIGHT,
                    validation_data=(lstxValWindowed3D, yVal1D),
                    validation_steps=validation_steps,
                    steps_per_epoch=steps_per_epoch)
# modeldyn.save(pathModel)
print("\nModel Runtime: %0.2f Minutes"%((time.time() - modelstart)/60))

Epoch 1/2
Epoch 2/2

Model Runtime: 3.97 Minutes


### Test

In [86]:
lstxTestWindowed3D, yTest1D = formatWindowedData(lstLookback, X_test, y_test)

In [87]:
pred = modeldyn.predict(lstxTestWindowed3D)



In [88]:
del modeldyn

In [96]:
pred = pred[:,0,:]

In [97]:
pred

array([[0.36795834],
       [0.36795834],
       [0.36795816],
       ...,
       [0.36795765],
       [0.36795765],
       [0.36795756]], dtype=float32)

### Profit

In [89]:
def calculateProfit(dfCleanRow, dX_test, yTestLbk, pred, lookback=100, specificity=.8, target='targetBuy'):
    [fpr, tpr, thr] = roc_curve(yTestLbk, pred, pos_label=1)
    idx = np.max(np.where((1-fpr) > specificity)) 
    seuil = thr[idx]  
    dfPred = pd.DataFrame(pred, columns = ['proba'])
    #Get rows index with positive proba (proba > seuil)
    xRows = dfPred[dfPred['proba']>seuil].index.to_numpy()
    #Get matching index (epoch timestamp) from dX_test => Periods with proba > seuil
    xEpochs = dX_test.iloc[lookback-1:,:].iloc[xRows].index.to_numpy()
    dfCleanEpochIdx = dfCleanRow.set_index('epoch')
    profit = dfCleanEpochIdx.loc[xEpochs][target].sum()
    profitPerTrade = profit / len(xRows)
    return profit, profitPerTrade
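
The threshold selection in `calculateProfit` keeps the ROC operating point whose specificity (1 - false positive rate) still exceeds the requested value. A rough NumPy-only sketch of the same idea, using a quantile of the negative-class scores instead of `roc_curve` (a swapped-in technique, so the threshold can differ slightly, especially with tied scores); `thresholdAtSpecificity` is a hypothetical name and the scores are invented:

```python
import numpy as np

def thresholdAtSpecificity(yTrue, scores, specificity):
    """Hypothetical helper: score above which roughly (1 - specificity)
    of the true negatives would still be flagged as positive."""
    negScores = scores[yTrue == 0]
    return np.quantile(negScores, specificity)

yTrue = np.array([0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1])
scores = np.array([.05, .10, .15, .20, .25, .30, .35, .40, .45, .90,
                   .50, .60, .70, .95])
seuil = thresholdAtSpecificity(yTrue, scores, .8)

selected = scores > seuil                        # trades the model would take
falsePositives = np.sum(selected & (yTrue == 0))
achievedSpecificity = 1 - falsePositives / np.sum(yTrue == 0)
```

Raising the specificity reduces the number of trades taken, which is why the profit per trade and the number of trades per day both depend on this parameter.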

In [98]:
profit, profitPerTrade = calculateProfit(dfCleanRow, dX_test, yTest1D, pred, lookback=max(lstLookback), specificity=.95, target='targetBuy')

In [99]:
print('Global profit : ', profit)
print('Average profit per trade : ', profitPerTrade)
print('Global Number of trade made : ', profit / profitPerTrade)
print('Average number of trade made per day : ', (profit / profitPerTrade) / len(pred) * 24)

Global profit :  -202.65999999999997
Average profit per trade :  -0.1711655405405405
Global Number of trade made :  1184.0
Average number of trade made per day :  0.9800986445003965


## Conclusion

This model, based on Stacked GRU, seems to be the most promising so far. 
- Using a specificity of 0.9 appears to bring the model to break even, or close to it, in terms of profit. 
- The lookback window is quite large, 5 days (GRUs cope well with large windows).
- The validation loss does not decrease steadily (unstable model?). Early stopping cannot really be used. The metrics are hard to read (class imbalance?).

At this point we have a first baseline, not great but potentially promising with optimizations. To optimize it we can address the following questions:
- Could it be helpful to add some features? (technical analysis, time feature)
- Would it be possible, and useful, to adapt the model to use different time windows in "parallel" rather than just one?
- Could it be interesting to use a different loss or to balance the classes, in order to make the model more "stable" in its progression?


## Next steps

1 - Add features 

-> Complete the dataset with calculated features
- Add a time feature
- Add windowed technical indicators (mostly short windows, as the GRU already covers a large time frame)

-> Combine different time windows in parallel
- Use multiple inputs. The idea is that technical analysis relies on multiple-timeframe analysis. It could be interesting to reproduce this in some way rather than being "fixed" on a single lookback window.

-> Add a detailed gain analysis
The global result is important, but a graphical view (monthly, daily) with the standard deviation (sd -> risk) would also be useful.

-> Reinforce the validation of results and calculations
- Use K-fold validation (different sets of test/validation data)
