# Recurrent Neural Network

Predicting the upward and downward trends of the Google stock price: the model is trained on 5 years of data (2012 to 2016) and then used to predict January 2017.

## Part 1 - Data Preprocessing

### Importing the libraries

In [None]:
import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

### Importing the training set

In [None]:
dataset_train = pd.read_csv('Google_Stock_Price_Train.csv')
training_set = dataset_train.iloc[:, 1:2].values

In [None]:
training_set

### Feature Scaling

In [None]:
from sklearn.preprocessing import MinMaxScaler
sc = MinMaxScaler(feature_range = (0, 1))
training_set_scaled = sc.fit_transform(training_set)

In [None]:
training_set_scaled


When the network uses sigmoid activation functions (as the LSTM gates do), normalisation (min-max scaling to the range 0 to 1) is generally preferred over standardisation.
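To make the scaling step concrete, here is a minimal sketch of the min-max formula computed by hand with NumPy. The prices are made-up values, not rows from the dataset, and the variable names are only illustrative. For a single column, `MinMaxScaler(feature_range = (0, 1))` applies the same formula.

```python
import numpy as np

# Made-up opening prices, shaped (n_samples, 1) like training_set above.
prices = np.array([[300.0], [350.0], [400.0], [500.0]])

# Min-max normalisation: x_scaled = (x - min) / (max - min)
col_min = prices.min(axis=0)
col_max = prices.max(axis=0)
prices_scaled = (prices - col_min) / (col_max - col_min)

# Inverting the scaling recovers the original prices,
# which is what sc.inverse_transform does for the predictions later on.
prices_restored = prices_scaled * (col_max - col_min) + col_min

print(prices_scaled.ravel())
print(prices_restored.ravel())
```

The smallest price maps to 0 and the largest to 1; everything else falls in between.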

### Creating a data structure with 60 timesteps and 1 output

In [None]:
X_train = []
y_train = []
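for i in range(60, len(training_set_scaled)):  # 1258 rows in this dataset
    X_train.append(training_set_scaled[i-60:i, 0])  # the 60 previous values: rows i-60 to i-1, column 0
    y_train.append(training_set_scaled[i, 0])       # the value to predict: row i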
X_train, y_train = np.array(X_train), np.array(y_train)

In [None]:
X_train

In [None]:
y_train

Using 60 timesteps means that, to predict each value, the RNN looks back at the 60 previous values, i.e. the 60 previous trading days. The value 60 was chosen by trial and error.
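The sliding-window idea is easier to see on a tiny series. The sketch below uses a hypothetical helper, `make_windows`, and a window of 3 instead of 60; it is only an illustration of the same loop, not part of the original notebook.

```python
import numpy as np

def make_windows(series, timesteps):
    """Hypothetical helper: split a 1-D series into (inputs, targets)."""
    X, y = [], []
    for i in range(timesteps, len(series)):
        X.append(series[i - timesteps:i])  # the `timesteps` values before position i
        y.append(series[i])                # the value to predict at position i
    return np.array(X), np.array(y)

series = np.arange(10, dtype=float)  # toy data: 0.0, 1.0, ..., 9.0
X_toy, y_toy = make_windows(series, timesteps=3)

print(X_toy.shape, y_toy.shape)
print(X_toy[0], y_toy[0])
```

A series of length 10 with a window of 3 gives 7 samples, just as 1258 rows with a window of 60 give 1198 samples.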


### Reshaping for RNN

In [None]:
X_train = np.reshape(X_train, (X_train.shape[0], X_train.shape[1], 1))

This line reshapes the training data (X_train) into the three-dimensional format that an LSTM layer expects. In Keras, the LSTM layer expects input of shape [samples, time steps, features]. Here that is (1198, 60, 1): 1198 samples, 60 timesteps and 1 feature (the scaled stock price).
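As a quick check of the shapes involved, the sketch below reshapes a placeholder array of zeros with the same 2-D shape that X_train has at this point (1258 - 60 = 1198 samples of 60 timesteps). The names `X_2d`, `X_3d` and `X_alt` are illustrative only.

```python
import numpy as np

# Placeholder with the shape X_train has before reshaping.
X_2d = np.zeros((1198, 60))

# Add a trailing "features" axis of size 1: (samples, timesteps) -> (samples, timesteps, 1)
X_3d = np.reshape(X_2d, (X_2d.shape[0], X_2d.shape[1], 1))

# Equivalent alternative using np.newaxis.
X_alt = X_2d[..., np.newaxis]

print(X_2d.shape, X_3d.shape, X_alt.shape)
```

The data itself is unchanged; only a new axis is added, which is where extra indicators (more features) would go if they were used.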

## Part 2 - Building and Training the RNN

### Importing the Keras libraries and packages

In [None]:
from keras.models import Sequential
from keras.layers import Dense
from keras.layers import LSTM
from keras.layers import Dropout

### Initialising the RNN

In [None]:
regressor = Sequential()

regressor is a Sequential model, i.e. a linear stack of layers. It is called a regressor because it predicts a continuous value.

### Adding the first LSTM layer and some Dropout regularisation

In [None]:
regressor.add(LSTM(units = 50, return_sequences = True, input_shape = (X_train.shape[1], 1)))
regressor.add(Dropout(0.2))

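Dropout is used to avoid overfitting. A rate of 0.2 means that 20% of the layer's outputs are randomly set to zero at each training step, i.e. on average 10 of the 50 units.
units = 50 is the number of LSTM units (neurons) in the first LSTM layer.
return_sequences = True makes the layer return its output at every timestep, which is required when another LSTM layer is stacked on top.
input_shape = (X_train.shape[1], 1) is (60, 1): 60 timesteps and 1 predictor.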
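The effect of a 0.2 dropout rate on 50 units can be sketched with plain NumPy. This is a simplified illustration of the idea, not the Keras implementation: a random mask zeroes roughly 20% of the values and the survivors are scaled by 1 / (1 - rate), which is also how the Keras `Dropout` layer keeps the expected sum unchanged during training. The seed and variable names are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(0)  # arbitrary seed, for repeatability
rate = 0.2

# Stand-in for the 50 outputs of one LSTM layer at one timestep.
activations = np.ones(50)

# Each unit is kept with probability 1 - rate, independently of the others.
keep_mask = rng.random(activations.shape) >= rate

# Dropped units become 0; kept units are scaled up by 1 / (1 - rate).
dropped = activations * keep_mask / (1.0 - rate)

print("units dropped this step:", int((~keep_mask).sum()))
print("expected on average:", rate * activations.size)
```

The number of dropped units varies from step to step around 10. Dropout is only active during training, not when calling `predict`.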


### Adding a second LSTM layer and some Dropout regularisation

In [None]:
regressor.add(LSTM(units = 50, return_sequences = True))
regressor.add(Dropout(0.2))

### Adding a third LSTM layer and some Dropout regularisation

In [None]:
regressor.add(LSTM(units = 50, return_sequences = True))
regressor.add(Dropout(0.2))

### Adding a fourth LSTM layer and some Dropout regularisation

In [None]:
regressor.add(LSTM(units = 50))
regressor.add(Dropout(0.2))

### Adding the output layer

In [None]:
regressor.add(Dense(units = 1))

### Compiling the RNN

In [None]:
regressor.compile(optimizer = 'adam', loss = 'mean_squared_error')

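This step prepares the model for training by specifying the optimizer and the loss function. Mean squared error is used as the loss because this is a regression problem. See the Keras documentation for the list of available optimizers.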
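For reference, the mean squared error is the average of the squared differences between targets and predictions. A minimal sketch with made-up scaled values:

```python
import numpy as np

y_true = np.array([0.2, 0.5, 0.9])  # made-up scaled targets
y_pred = np.array([0.1, 0.5, 1.0])  # made-up predictions

# MSE = mean((y_true - y_pred) ** 2)
mse = np.mean((y_true - y_pred) ** 2)
print(mse)
```

Because the errors are squared, large mistakes are penalised more heavily than small ones.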

### Fitting the RNN to the Training set

In [None]:
regressor.fit(X_train, y_train, epochs = 100, batch_size = 32)

## Part 3 - Making the predictions and visualising the results

### Getting the real stock price of 2017

In [None]:
dataset_test = pd.read_csv('Google_Stock_Price_Test.csv')
real_stock_price = dataset_test.iloc[:, 1:2].values

### Getting the predicted stock price of 2017

In [None]:
dataset_total = pd.concat((dataset_train['Open'], dataset_test['Open']), axis = 0)
inputs = dataset_total[len(dataset_total) - len(dataset_test) - 60:].values
inputs = inputs.reshape(-1,1)
inputs = sc.transform(inputs)

Each prediction needs the 60 previous values, but the test set only contains the 20 trading days of January 2017, so the earlier values are taken from the end of the training set (late 2016).

axis = 0 concatenates the two series vertically, i.e. along the rows.
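The slicing above can be checked on toy data. The sketch below uses NumPy arrays in place of the pandas series (`np.concatenate` instead of `pd.concat`) and made-up sizes: 100 "training" values and 5 "test" values. All names are illustrative.

```python
import numpy as np

timesteps = 60
train_open = np.arange(100, dtype=float)      # toy stand-in for the training 'Open' column
test_open = np.arange(100, 105, dtype=float)  # toy stand-in for the test 'Open' column

# Stack the two series end to end (axis=0), then keep the test values
# plus the `timesteps` values that come right before them.
total = np.concatenate((train_open, test_open), axis=0)
inputs_toy = total[len(total) - len(test_open) - timesteps:]

# One window of `timesteps` values per test day.
windows = np.array([inputs_toy[i - timesteps:i]
                    for i in range(timesteps, len(inputs_toy))])

print(inputs_toy.shape, windows.shape)
print(windows[0][0], windows[0][-1])
```

The first window consists entirely of the last 60 training values, and each later window slides forward by one day into the test period.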

### Creating the test sequences

In [None]:
X_test = []
for i in range(60, len(inputs)):  # 60 to 79: one window per test day
    X_test.append(inputs[i-60:i, 0])
X_test = np.array(X_test)

### Reshaping the test data

In [None]:
X_test = np.reshape(X_test, (X_test.shape[0], X_test.shape[1], 1))

In [None]:
predicted_stock_price = regressor.predict(X_test)
predicted_stock_price = sc.inverse_transform(predicted_stock_price)

### Visualising the results

In [None]:
plt.plot(real_stock_price, color = 'red', label = 'Real Google Stock Price')
plt.plot(predicted_stock_price, color = 'blue', label = 'Predicted Google Stock Price')
plt.title('Google Stock Price Prediction')
plt.xlabel('Time')
plt.ylabel('Google Stock Price')
plt.legend()
plt.show()