Neural-Network-with-Financial-Time-Series-Data

Introduction:

Time series analysis is an important part of financial analysis. Today there is more data, from more sources, arriving at a higher frequency. New sources include new exchanges, social media, and news outlets, and delivery rates have grown from dozens of messages a day to hundreds of thousands of messages per second. This growth calls for more analytical techniques. Most modern techniques are not fundamentally new and all rest on a statistical basis, but their applicability depends on the available computing power. Because computing power has grown faster than the volume of time series data, large-scale time series can now be analyzed in ways that were not practical before. This neural network predicts the future movement of the index and achieves a reasonably good result.

Content:

It downloads stock or index data from an online information provider and builds a pandas DataFrame that contains open, high, low, and close prices in a form compatible with TensorFlow and Keras. An LSTM recurrent neural network is then trained on the data and used for prediction. The results are also visualized for ease of presentation. Optimized hyperparameters are provided at the end.
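As a rough illustration of how a price table can be turned into LSTM input, the sketch below slices a 2-D array into overlapping windows. It is not taken from the notebooks: `make_windows` is a hypothetical helper, the target is assumed to be the last column (taken here as the close) of the day after each window, and random numbers stand in for downloaded data.

```python
import numpy as np


def make_windows(values, window=22):
    """Split a (n_days, n_features) array into LSTM samples.

    Each sample X[i] holds `window` consecutive days. The target y[i] is
    the last column of the day that follows the window (assumed: close).
    """
    values = np.asarray(values, dtype=float)
    X, y = [], []
    for start in range(len(values) - window):
        X.append(values[start:start + window])
        y.append(values[start + window, -1])
    return np.array(X), np.array(y)


# Synthetic open/high/low/close rows stand in for downloaded data.
rng = np.random.default_rng(0)
ohlc = rng.random((100, 4))
X, y = make_windows(ohlc, window=22)
print(X.shape, y.shape)  # (78, 22, 4) (78,)
```

The resulting shape (samples, time steps, features) is the layout that Keras recurrent layers expect.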

How it works:

The efficient market hypothesis (EMH) states that future prices cannot be predicted from past prices, so this model clearly contradicts the EMH. It attempts to capture the market sentiment behind price trends rather than analyze a security's fundamental attributes. To strengthen the market sentiment analysis, a sentiment analysis model or an event-driven prediction model will be added. Hopefully, the result will be slightly better than a random guess. The model is currently overfitting, and more updates will follow.

Versions

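Feedback on this project pointed out that a stock's price is usually very close to its previous price, so a regression model can reach a low error simply by repeating the last price, which makes regression inappropriate. From now on, there will be three methods for predicting the stock price.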

Prediction with 22 previous days (Original) (Regression)

Filename: LSTM_Stock_prediction_20170507.ipynb

Currently not working, but it contains all of the research results.

Prediction with 22 previous days (Using Quandl database) (Regression)

Filename: LSTM_Stock_prediction_20170528(Quandl).ipynb

Latest update; uses the Quandl database instead of pandas datareader.

Prediction with 22 previous days (Modified) (Classification)

Filename: TBA
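Since the classification notebook is not yet available, the following is only a sketch of one way the regression target could be recast as a classification target: each day is labeled by whether the next close is higher. The helper name `direction_labels` and the rule that an unchanged price counts as "not up" are assumptions, not the author's method.

```python
def direction_labels(closes):
    """Return 1 where the next close is higher than the current one, else 0."""
    return [1 if nxt > cur else 0 for cur, nxt in zip(closes, closes[1:])]


closes = [100.0, 101.5, 101.2, 101.2, 103.0]
labels = direction_labels(closes)
print(labels)  # [1, 0, 0, 1]
```

There is one label fewer than there are prices, because the last day has no following close to compare against.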

Future improvement:

Use more fundamental data to predict the stock price.
Sentiment analysis or event-driven analysis
Train the model with 3000 US stocks.
Deep Q / reinforcement learning for portfolio optimization and risk
Quantopian Zipline will be used for backtesting
GRU and LSTM comparison
Apply learning to learn (meta-learning) to this model

I am currently working on another project; I will return to this one soon with a big update.

Result:

Latest LSTM model result on 7 years of test data that was not used for training:

Train Score: 0.00006 MSE (0.01 RMSE)

Test Score: 0.00029 MSE (0.02 RMSE)
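For reference, RMSE is the square root of MSE, which is how the two figures in each score relate. The sketch below shows both metrics in plain Python; `mse` and `rmse` are illustrative helpers, not functions from `lstmstock.py`.

```python
import math


def mse(actual, predicted):
    """Mean squared error between two equal-length sequences."""
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean squared error: the square root of the MSE."""
    return math.sqrt(mse(actual, predicted))


# Relating the reported scores: RMSE = sqrt(MSE), rounded to two decimals.
print(round(math.sqrt(0.00006), 2))  # 0.01
print(round(math.sqrt(0.00029), 2))  # 0.02
```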

Hyperparameter

The following results will be removed and revised soon because a new model has been deployed.

After several tests:

For dropout, the result is shown below. A dropout of 0.2 or 0.3 works best.

For epochs, the result is shown below. Fewer than 100 epochs is sufficient.

For the number of neurons, [256, 256, 32, 1] and [512, 512, 32, 1] are ideal for this model.

For weight decay, 0.4 and 0.5 work well.

For the number of days of stock prices included (window), 10 days is ideal.
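One way to read the layer sizes above is as two stacked LSTM layers followed by two dense layers. The following is a minimal sketch under that assumption; `build_model`, the five input features, the ReLU activation, and the Adam optimizer are illustrative choices and are not taken from the notebooks. Weight decay is left out because the text does not say how it is applied.

```python
from tensorflow.keras.layers import LSTM, Dense, Dropout, Input
from tensorflow.keras.models import Sequential


def build_model(window=10, n_features=5, neurons=(256, 256, 32, 1), dropout=0.2):
    """Build a stacked LSTM regressor (assumed layout, not the author's code)."""
    model = Sequential([
        Input(shape=(window, n_features)),
        LSTM(neurons[0], return_sequences=True),   # passes the full sequence on
        Dropout(dropout),
        LSTM(neurons[1], return_sequences=False),  # keeps only the last step
        Dropout(dropout),
        Dense(neurons[2], activation="relu"),
        Dense(neurons[3], activation="linear"),    # single predicted value
    ])
    model.compile(loss="mse", optimizer="adam")
    return model


model = build_model()
model.summary()
```

Swapping `neurons` for `(512, 512, 32, 1)` or `dropout` for `0.3` gives the other configurations mentioned above.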

Update:

26/03/2017 First update

A recurrent neural network with LSTM has been added to the code.
Keras with TensorFlow has also been implemented.
TensorBoard for neural network visualization has also been added to the code.

14/04/2017 Second update

Normalized adjusted close price.
A new data downloader has been implemented for simplicity
Added more variables to predict the adjusted close price
More accurate result, significantly lower mean squared error
Extra visualization for close price
Denormalization will be fixed soon
Twitter sentiment analysis is currently in the testing stage

16/04/2017 Third update

Updated denormalization
More test results available
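The updates above do not say which normalization is used. As one common possibility, the sketch below assumes min-max scaling and shows how predictions can be mapped back to price units; `fit_min_max`, `normalize`, and `denormalize` are hypothetical helpers.

```python
def fit_min_max(values):
    """Return the (min, max) pair needed to scale and later restore values."""
    lo, hi = min(values), max(values)
    if hi == lo:
        raise ValueError("cannot scale a constant series")
    return lo, hi


def normalize(values, lo, hi):
    """Scale values into the range [0, 1]."""
    return [(v - lo) / (hi - lo) for v in values]


def denormalize(scaled, lo, hi):
    """Invert normalize(), returning values in the original units."""
    return [s * (hi - lo) + lo for s in scaled]


prices = [50.0, 75.0, 100.0]
lo, hi = fit_min_max(prices)
scaled = normalize(prices, lo, hi)
print(scaled)  # [0.0, 0.5, 1.0]
restored = denormalize(scaled, lo, hi)
print(restored)  # [50.0, 75.0, 100.0]
```

Fitting `lo` and `hi` on the training data only, and reusing them for the test data, avoids leaking test information into training.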

18/04/2017 Fourth update

Updated fundamental data from Kaggle for NYSE

19/04/2017 Fifth update

Supporting Python 3.5 on Windows 10
Significant improvement in accuracy

29/04/2017 Sixth update

^GSPC data since 1970 has been added: more training data and higher accuracy
7 years of test data
Object-oriented programming
Hyperparameters for dropout have been tested

08/05/2017 Seventh update

All Hyperparameters have been tested and results have been uploaded.
Fixed comment for the data loader
More technical analysis like volume, moving average and other indexes will be added

28/05/2017 Eighth update

Using Quandl instead of Pandas datareader
Correlation heatmap has been added
Using Adjusted OHLCV for the network
All functions can be loaded from lstmstock.py
A Quandl API key is provided temporarily for those who do not have a Quandl account
Moving averages have been added
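To illustrate the moving-average feature mentioned above, here is a small sketch of a simple moving average over closing prices. The function name and the window length are assumptions; the periods used in `lstmstock.py` are not stated here.

```python
def simple_moving_average(values, window):
    """Return the mean of each run of `window` consecutive values."""
    if window <= 0:
        raise ValueError("window must be positive")
    averages = []
    running = 0.0
    for i, value in enumerate(values):
        running += value
        if i >= window:
            running -= values[i - window]  # drop the value leaving the window
        if i >= window - 1:
            averages.append(running / window)
    return averages


closes = [1.0, 2.0, 3.0, 4.0, 5.0]
print(simple_moving_average(closes, 3))  # [2.0, 3.0, 4.0]
```

The output is shorter than the input by `window - 1` entries, because the first full window only completes on day `window`.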

How to use Quandl

With this link, you should be able to get the historical price data of a particular stock after logging in. Use Export > Python > API key and insert the API key into your model. https://www.quandl.com/product/WIKIP/WIKI/PRICES-Quandl-End-Of-Day-Stocks-Info

