Stock-Price-Specific-LSTM

This is a basic LSTM application on stock (securities) price.

What's LSTM (Long Short-Term Memory)?

A popular LSTM introcution can be found at http://colah.github.io/posts/2015-08-Understanding-LSTMs/. It said: Long Short Term Memory networks – usually just called “LSTMs” – are a special kind of RNN, capable of learning long-term dependencies. They were introduced by Hochreiter & Schmidhuber (1997), and were refined and popularized by many people in following work.1 They work tremendously well on a large variety of problems, and are now widely used.

LSTMs are explicitly designed to avoid the long-term dependency problem. Remembering information for long periods of time is practically their default behavior, not something they struggle to learn!

The internal LSTM architecture is as the following:

How does LSTM work?

There is a good Illustrated Guide to LSTM (https://towardsdatascience.com/illustrated-guide-to-lstms-and-gru-s-a-step-by-step-explanation-44e9eb85bf21).

Forget gate

Input gate

Cell State

Output Gate

If you are not familar to LSTM, I strongly suggest you to check this guide.

Why LSTM?

By Wikipedia (https://en.wikipedia.org/wiki/Long_short-term_memory): LSTM networks are well-suited to classifying, processing and making predictions based on time series data, since there can be lags of unknown duration between important events in a time series. LSTMs were developed to deal with the exploding and vanishing gradient problems that can be encountered when training traditional RNNs. Relative insensitivity to gap length is an advantage of LSTM over RNNs, hidden Markov models and other sequence learning methods in numerous applications.

In short, LSTM is suitable to deal with a time series with long term patterns.

What's the specificity of stock market?

A lot of people, especially technical analysis supporters, believe (https://en.wikipedia.org/wiki/Technical_analysis#Principles):

Market action discounts everything
Prices move in trends
History tends to repeat itself

Of course, some other people believe the price moves as random walking and no one can gain extra by taking advantange of the price data.

Anyway, a lot of research shows the return is very similar to a normal distribution with fat tail.

What's the unique ?

There are a few good LSTM projects on stock price data. But the code shows that most of them didn't use any knowledge of stock market. The code is general to any data.

As an investement professional and an AI learner, I believe stock specific features are needed for stock price prediction.

So based on the code of https://github.com/jaungiers/LSTM-Neural-Network-for-Time-Series-Prediction, I demoed my ideas.

I still use the original model define:

Stock specific preprocessing. I use daily return instead of price directly. After my preprocessing, I don't need any normalization. For the daily return is normalized in a normal distribution already. You may think I normalized the whole data set instead of windowed data at the beginning. The original data is visualized as the following:
Stock specific post LSTM. In order to visualize the prediction, the predicted return is converted into price.

The model using window-normalised price data has the following prediction: The model using daily return data has the following prediction:

By the way, I added a lot of comments.

Is it practical to use LSTM to make money?

The results is better than I expected.

There is a good discussion on this topic: https://medium.com/@mikeharrisNY/machine-learning-often-a-complicated-way-of-replicating-simple-forecasting-methods-in-financial-25c38db2f624.

I agree that the algo is not so important and feature engineering is the key to success. In short, in order to make a successful mechine learning application on stock market, you should put your attension on your data. You cannot have complete stock market data. You have to balance your time, budget with the data universe, frenquency, delay and accuracy. Besides price data, you may need fundamental data, such as financial statements data.

Credit

Most code is modified from https://github.com/jaungiers/LSTM-Neural-Network-for-Time-Series-Prediction. So I didn't put my name in the source code.

I am working on a reinforcement learning project on stock market.

Please follow it at https://github.com/MRYingLEE/Stock.AI

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.vscode		.vscode
core		core
data		data
pandas/_libs		pandas/_libs
saved_models		saved_models
.gitignore		.gitignore
DailyPrices.csv		DailyPrices.csv
README.md		README.md
Visual Data.png		Visual Data.png
Visual Results by Return.png		Visual Results by Return.png
Visual_Results by Normalised Price.png		Visual_Results by Normalised Price.png
config.json		config.json
model summary.txt		model summary.txt
model.png		model.png
predictions.csv		predictions.csv
rate_predictions.csv		rate_predictions.csv
rate_y_test.csv		rate_y_test.csv
requirements.txt		requirements.txt
run.py		run.py
test.csv		test.csv
vs data.png		vs data.png
y_test.csv		y_test.csv

MRYingLEE/Stock-Price-Specific-LSTM

Folders and files

Latest commit

History

Repository files navigation

Stock-Price-Specific-LSTM

What's LSTM (Long Short-Term Memory)?

How does LSTM work?

Why LSTM?

What's the specificity of stock market?

What's the unique ?

Is it practical to use LSTM to make money?

Credit

I am working on a reinforcement learning project on stock market.

About

Topics

Resources

Stars

Watchers

Forks

Languages