A collection of RL financial applications; part of a talk at BADS2018
This agent demonstrates how Q-learning can be applied to a real-world problem. Retail trading is chosen because it resembles many other real-world problems, such as ad bidding. We built the environment from minute-level Bitcoin price data from Kaggle, and we use the original deep Q-learning implementation described in the paper Playing Atari with Deep Reinforcement Learning. For more advanced implementations as well as other methods, check out our bi-weekly Bangkok School of AI Reinforcement Learning Workshop.
Disclaimer: You will not get rich with this algorithm. We do not hold any Bitcoin positions as of 2018-10-31.
The environment is based on Bitcoin prices from 2016-08-01 13:21:00 to 2016-10-10 00:00:00. Each state consists of 6 * 60 values: the batch-normalized OHLC prices, VWAP, and current position over the previous 60 minutes. There are 3 available actions: enter a short position, do nothing, and enter a long position. In our example notebook, we run the agent for 10,000 timesteps. The reward is the differential Sharpe ratio.
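The differential Sharpe ratio (Moody & Saffell) turns the usual whole-history Sharpe ratio into a per-step reward by tracking exponential moving averages of returns and squared returns. The sketch below illustrates the idea; the function name and the decay rate `eta` are our choices here, not necessarily what the environment uses internally.

```python
def differential_sharpe(returns, eta=0.01):
    """Per-step differential Sharpe ratio rewards from a stream of returns.

    A and B are exponential moving estimates of the first and second
    moments of the return; D is the sensitivity of the Sharpe ratio
    to the newest return, used as the step reward.
    """
    A, B = 0.0, 0.0
    rewards = []
    for R in returns:
        dA = eta * (R - A)
        dB = eta * (R * R - B)
        denom = (B - A * A) ** 1.5
        # guard against the degenerate variance at the very first steps
        D = (B * dA - 0.5 * A * dB) / denom if denom > 1e-12 else 0.0
        rewards.append(D)
        A += dA
        B += dB
    return rewards
```

A small `eta` makes the moving averages (and hence the reward) smoother but slower to react to regime changes.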
Actions look like:
* 0 - short
* 1 - nothing
* 2 - long
Action size: 3
States look like: (1, 6, 60)
- Install dependencies: `pip install -r requirements.txt`
- Follow `position_sandbox.ipynb` to train the agent.
Our implementation is divided as follows:
- `replay_memory.py` - Experience Replay Memory
- `agent.py` - Agent
- `qnetwork` - Q-networks for local and target
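An experience replay memory can be as simple as a fixed-size buffer of transitions sampled uniformly at random. The sketch below shows the idea; the class and method names are illustrative and may not match `replay_memory.py` exactly.

```python
import random
from collections import deque, namedtuple

Experience = namedtuple(
    "Experience", ["state", "action", "reward", "next_state", "done"]
)

class ReplayMemory:
    """Fixed-size buffer of experience tuples for off-policy replay."""

    def __init__(self, capacity):
        # deque with maxlen silently drops the oldest experience when full
        self.buffer = deque(maxlen=capacity)

    def add(self, state, action, reward, next_state, done):
        self.buffer.append(Experience(state, action, reward, next_state, done))

    def sample(self, batch_size):
        # uniform random sampling breaks the temporal correlation
        # between consecutive transitions, stabilizing Q-learning updates
        return random.sample(self.buffer, batch_size)

    def __len__(self):
        return len(self.buffer)
```

Uniform sampling is what the original Atari DQN paper used; prioritized variants weight transitions by TD error instead.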
These are the steps you can take to train the agent with default settings.

- Initiate the environment.

```python
env = SingleStockMarket(bitstamp_df)
```

- Create an experience replay memory.

```python
mem = ReplayMemory(10000)
```

- Create an agent.

```python
a = VanillaQAgent(replay_memory=mem)
```

- Train the agent.

```python
state = env.reset()
for i in trange(10000):
    # select action
    action = a.act(state, i)
    # step the environment and learn from the transition
    next_state, reward, done, info = env.step(action)
    a.step(state, action, reward, next_state, done)
    state = next_state
```
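Note that `a.act(state, i)` receives the timestep, presumably so the agent can anneal its exploration rate over training. A minimal epsilon-greedy sketch of that idea follows; the function name and the decay schedule are assumptions for illustration, not the repo's exact code.

```python
import random

def epsilon_greedy(q_values, t, eps_start=1.0, eps_end=0.01, decay=0.995):
    """Epsilon-greedy action selection with exponential decay over timestep t.

    With probability epsilon, pick a random action (explore);
    otherwise pick the action with the highest Q-value (exploit).
    """
    eps = max(eps_end, eps_start * decay ** t)
    if random.random() < eps:
        return random.randrange(len(q_values))   # explore: 0 short, 1 hold, 2 long
    return q_values.index(max(q_values))          # exploit
```

Early in training epsilon is near 1 and actions are almost random; after enough steps it settles at `eps_end`, so the agent mostly exploits its learned Q-values.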
Work in progress