An advanced reinforcement learning agent for algorithmic trading of crypto markets on Binance Futures.
- Deep reinforcement learning agent with Proximal Policy Optimization (PPO)
- Support for various neural network architectures (CNN, LSTM, Transformer)
- Live trading on Binance Futures with proper risk management
- Position sizing based on volatility and risk metrics (fixed fraction, Kelly Criterion, volatility-adjusted)
- Integration of market data signals (funding rates, open interest, volatility)
- Stop loss and take profit management (handled by the execution logic)
- Dynamic leverage adjustment based on market conditions
1. Clone this repository:

   ```bash
   git clone https://github.com/DeeNA95/trading_bot.git
   cd trading_bot
   ```
2. Create a virtual environment and install dependencies:

   ```bash
   # Using venv
   python -m venv venv
   source venv/bin/activate  # On Windows: venv\Scripts\activate

   # Install dependencies
   pip install -r requirements.txt
   ```
3. Set up your API keys:

   - Create a `.env` file in the project root
   - Add your Binance API keys (for testnet and/or mainnet):

   ```bash
   # For testnet
   BINANCE_KEY=your_testnet_api_key
   BINANCE_SECRET=your_testnet_secret_key
   testnet_url=https://testnet.binancefuture.com

   # For mainnet (when ready for live trading)
   BINANCE_KEY=your_mainnet_api_key
   BINANCE_SECRET=your_mainnet_secret_key
   ```
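At runtime the bot needs these keys in its process environment. As a minimal, dependency-free sketch of how a dotenv-style file can be loaded (real projects typically use the `python-dotenv` package; the function below is illustrative, not the project's actual loader):

```python
import os

def load_env(path: str = ".env") -> None:
    """Load KEY=value lines from a dotenv-style file into os.environ."""
    with open(path) as fh:
        for line in fh:
            line = line.strip()
            # Skip blank lines and comments.
            if not line or line.startswith("#"):
                continue
            key, _, value = line.partition("=")
            # Do not overwrite variables already set in the environment.
            os.environ.setdefault(key.strip(), value.strip())
```

Keep `.env` out of version control; it holds live credentials.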
```bash
# Start both backend API and React frontend
./start.sh
# OR
python start_app.py
```
Access your application:
- Frontend UI: http://localhost:3000
- Backend API: http://localhost:8080
```bash
# Terminal 1: start the backend API
python start_backend.py

# Terminal 2: start the React frontend
cd react-frontend
npm start
```
- Backend API Server (Flask) - Port 8080
  - REST API endpoints for training/inference
  - WebSocket for real-time logs
  - Manages training and inference processes
- React Frontend - Port 3000
  - Modern Material-UI interface
  - Real-time training/inference monitoring
  - Configuration panels for model parameters
```bash
# Generate data for BTCUSDT with 15-minute candlesticks for the past 30 days
python data.py --symbol BTCUSDT --interval 15m --days 30 --output data/btcusdt_15m.csv

# Split data into training and testing sets
python data.py --symbol BTCUSDT --interval 15m --days 60 --split --output data/btcusdt_15m_split.csv
```
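For time-series data, a train/test split should be chronological rather than shuffled, so the test period strictly follows the training period and no future information leaks into training. A hedged sketch of such a split (the actual logic inside `data.py` may differ):

```python
def chrono_split(rows, train_frac=0.8):
    """Split an ordered sequence of candles into train/test without shuffling,
    so the test set always lies after the training set in time."""
    if not 0.0 < train_frac < 1.0:
        raise ValueError("train_frac must be in (0, 1)")
    cut = int(len(rows) * train_frac)
    return rows[:cut], rows[cut:]
```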
```bash
# Train with default parameters (often specified in a config file)
# Example using a simplified command structure; actual training may involve a config file:
python train.py --data_path data/btcusdt_15m_train.csv --symbol BTCUSDT --model_type lstm --train_episodes 100 --output_dir models/

# Train an encoder-only transformer (example)
# This typically involves setting num_decoder_layers to 0 in the model configuration.
# If using a JSON config: "num_decoder_layers": 0
python train.py --data_path data/btcusdt_15m_train.csv --symbol BTCUSDT --model_type transformer --train_episodes 200 --leverage 3 --window_size 50 --output_dir models/ --model_config '{"num_encoder_layers": 6, "num_decoder_layers": 0, "embedding_dim": 128}'
```
Note: `train.py` accepts many arguments. For complex model configurations, such as specifying layer counts, pass a JSON config file via the `--config` argument or a JSON string via `--model_config` (as shown above). The keys supplied to `--model_config` must match the fields of the `ModelConfig` dataclass.
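The mapping from a `--model_config` JSON string onto dataclass fields can be sketched as follows. The field names below are illustrative (taken from the example command); the real set of fields is defined by the project's `ModelConfig`:

```python
import json
from dataclasses import dataclass, fields

@dataclass
class ModelConfig:
    # Illustrative fields only; the project's ModelConfig defines the real set.
    num_encoder_layers: int = 6
    num_decoder_layers: int = 6
    embedding_dim: int = 128

def parse_model_config(raw: str) -> ModelConfig:
    """Parse a JSON string into a ModelConfig, rejecting unknown keys early."""
    data = json.loads(raw)
    valid = {f.name for f in fields(ModelConfig)}
    unknown = set(data) - valid
    if unknown:
        raise ValueError(f"Unknown ModelConfig keys: {sorted(unknown)}")
    return ModelConfig(**data)
```

Validating keys up front turns a typo in the JSON into an immediate error instead of a silently ignored setting.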
```bash
# Backtest a trained model
python backtest.py --symbol BTCUSDT --data_path data/btcusdt_15m_test.csv --model_path models/BTCUSDT_lstm_YYYYMMDD_HHMMSS/model.pt --model_type lstm --output_dir results/
```
```bash
# Live trading on testnet
python inference.py --symbol BTCUSDT --model_path models/BTCUSDT_lstm_YYYYMMDD_HHMMSS/model.pt --model_type lstm --interval 1m --testnet --output_dir logs/

# Live trading on mainnet (use with caution!)
python inference.py --symbol BTCUSDT --model_path models/BTCUSDT_lstm_YYYYMMDD_HHMMSS/model.pt --model_type lstm --interval 1m --output_dir logs/
```
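A useful safety property of this flag design is that mainnet is only reached when `--testnet` is omitted, never by accident through a bad value. A minimal sketch of how such a flag might select the REST base URL (the `--testnet` flag matches the commands above; the endpoint constants and selection code in `inference.py` are assumptions):

```python
import argparse

TESTNET_URL = "https://testnet.binancefuture.com"
MAINNET_URL = "https://fapi.binance.com"

def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(description="Live trading runner")
    parser.add_argument("--symbol", required=True)
    parser.add_argument("--testnet", action="store_true",
                        help="Trade against the Binance Futures testnet")
    return parser

def base_url(args: argparse.Namespace) -> str:
    # store_true makes testnet an explicit opt-in; omitting the flag
    # means real funds are at risk.
    return TESTNET_URL if args.testnet else MAINNET_URL
```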
- `rl_agent/` - Core RL agent implementation
  - `agent/` - PPO agent (`ppo_agent.py`), neural network models (`models.py`), LSTM core (`lstm_core.py`), feature extractors (`feature_extractors.py`), various attention mechanisms (`attention/`), and transformer blocks (`blocks/`)
  - `environment/` - Trading environment (`trading_env.py`), position sizing strategies (`position_sizer.py`), trade execution logic (`execution.py`), and reward functions (`reward.py`)
  - `training/` - Scripts and utilities for training models
    - `trainer.py` - Main training loop and evaluation logic
    - `model_factory.py` - Creates model instances (including `DynamicTransformerCore`)
    - `load_preprocess_data.py` - Data loading for training
- `data.py` - Script for fetching, preprocessing, and saving market data
- `train.py` - Script for initiating the training process for the RL agent
- `backtest.py` - Script for backtesting trained models on historical data
- `inference.py` - Script for running live trading with trained models
- `requirements.txt` - Python package dependencies
- `.env_example` - Example environment file for API keys (user should create `.env`)
Convolutional Neural Networks are used for feature extraction from market data, capable of detecting spatial hierarchies of patterns. Implemented via extractor classes such as `BasicConvExtractor`, `ResNetExtractor`, and `InceptionExtractor`.
Long Short-Term Memory networks are effective for sequence modeling and capturing temporal dependencies in time series data. `LSTMCore` provides the LSTM implementation.
An advanced architecture that captures long-range dependencies and complex relationships in market data using self-attention mechanisms. The implementation includes `EncoderBlock` and `DecoderBlock`, various attention types (e.g., `MultiHeadAttention`, `PyramidalAttention`), and is dynamically constructed via `DynamicTransformerCore` in `model_factory.py`.
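The self-attention at the heart of these blocks can be illustrated with a minimal, dependency-free sketch of single-head scaled dot-product attention over plain Python lists (the project's `MultiHeadAttention` is of course a batched tensor implementation):

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def attention(Q, K, V):
    """Scaled dot-product attention: softmax(Q K^T / sqrt(d)) V.
    Q, K, V are lists of row vectors; d is the key dimension."""
    d = len(K[0])
    out = []
    for q in Q:
        # Similarity of this query to every key, scaled by sqrt(d).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in K]
        weights = softmax(scores)
        # Each output row is a weighted sum of the value vectors.
        out.append([sum(w * v[j] for w, v in zip(weights, V))
                    for j in range(len(V[0]))])
    return out
```

A query vector aligned with one key attends mostly to that key's value, which is how the model learns which past candles matter for the current decision.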
The bot implements several risk management strategies:

- Position Sizing: Adjusts position size based on account balance, market volatility, and the chosen strategy (e.g., Fixed Fraction, Kelly Criterion, Volatility-Adjusted). See `position_sizer.py`.
- Stop Loss/Take Profit: Discrete TP/SL orders are not placed on the exchange by default; instead, the agent learns to exit positions. Custom TP/SL logic can be integrated into the `BinanceFuturesExecutor` or the trading environment.
- Dynamic Leverage: Adjusts leverage based on market volatility or other factors, configured in the environment. See `calculate_adaptive_leverage` and `BinanceFuturesCryptoEnv`.
- Risk-Adjusted Rewards: Penalizes excess risk-taking during training through custom reward functions. See `reward.py`.
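The sizing strategies named above reduce to simple formulas. A hedged sketch (illustrative only; `position_sizer.py` defines the actual implementations and parameterization):

```python
def kelly_fraction(win_rate: float, win_loss_ratio: float) -> float:
    """Classic Kelly criterion: f* = p - (1 - p) / b,
    where p is the win rate and b the average win/loss ratio.
    Clamped at 0 so a negative edge means no position."""
    if win_loss_ratio <= 0:
        raise ValueError("win_loss_ratio must be positive")
    return max(0.0, win_rate - (1.0 - win_rate) / win_loss_ratio)

def volatility_adjusted_size(balance: float, risk_per_trade: float,
                             volatility: float) -> float:
    """Risk a fixed fraction of balance per trade, scaling the notional
    down as volatility rises so the expected dollar risk stays constant."""
    if volatility <= 0:
        raise ValueError("volatility must be positive")
    return balance * risk_per_trade / volatility
```

In practice the raw Kelly fraction is usually scaled down (e.g., half-Kelly), since estimation error in `win_rate` makes full Kelly aggressive.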
During training and backtesting, the bot tracks various performance metrics:
- Total return and drawdown
- Win rate and average trade profitability
- Sharpe ratio and other risk-adjusted metrics

These are typically logged and can be visualized using tools like TensorBoard or W&B (if integrated).
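Two of these metrics can be computed directly from per-period returns and the equity curve. A minimal sketch (the annualisation factor and return convention here are assumptions, not the project's exact definitions):

```python
import math

def sharpe_ratio(returns, risk_free=0.0, periods_per_year=365):
    """Annualised Sharpe ratio of per-period returns."""
    n = len(returns)
    excess = [r - risk_free for r in returns]
    mean = sum(excess) / n
    std = math.sqrt(sum((r - mean) ** 2 for r in excess) / n)
    if std == 0:
        return 0.0  # constant returns: ratio is undefined, report 0
    return mean / std * math.sqrt(periods_per_year)

def max_drawdown(equity):
    """Largest peak-to-trough decline of an equity curve, as a fraction."""
    peak = equity[0]
    worst = 0.0
    for value in equity:
        peak = max(peak, value)
        worst = max(worst, (peak - value) / peak)
    return worst
```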
This software is for educational and research purposes only. Use it at your own risk. The authors are not responsible for any financial losses incurred by using this software. Always start with paper trading (testnet) before committing real capital.