MambaQuant implements a stock price prediction model based on the Mamba architecture (Selective Structured State Space Sequence models, S6). Mamba has achieved remarkable success in sequence modeling, offering linear time complexity while matching Transformer-level performance.
This repository leverages historical stock data to predict future price trends using a Sliding Window approach and includes a dedicated inference step for forecasting the next trading day's price.
- Mamba Architecture: Efficiently handles long time-series data with lower memory usage than Transformers.
- Auto Data Fetching: Integrated with yfinance to automatically download stock data (supports global tickers, e.g., 2330.TW, AAPL, NVDA).
- Sliding Window: Uses historical windows (e.g., past 20 days) to predict the next time step, preventing look-ahead bias.
- Future Inference: Automatically predicts the stock price for the upcoming trading day (T+1) after training.
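The sliding-window setup above can be sketched as follows. This is an illustrative helper (the function name `make_windows` is hypothetical, not the repository's actual code): each input row holds the past `seq-len` closing prices, and the target is the price on the following day, so no future information leaks into the inputs.

```python
import numpy as np

def make_windows(prices, seq_len=20):
    """Build (X, y) pairs: each X row is `seq_len` past prices,
    and y is the price on the following day (no look-ahead)."""
    X, y = [], []
    for i in range(len(prices) - seq_len):
        X.append(prices[i:i + seq_len])   # days t-seq_len .. t-1
        y.append(prices[i + seq_len])     # day t (the target)
    return np.array(X), np.array(y)

# Synthetic example: 25 "days" of prices yield 5 training samples.
prices = np.arange(25, dtype=float)
X, y = make_windows(prices, seq_len=20)
print(X.shape, y.shape)  # (5, 20) (5,)
```

With a 20-day window over 25 days of data, only 5 targets have a full history behind them, which is why the usable sample count is always `len(prices) - seq_len`.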
We use uv for high-speed dependency management and environment setup.
First, clone the repository:
```bash
git clone https://github.com/LouieLK/MambaQuant.git
cd MambaQuant
```
Sync dependencies (this will automatically create a virtual environment and install PyTorch with CUDA support if configured):
```bash
uv sync
```
You can run the training script directly using uv run. The script handles data downloading, preprocessing, training, and visualization automatically.
```bash
# With an NVIDIA GPU
uv run python main.py --use-cuda

# CPU only
uv run python main.py
```
Train on TSMC (2330.TW) with a 60-day sliding window:
```bash
uv run python main.py --ts-code 2330.TW --seq-len 60 --use-cuda
```
The model behavior can be customized using command-line arguments. Here is the full list of available options:
| Argument | Type | Default | Description |
|---|---|---|---|
| --use-cuda | Flag | False | Enable CUDA training (requires NVIDIA GPU). |
| --ts-code | str | 2330.TW | Stock ticker symbol (e.g., 2330.TW, AAPL). |
| --seq-len | int | 20 | Size of the sliding window (lookback period). |
| --epochs | int | 50 | Number of training epochs. |
| --batch-size | int | 64 | Batch size for training. |
| --lr | float | 0.001 | Learning rate. |
| --hidden | int | 32 | Dimension of the hidden state in the Mamba layers. |
| --layer | int | 2 | Number of stacked Mamba layers. |
| --n-test | int | 365 | Number of days to use for the test set (backtesting). |
| --wd | float | 1e-5 | Weight decay (L2 regularization). |
| --seed | int | 1 | Random seed for reproducibility. |
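The option table above maps directly onto a standard `argparse` parser. The sketch below mirrors the documented defaults for illustration; it is an assumption about how `main.py` is wired, not a copy of it:

```python
import argparse

def build_parser():
    """Minimal parser matching the documented CLI options."""
    p = argparse.ArgumentParser(description="MambaQuant training options")
    p.add_argument("--use-cuda", action="store_true", help="Enable CUDA training")
    p.add_argument("--ts-code", type=str, default="2330.TW", help="Stock ticker symbol")
    p.add_argument("--seq-len", type=int, default=20, help="Sliding-window size")
    p.add_argument("--epochs", type=int, default=50, help="Training epochs")
    p.add_argument("--batch-size", type=int, default=64, help="Batch size")
    p.add_argument("--lr", type=float, default=0.001, help="Learning rate")
    p.add_argument("--hidden", type=int, default=32, help="Mamba hidden dimension")
    p.add_argument("--layer", type=int, default=2, help="Number of Mamba layers")
    p.add_argument("--n-test", type=int, default=365, help="Test-set days")
    p.add_argument("--wd", type=float, default=1e-5, help="Weight decay")
    p.add_argument("--seed", type=int, default=1, help="Random seed")
    return p

# Parse the TSMC example from the quick-start section.
args = build_parser().parse_args(["--ts-code", "2330.TW", "--seq-len", "60"])
print(args.ts_code, args.seq_len, args.use_cuda)  # 2330.TW 60 False
```

Note that `argparse` converts dashes to underscores, so `--seq-len` is read back as `args.seq_len`.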
This project is a fork and enhanced implementation of zshicode/MambaStock. Maintained by Louie Huang (GitHub: @LouieLK).
While the core Mamba architecture follows the original design, this project focuses on turning it into a production-ready tool. Key differences and contributions include:
- Global Market Support: Integrated `yfinance` to support dynamic data fetching for global stocks (US, Taiwan, crypto), removing the dependency on static CSV files.
- Production-Ready Pipeline: Refactored the codebase into a modular structure (`src/`, `data/`) and migrated to `uv` for deterministic dependency management.
- Advanced Inference Logic: Implemented a sliding-window mechanism to prevent look-ahead bias and added a dedicated future-inference step to predict the next trading day's price (T+1).
- DevOps & Usability: Added robust CLI argument parsing, bilingual documentation (English/Chinese), and streamlined CUDA/CPU execution flows.
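The future-inference step described above can be sketched like this: after training, the most recent `seq-len` observed prices form one final input window, and the model's output on that window is the T+1 estimate. The `moving_average` predictor below is a deliberately trivial stand-in for the trained Mamba network, used only to make the example self-contained:

```python
import numpy as np

def predict_next_day(model, prices, seq_len=20):
    """Feed the most recent `seq_len` prices to the model to
    estimate the next (still unobserved) trading day's price."""
    window = np.asarray(prices[-seq_len:], dtype=float)
    return model(window)

# Stand-in predictor for illustration only: mean of the window.
moving_average = lambda window: float(window.mean())

prices = list(range(1, 31))  # 30 days of synthetic prices
t_plus_1 = predict_next_day(moving_average, prices, seq_len=20)
print(t_plus_1)  # mean of days 11..30 = 20.5
```

The key point is that the T+1 window ends at the last observed day, so the forecast never conditions on data that would not be available at prediction time.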
I plan to introduce the following features in future updates. Contributions are welcome!
- Graphical User Interface (GUI): Implement Streamlit or Gradio dashboards for interactive visualization and parameter tuning.
```bibtex
@article{shi2024mamba,
  title={MambaStock: Selective state space model for stock prediction},
  author={Shi, Zhuangwei},
  journal={arXiv preprint arXiv:2402.18959},
  year={2024}
}
```

