The advantage of incremental learning over supervised learning methods is that it is additive depends on the subjective judgment of humans (who label the data), thus minimizing Significant overfitting. In addition, we can flexibly adjust the purpose of the model (total return or expect return, ...). If you can fit a good RL model, it will most likely work good move in the future.
- I added more than 50 technical indicators as the new features. Then perform principal component analysis to get the 20 most informative components from the technical indicators.
- I built the trading environment as shown above and trained the reinforcement learning agent using PPO algorithms.