Train - test split (allready seen samples) #38

illUkc · 2021-03-08T15:43:55Z

Hello,

First of all great work Robert.

I find one big mistake ( everyone do that ) in backtesting.py -> row 40 - u are using shuffle = True ( by default is true in train_test_split ) and when u doing i+1 or i+x targets data is already seen when doing learning. Because of that u get always different result when running backtesting.py. If u change shuffle = False u will get 45-50% less of trades and Accuracy score will drop to 0.6/0.65 max.

Best

robertmartin8 · 2021-03-09T03:11:27Z

@illUkc this is the mistake that's being referred to in the readme

Tom-Ryder · 2021-04-07T10:13:15Z

There's another important caveat btw - the data is biased to companies who outperform the market. That is, deciding to buy all of the shares in a 20% test split and you will outperform the market by ~4%.

robertmartin8 closed this as completed Mar 9, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Train - test split (allready seen samples) #38

Train - test split (allready seen samples) #38

illUkc commented Mar 8, 2021

robertmartin8 commented Mar 9, 2021

Tom-Ryder commented Apr 7, 2021 •

edited

Train - test split (allready seen samples) #38

Train - test split (allready seen samples) #38

Comments

illUkc commented Mar 8, 2021

robertmartin8 commented Mar 9, 2021

Tom-Ryder commented Apr 7, 2021 • edited

Tom-Ryder commented Apr 7, 2021 •

edited