Replies: 3 comments
-
Thanks @strcat32 for the praise and your question. Connection between Jurkatis and Grauer et al.
There also some notable differences:
For more detailed insights, I guess it's best to reach out to the authors directly. They are more eligible to comment on conceptual differences and their reasoning. Obtaining True Labels
As you can see, all approaches have some imperfections, I'd base the decision on the data, to which you got access to. Hope this helps. Best, [1]: Jurkatis, S., 2020. Inferring trade directions in fast markets. Journal of Financial Markets, Vol 58, 100635 |
Beta Was this translation helpful? Give feedback.
-
Hey, thank you for your very extensive answer and overview. Much appreciated. :) Regarding the trade size rule, i might have misunderstood something. trade size rule So my understanding is when for example the trade size matches the contemporaneous ask size, this is most likely an aggressive buyer crossing the spread to buy the lowest offer immediately, taking the offer of the book. This would be classified as 1 (Buy) in Jurkatis "unique match" arm of the FI algorithm. But as you linked this trade rule is reversed in your and Grauer et. al. approach to classifying in the options market with the rationale that it happens more often that the aggressor of a trade tries to price improve with a limit order on the other side of the book and apparently market makers often allow this and do not force the trade to cross the book? applicability Another thing i was thrown off by is the reverse tick test as a fallback. Of course from a pure research perspective it makes sense if it improves the score of the model. ground truth Thank you for listing some interesting approaches to obtain usable labels. I was thinking about these, but i might go a whole different route. I am at the moment acquiring a ton of stock+options trades and quotes from SIP and OPRA data (which was the only thing affordable for me). Your work (amongst others of course) and comments here have massively helped me. Thank you again! :) |
Beta Was this translation helpful? Give feedback.
-
Thanks @strcat. Tradesize Rule I also agree with your understanding on the trade size rule. The authors empirically motivate the tradesize rule by the use of limit orders filled by market makers. Recall from the previous comment on true labels, that their definition of the trade initiator is the party opposite to the market maker. The rationale is, that the market maker or broker only provides liquidity to the investor and the trade would not exist without the initial investor’s demand. Picking up our previous example, the limit sell order would be filled by the marketmaker, hence -1=sell. Therefore, the reversed sign makes sense. Just to be clear, I did not contribute to the Grauer et. al paper. I just worked on related research with some of the authors. All in all, I'd use a definition for the true label that works best for you and choose classification approaches accordingly. It might also be wise to perform an explanatory data analysis to learn about order types, trading frequency and execution of trades within the spread. Guess its also enlightening for a discussion. Reverse Tick test In the work of Grauer et al., the choice of the rev. tick test is empircally motivated, as far as I know. Overall, the differences between the reverse and standard tick test are minor for their work. This is due to the fact, that only few trades are classified by the rev. tick test and that both perform poorly/similar to a random guess ( Depending on the setting, forward looking indicators may not be a good idea. If you are in an online setting, be sure to use past indicators only. Application Study Good luck with your paper. Once you published a draft, feel free to open an issue here, so that we can share it in our community page. Best, PS: Just noticed a typo in my comment above regarding Odders-White's definition, which is now fixed. Footnotes
|
Beta Was this translation helpful? Give feedback.
-
Hey, first of all very nice and interesting work. Also thanks for making code public. :)
I just have a question. Are you familiar with Jurkatis, S., (2020)? [1]
I feel like some of the reasoning of chaining rules of the FI algorithm using quotes, depth and ticks are very similar. Though i am not yet informed enough to comment on the differences and how it may apply to the options market in contrast to the stock market (which is what Jurkatis focused on).
I am still trying to procure data for both markets to make some experiments but finding ground truth for the aggressor side for these markets is not easy.
Very interested in your thoughts about that. If this is the wrong forum, i am sorry and it can be disregarded.
[1] Jurkatis, S., 2020. Inferring trade directions in fast markets. Journal of Financial Markets, Vol 58, 100635
Beta Was this translation helpful? Give feedback.
All reactions