Set uses_full_dataframe=True for rolling and exponential primitives #2354
Should we do this for the Exponential primitives too, since they are window primitives as well?
@sbadithe Yes. Any time we need to access values from a previous row for feature calculation, we need to set this to True. This makes sure all the data is available when calculating features, and is particularly important if you run DFS with |
Codecov Report
@@ Coverage Diff @@
## main #2354 +/- ##
=======================================
Coverage 99.48% 99.48%
=======================================
Files 309 309
Lines 19771 19781 +10
=======================================
+ Hits 19670 19680 +10
Misses 101 101
Thanks for the explanation! That sounds good. I think it would make sense to add the Exponential primitives to this PR. Alternatively, we can update them as part of the addition of the Expanding primitives, or in a separate PR altogether.
@thehomebrewnerd I wonder if we should add an attribute We could default this to |
@gsheni No, this doesn't accomplish the same thing. This basically says that we need all the data during feature value calculation, but doesn't necessarily mean the order is important. Take We might need to consider adding an attribute to indicate that ordering is important separately. |
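The distinction drawn above, between needing all the data and needing it in a particular order, can be sketched in plain Python. This is a hypothetical illustration only, not necessarily the example the comment had in mind, and `percentile_rank` and `diff_from_previous` are made-up helper names rather than Featuretools primitives.

```python
def percentile_rank(values):
    """Fraction of all values that are <= each value (needs every row, ignores order)."""
    n = len(values)
    return [sum(1 for other in values if other <= v) / n for v in values]


def diff_from_previous(values):
    """Difference from the preceding row (needs every row AND depends on order)."""
    if not values:
        return []
    return [None] + [b - a for a, b in zip(values, values[1:])]


values = [10, 30, 20, 40]
shuffled = [40, 10, 20, 30]  # same rows, different order

# Key each result by its input value so the two orderings can be compared.
rank_original = dict(zip(values, percentile_rank(values)))
rank_shuffled = dict(zip(shuffled, percentile_rank(shuffled)))

diff_original = dict(zip(values, diff_from_previous(values)))
diff_shuffled = dict(zip(shuffled, diff_from_previous(shuffled)))

print(rank_original == rank_shuffled)  # order does not change any row's rank
print(diff_original == diff_shuffled)  # order changes the row-to-row differences
```

Both helpers would give wrong answers on a partial slice of the data, but only the second is also sensitive to row order, which is why a separate attribute for ordering might be warranted.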
Set uses_full_dataframe=True for rolling primitives
Sets `uses_full_dataframe = True` for the `Rolling*` and `Exponential*` primitives. This is necessary to ensure proper feature value calculation if `n_jobs` is greater than 1.
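To see why a window calculation needs the full data, here is a minimal sketch in plain Python. It assumes a simplified model in which rows are split into contiguous chunks and each chunk is processed independently. That is a stand-in for parallel workers, not Featuretools' actual chunking logic, and the function names are hypothetical.

```python
def rolling_mean(values, window):
    """Trailing rolling mean; None until the window is full."""
    out = []
    for i in range(len(values)):
        if i + 1 < window:
            out.append(None)
        else:
            out.append(sum(values[i + 1 - window : i + 1]) / window)
    return out


def rolling_mean_chunked(values, window, n_chunks):
    """Apply rolling_mean to each chunk on its own, with no access to earlier rows."""
    size = -(-len(values) // n_chunks)  # ceiling division
    out = []
    for start in range(0, len(values), size):
        out.extend(rolling_mean(values[start : start + size], window))
    return out


values = [1, 2, 3, 4, 5, 6]
full = rolling_mean(values, window=3)
chunked = rolling_mean_chunked(values, window=3, n_chunks=2)

print(full)     # [None, None, 2.0, 3.0, 4.0, 5.0]
print(chunked)  # [None, None, 2.0, None, None, 5.0]
```

In this sketch the first rows of the second chunk lose their window because the preceding rows live in a different chunk. Giving the calculation the full data avoids that gap.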