
Decomposition vs. Prediction #85

Closed
wpro-ds opened this issue May 26, 2021 · 2 comments

@wpro-ds

wpro-ds commented May 26, 2021

Hi Robyn team,

Thanks for the great package! I have been experimenting with it and seeing positive results. My question is somewhat related to #79. In resolving that issue, you mentioned that Robyn is meant to be used as a decomposition tool rather than a prediction tool. I think it would be useful to have some predictive functionality in the model. My questions are the following:

  1. How do we ensure that the model is reliable (i.e. validate the model) and that we can trust its recommendations? In classic ML approaches, we answer this question based on prediction error on hold-out data (a rough sketch of what I mean is below this list). In the absence of this predictive functionality in Robyn, what approaches do you recommend? P.S. This is a critical issue for getting buy-in when requesting increased budgets :)

  2. In Make predictions #79, you mentioned that it is controversial how best to provide the future dataframe for intercept/trend/season/other baselines. Could you shed some light on that? What are the issues?
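
To illustrate what I mean in point 1, this is roughly the kind of time-ordered hold-out check I have in mind. It's only a sketch with made-up file and column names and a generic ridge fit, not anything from Robyn's API:

```python
# Rough sketch of hold-out validation (hypothetical column names; this is
# not a Robyn function, just a generic time-ordered split):
# fit on the earliest 80% of weeks, measure prediction error on the rest.
import pandas as pd
from sklearn.linear_model import Ridge
from sklearn.metrics import mean_absolute_percentage_error

df = pd.read_csv("mmm_data.csv").sort_values("ds")  # hypothetical weekly data
cutoff = int(len(df) * 0.8)
train, test = df.iloc[:cutoff], df.iloc[cutoff:]

features = ["tv_spend", "search_spend"]             # hypothetical predictors
model = Ridge(alpha=1.0).fit(train[features], train["revenue"])
print("hold-out MAPE:", mean_absolute_percentage_error(
    test["revenue"], model.predict(test[features])))
```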

Again, thanks for this wonderful package; I'm looking forward to future releases.

@gufengzhou
Contributor

Hi, thanks for trying out Robyn!

  1. For your information, we removed the time-series out-of-sample validation about a month ago. One important reason is that we want to build a new feature that lets MMM users refresh the initial model with new data, which directly conflicts with our previous OOS validation approach. As you know, Robyn uses ridge regression, an approach that prevents overfitting by design. To be precise, we do run a 100-fold cross-validation for the ridge lambda (see the first sketch below this list). This is the main reason we're confident going without time-series OOS validation.
  2. For example, if you use Prophet for forecasting, you'll need to provide the future dataframe (see the Prophet sketch below this list). While for some predictors (trend/season/weekday etc.) you can use the default predicted values from Prophet, for other predictors you have to make some strong assumptions. For example, if you have competitor activity as a predictor, you'd have to somehow predict the future competition itself first. Weather as a predictor is another example: you'd need to forecast it, which is a topic of its own. Another example would be Covid, if you have it in the model; we all know it's not easy to predict Covid. That's why.
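
To make the lambda point above a bit more concrete, here's a minimal sketch of cross-validating the ridge penalty over a grid of candidate values. It uses scikit-learn rather than Robyn's actual implementation, and the data file, column names, fold count, and grid are hypothetical:

```python
# Minimal sketch (not Robyn's actual code): pick the ridge penalty strength
# by cross-validation over a grid of candidate values, analogous to the
# lambda cross-validation described above. Column names are hypothetical.
import numpy as np
import pandas as pd
from sklearn.linear_model import RidgeCV

df = pd.read_csv("mmm_data.csv")                          # hypothetical input file
X = df[["tv_spend", "search_spend", "competitor_index"]]  # hypothetical predictors
y = df["revenue"]                                         # hypothetical target

alphas = np.logspace(-3, 3, 100)                # 100 candidate penalty values
model = RidgeCV(alphas=alphas, cv=5).fit(X, y)  # evaluated by cross-validation
print("selected penalty:", model.alpha_)
```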
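
And here's a minimal sketch of the future-dataframe problem with Prophet: trend and seasonality are filled in by make_future_dataframe() and the model itself, but any extra regressor (competition, weather, Covid) has to be supplied for the future periods by you. Again, the file and column names are made up, and this is not Robyn code:

```python
# Minimal sketch (not Robyn's actual code) of why forecasting needs a
# future dataframe: Prophet handles trend/seasonality itself, but every
# extra regressor must be provided for the future rows by the user.
import pandas as pd
from prophet import Prophet

df = pd.read_csv("mmm_data.csv")  # hypothetical: columns ds, y, competitor_index

m = Prophet()
m.add_regressor("competitor_index")
m.fit(df[["ds", "y", "competitor_index"]])

future = m.make_future_dataframe(periods=13, freq="W")
# Prophet knows nothing about future competition -- supplying these values is
# an assumption the modeller must make (here: naively carry the last value forward).
future["competitor_index"] = df["competitor_index"].reindex(future.index).ffill()
forecast = m.predict(future)
```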

To summarise, Robyn's recommendations are ultimately based on your model choices. If you ask how you can know whether you've selected the "right" decay and saturation for your media, the only way to know is experimental calibration. In the spirit of "all models are wrong, some are useful", we believe only experiments can give you certainty; a model that is closer to experiment is therefore "more correct". Hope it makes sense.

@wpro-ds
Author

wpro-ds commented Jun 2, 2021

Thanks for the responses! I appreciate it and look forward to the new features.

wpro-ds closed this as completed Jun 2, 2021