clarification about model input parameters #70

RiccardoBarb · 2022-09-19T08:05:52Z

Hello, I have a couple of really basic questions concerning the input variables of the model.

Is it correct that media_data contains impressions or clicks for each channel but NOT the corresponding costs?
Concerning the costs variable, is it to be considered a single vector tracking the global costs (the sum of each channel spend) or rather a matrix with a column for each channel spend?
Are the costs to be intended as absolute values, such as the spend for a specific channel o a given day/week, or rather the cost per thousand impressions / cost per click / marketing cost per unit?

Cheers

pabloduque0 · 2022-09-19T08:24:52Z

Hello @RiccardoBarb !

Media data can contain impressions, clicks or spend (cost). But if is indeed impressions or clicks it wont contain cost as you said.
A single vector indicating the total cost of the channel in the data.
When we talk about costs we refer total costs. For optimisation and other parts we refer to price as the price per unit of an impression/click.

Hope it helps! Let me know if anything needs further clarification!

RiccardoBarb · 2022-09-19T12:21:40Z

hi @pabloduque0 thanks for your response!

What would be the recommended variable to use for media_data? would it be better to use spend or clicks/impressions?.
Also if I decide to use spend as media_data, then wouldn't cost be redundant information? would it still be needed to fit the model?
As far as the second answer is concerned, what if I have multiple media channels? Should I insert a single vector for each channel (which, in this case, is in fact equivalent to a matrix)?

Thanks

pabloduque0 · 2022-09-19T13:19:06Z

Between clicks and impressions, impressions are preferred.

In reality the costs are used for the media prior (if you are referring to mmm.fit(..., media_prior=costs) therefore costs are still relevant as they serve as the prior. If you want to use a different prior then you mostly wont need costs. The only place where costs are "mandatory" is for ROI calculation (for obvious reasons).

Apologies if that was not clear before, what I mean is a single vector indicating the total cost each channel in the data, with ONE value per channel, so the total costs. So always one vector with one value per media channel. Eg. for three channels your costs could be jnp.array([1, 2, 3]).

RiccardoBarb · 2022-09-19T14:40:57Z

Thanks a lot @pabloduque0 it's much clearer now :)

Cheers

pabloduque0 self-assigned this Sep 19, 2022

pabloduque0 added the question Further information is requested label Sep 19, 2022

pabloduque0 closed this as completed Sep 19, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

clarification about model input parameters #70

clarification about model input parameters #70

RiccardoBarb commented Sep 19, 2022

pabloduque0 commented Sep 19, 2022

RiccardoBarb commented Sep 19, 2022

pabloduque0 commented Sep 19, 2022

RiccardoBarb commented Sep 19, 2022

clarification about model input parameters #70

clarification about model input parameters #70

Comments

RiccardoBarb commented Sep 19, 2022

pabloduque0 commented Sep 19, 2022

RiccardoBarb commented Sep 19, 2022

pabloduque0 commented Sep 19, 2022

RiccardoBarb commented Sep 19, 2022