-
Notifications
You must be signed in to change notification settings - Fork 778
Description
Hello,
I saw that in the Case Study - Customer Segmentation at An Online Media Company, you take the natural logarithm of the price.
In my case, I also use CausalForestDML and prices can vary from below 1 to way more. Applying the logarithm function distorts the value distribution of prices, especially for prices below 1 and creates negative treatment values. For the moment, I transform the data by adding a constant of 1 to prices (before applying logarithm) to prevent this as illustrated below.
Product | Price | log(Price) | log(Price+1) |
---|---|---|---|
A | 0.1 | -2.302 | 0.095 |
A | 0.5 | -0.693 | 0.405 |
A | 10 | 2.303 | 2.398 |
I also perform the same modification on the demand values, as the demand can be 0 for some periods.
This felt the most natural way in the situation. What do you think about this modification? Will it have positive, neutral or negative impact on the model estimation?
I should also mentionned that I have a multi treatment use case where the other treatments are the base treatment raised to different powers. Therefore any change on the base treatment will have an impact on the related polynomial treatments.