So a t-test is needed to rest this debate, which is One of the most common tests in statistics is the t-test, used to determine whether the means of two groups are equal to each other. The assumption for the test is that both groups are sampled from normal distributions with equal variances.
T-test: One of the most common tests in statistics, used to determine whether the means of two groups are equal to each other. The assumption for the test is that both groups are sampled from normal distributions with equal variances.
Here I plot the Density distribution of four items spending (not technically normally distributed, but as instructed in the documentation) This might be the most vulnerable part of this analysis. And I hope professor could leave some comments on this later on
From this preview of data, -- this does not tell many things. -- To Dig deeper, need to segment the customer and examine the corresponding coupon effects on each segment
We now have a better understanding of client distribution
Based on the above 18 segmentations and the 4 types of spending, we got 72 (433*2 = 72) subgroups of customer spending and got 72 pair of t-test results. Then I get a new dataset like this.
the data contains not only the characteristics (like age, joindate, etc.) of this customer segment, but also the t-test results (like p-value and the mean of t-test). Based on these data, we implemented the following analysis to check the coupon effects on the spending of these customers (significant or insignificant, positive or negative)
Set the significance level to be 1%, we can get 49 remaining t-test that represent significant coupon effects on customer spending
Before we dive into the final revenue, we can analyze at a single item spending level.In this way, we can test ‘spending level’ Cross check the distribution on different age/spending level
Its time to go deeper to see the coupon effects on the commission, with the purpose to maximize revenue
The purchase of sculpture from young-early joined –medium users are the most responsive and stimulated one. In terms of commission, we can get 15USD more from a single purchase with coupon
Clear to see sculpture and mosaics benefit most from this coupon. Almost every subgroups buy more, except for middle-aged group. Its also worth mentioning that the behavior changes dramatically as yon grow older. See, the pattern here btw them are totally different.
Similar reasoning process for behavior change among different age groups
To see from the level of aggregated commission, I plotted this heatmap
It’s clear that younger customers(the blue tiles) xare the only subgroup that are positively boosted by this coupon, elders are kind of neutral and slightly negative, and middle-age customers are more easily to take advantage of coupon promotion. Which means, this segment may fail our coupon promotion plan.
With regard to different join_date level, there is no much difference between the upper row and lower row. The same thing happens on the visit_frequency level. Anyways, these two factors are significant to affect the natural spending, but there is no much internal difference.