# 🧪 **Two-Sample Statistical Tests for Customer Analysis**

## **🎯 Notebook Purpose**

This notebook implements comprehensive two-sample statistical testing for customer segmentation data, focusing on comparing numerical customer characteristics across categorical groups (e.g., gender-based spending differences). Two-sample tests are essential for validating business hypotheses, identifying significant group differences, and making data-driven decisions about customer targeting and segmentation strategies.

---

## **🔍 Comprehensive Analysis Coverage**

### **1. Parametric Two-Sample Tests**
- **Independent Samples T-Test**
  - **Importance:** Tests whether mean customer characteristics differ significantly between two groups (e.g., male vs female spending)
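  - **Interpretation:** p below the chosen significance level (commonly α = 0.05) indicates a statistically significant difference in means; the sign of the t-statistic shows the direction of the difference; assumes approximately normal data within each group and equal variances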
- **Welch's T-Test (Unequal Variances)**
  - **Importance:** Robust alternative when groups have unequal variances in customer characteristics
  - **Interpretation:** More reliable than standard t-test when variance assumption violated; degrees of freedom adjusted for unequal variances
- **Paired Samples T-Test**
  - **Importance:** Compares related measurements (e.g., before/after intervention spending) within same customers
  - **Interpretation:** Tests mean difference of paired observations; controls for individual customer differences; higher statistical power
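
The three parametric tests above can be sketched with `scipy.stats`. This is a minimal illustration on synthetic data, not the notebook's actual dataset: the group names, means, and sample sizes are assumptions chosen only to make the example run.

```python
import numpy as np
from scipy import stats

# Synthetic spending scores for two hypothetical customer groups (not real data).
rng = np.random.default_rng(42)
group_a = rng.normal(loc=50, scale=10, size=100)
group_b = rng.normal(loc=55, scale=20, size=120)

# Student's t-test: assumes both groups share the same variance.
student = stats.ttest_ind(group_a, group_b, equal_var=True)

# Welch's t-test: drops the equal-variance assumption.
welch = stats.ttest_ind(group_a, group_b, equal_var=False)

# Paired t-test: the same hypothetical customers before and after an intervention.
before = rng.normal(loc=50, scale=10, size=80)
after = before + rng.normal(loc=2, scale=5, size=80)
paired = stats.ttest_rel(after, before)

for name, res in [("Student", student), ("Welch", welch), ("Paired", paired)]:
    print(f"{name:8s} t = {res.statistic:6.2f}, p = {res.pvalue:.4f}")
```

The paired test is equivalent to a one-sample t-test on the per-customer differences, which is why it removes between-customer variability from the comparison.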

### **2. Non-Parametric Two-Sample Tests**
- **Mann-Whitney U Test (Wilcoxon Rank-Sum)**
  - **Importance:** Distribution-free test for comparing two customer groups when normality assumptions are violated
  - **Interpretation:** Tests whether values in one group tend to be larger than in the other; it is a comparison of medians only if the two distributions have the same shape; robust to outliers and non-normal distributions
- **Wilcoxon Signed-Rank Test**
  - **Importance:** Non-parametric alternative to paired t-test for related customer measurements
  - **Interpretation:** Tests whether the paired differences are centered at zero, assuming their distribution is symmetric; robust to non-normality; uses ranks of the differences instead of raw values
- **Kolmogorov-Smirnov Two-Sample Test**
  - **Importance:** Tests whether two customer groups have identical distributions (not just central tendency)
  - **Interpretation:** Sensitive to any distributional differences; D-statistic shows maximum difference between cumulative distributions
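
As a rough sketch of how these non-parametric tests might be run, the example below applies `scipy.stats` to skewed synthetic data. The log-normal parameters and group sizes are illustrative assumptions, not values from the customer dataset.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)

# Skewed synthetic spending data (log-normal), purely illustrative.
group_a = rng.lognormal(mean=3.0, sigma=0.5, size=150)
group_b = rng.lognormal(mean=3.2, sigma=0.5, size=150)

# Mann-Whitney U: does one group tend to have larger values than the other?
mwu = stats.mannwhitneyu(group_a, group_b, alternative="two-sided")

# Kolmogorov-Smirnov: D is the largest gap between the two empirical CDFs.
ks = stats.ks_2samp(group_a, group_b)

# Wilcoxon signed-rank: paired measurements on the same hypothetical customers.
before = rng.lognormal(mean=3.0, sigma=0.5, size=60)
after = before * rng.lognormal(mean=0.05, sigma=0.2, size=60)
wsr = stats.wilcoxon(after, before)

print(f"Mann-Whitney U = {mwu.statistic:.1f}, p = {mwu.pvalue:.4f}")
print(f"KS D = {ks.statistic:.3f}, p = {ks.pvalue:.4f}")
print(f"Wilcoxon W = {wsr.statistic:.1f}, p = {wsr.pvalue:.4f}")
```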

### **3. Robust Two-Sample Tests**
- **Trimmed Means Comparison**
  - **Importance:** Compares group means after removing extreme values to reduce outlier influence
  - **Interpretation:** More stable comparison when customer data contains outliers; shows differences for typical customers
- **Bootstrap Two-Sample Tests**
  - **Importance:** Uses resampling to create empirical sampling distributions for group comparisons
  - **Interpretation:** Distribution-free approach; provides confidence intervals; robust to distributional assumptions
- **Permutation Tests**
  - **Importance:** Tests group differences by randomly reassigning customers to groups many times
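  - **Interpretation:** p-values are exact under the null hypothesis when every possible reassignment is enumerated; with large samples a random subset of reassignments gives a Monte Carlo approximation; assumes only that group labels are exchangeable under the null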
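
A minimal sketch of the resampling ideas above, written with NumPy only. The helper names `permutation_test` and `bootstrap_ci` are hypothetical, the statistic is assumed to be the difference in means, and the permutation p-value is a Monte Carlo estimate rather than a full enumeration.

```python
import numpy as np

def permutation_test(x, y, n_resamples=5000, seed=0):
    """Two-sided Monte Carlo permutation test for a difference in means."""
    rng = np.random.default_rng(seed)
    x, y = np.asarray(x, float), np.asarray(y, float)
    observed = x.mean() - y.mean()
    pooled = np.concatenate([x, y])
    count = 0
    for _ in range(n_resamples):
        shuffled = rng.permutation(pooled)  # random reassignment to groups
        diff = shuffled[: x.size].mean() - shuffled[x.size :].mean()
        if abs(diff) >= abs(observed):
            count += 1
    # Add-one correction keeps the estimated p-value strictly positive.
    return observed, (count + 1) / (n_resamples + 1)

def bootstrap_ci(x, y, n_resamples=5000, level=0.95, seed=0):
    """Percentile bootstrap confidence interval for mean(x) - mean(y)."""
    rng = np.random.default_rng(seed)
    x, y = np.asarray(x, float), np.asarray(y, float)
    diffs = np.empty(n_resamples)
    for i in range(n_resamples):
        # Resample each group with replacement, keeping its original size.
        diffs[i] = rng.choice(x, x.size).mean() - rng.choice(y, y.size).mean()
    alpha = 1 - level
    lo, hi = np.quantile(diffs, [alpha / 2, 1 - alpha / 2])
    return lo, hi

# Synthetic example data (illustrative only).
rng = np.random.default_rng(1)
a = rng.normal(50, 10, 60)
b = rng.normal(50, 10, 60)
obs, p = permutation_test(a, b)
lo, hi = bootstrap_ci(a, b)
print(f"observed diff = {obs:.2f}, permutation p = {p:.3f}")
print(f"95% bootstrap CI: [{lo:.2f}, {hi:.2f}]")
```

The percentile interval is the simplest bootstrap variant; bias-corrected versions exist but are outside this sketch.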

### **4. Effect Size Calculations**
- **Cohen's d for Independent Groups**
  - **Importance:** Quantifies practical significance of customer group differences in standardized units
  - **Interpretation:** d ≈ 0.2 (small), 0.5 (medium), and 0.8 (large) are Cohen's conventional benchmarks rather than strict cutoffs; shows business importance beyond statistical significance
- **Glass's Delta**
  - **Importance:** Effect size measure using control group standard deviation as denominator
  - **Interpretation:** Useful when groups have different variances; shows effect relative to baseline group variability
- **Hedges' g (Bias-Corrected Effect Size)**
  - **Importance:** Provides unbiased effect size estimate, especially important for small customer samples
  - **Interpretation:** Cohen's d multiplied by a correction factor that shrinks the estimate slightly in small samples; the two measures converge as sample size grows
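
The three effect sizes differ only in the standardizer and a small-sample correction, which a short sketch makes concrete. The function names are hypothetical, and `hedges_g` uses the common approximate correction factor 1 − 3/(4·df − 1) rather than the exact gamma-function form.

```python
import math
import statistics as st

def cohens_d(x, y):
    """Mean difference divided by the pooled standard deviation."""
    nx, ny = len(x), len(y)
    pooled_var = ((nx - 1) * st.variance(x) + (ny - 1) * st.variance(y)) / (nx + ny - 2)
    return (st.mean(x) - st.mean(y)) / math.sqrt(pooled_var)

def glass_delta(treatment, control):
    """Mean difference divided by the control group's standard deviation."""
    return (st.mean(treatment) - st.mean(control)) / st.stdev(control)

def hedges_g(x, y):
    """Cohen's d with the approximate small-sample bias correction."""
    df = len(x) + len(y) - 2
    return cohens_d(x, y) * (1 - 3 / (4 * df - 1))

# Tiny illustrative samples (not real customer data).
x = [2, 4, 6, 8, 10]
y = [1, 3, 5, 7, 9]
print(f"Cohen's d   = {cohens_d(x, y):.3f}")
print(f"Glass's Δ   = {glass_delta(x, y):.3f}")
print(f"Hedges' g   = {hedges_g(x, y):.3f}")
```

With only five observations per group the correction is visible: Hedges' g is about 10% smaller than Cohen's d here.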

### **5. Power Analysis and Sample Size Planning**
- **Statistical Power Calculation**
  - **Importance:** Determines probability of detecting true customer group differences of specified magnitude
  - **Interpretation:** Power ≥ 0.80 recommended; low power may miss important business differences; guides sample size needs
- **Sample Size Determination**
  - **Importance:** Calculates required customer sample size to detect meaningful business differences
  - **Interpretation:** Balances statistical requirements with data collection costs; ensures adequate power for decision-making
- **Post-Hoc Power Analysis**
  - **Importance:** Evaluates power of completed studies to interpret non-significant results
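  - **Interpretation:** Low power for a pre-specified, business-relevant effect size suggests the study may have missed real differences; power computed from the observed effect is just a restatement of the p-value and cannot confirm a null result, so confidence intervals or equivalence tests are better suited for that purpose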
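
To show how effect size, α, power, and sample size trade off, here is a rough sketch using the normal approximation to the two-sample t-test with equal group sizes. The function names are hypothetical, and this approximation is a simplification: t-based calculations (as offered by dedicated power-analysis libraries) give slightly larger sample sizes.

```python
import math
from statistics import NormalDist

def n_per_group(effect_size, alpha=0.05, power=0.80):
    """Approximate per-group n for a two-sided, two-sample comparison of means."""
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha / 2)
    z_power = z.inv_cdf(power)
    return math.ceil(2 * ((z_alpha + z_power) / effect_size) ** 2)

def approx_power(effect_size, n, alpha=0.05):
    """Approximate power with n customers per group (ignores the far tail)."""
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha / 2)
    return z.cdf(effect_size * math.sqrt(n / 2) - z_alpha)

for d in (0.2, 0.5, 0.8):
    n = n_per_group(d)
    print(f"d = {d}: about {n} per group, approx. power = {approx_power(d, n):.3f}")
```

For d = 0.5 this approximation gives 63 customers per group; an exact t-based calculation gives about 64, so the sketch is best treated as a planning estimate.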

### **6. Assumption Testing and Validation**
- **Normality Testing for Each Group**
  - **Importance:** Validates distributional assumptions required for parametric tests
  - **Interpretation:** Shapiro-Wilk, Anderson-Darling tests assess normality; violations suggest non-parametric alternatives
- **Homogeneity of Variance Testing**
  - **Importance:** Tests equal variance assumption critical for standard t-tests
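  - **Interpretation:** Levene's test and the F-test assess variance equality; the F-test is itself sensitive to non-normality, so Levene's test is usually the safer choice; violations call for Welch's t-test or transformations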
- **Independence Assumption Validation**
  - **Importance:** Ensures customer observations are independent within and between groups
  - **Interpretation:** Critical for valid inference; violations require specialized methods (e.g., mixed models)
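
One possible way to wire the normality and variance checks into test selection is sketched below with `scipy.stats`. The data are synthetic, and the decision rule at the end is an illustrative assumption, not a prescribed workflow; independence cannot be checked this way and has to come from the study design.

```python
import numpy as np
from scipy import stats

# Synthetic groups with deliberately different spreads (illustrative only).
rng = np.random.default_rng(7)
group_a = rng.normal(50, 10, 80)
group_b = rng.normal(52, 25, 90)

alpha = 0.05

# Shapiro-Wilk per group: small p-values suggest departure from normality.
shapiro_a = stats.shapiro(group_a)
shapiro_b = stats.shapiro(group_b)

# Levene's test centered on the median (the Brown-Forsythe variant).
levene = stats.levene(group_a, group_b, center="median")

normal_ok = shapiro_a.pvalue > alpha and shapiro_b.pvalue > alpha
equal_var_ok = levene.pvalue > alpha

# Hypothetical decision rule for choosing the main test.
if not normal_ok:
    suggestion = "Mann-Whitney U"
elif not equal_var_ok:
    suggestion = "Welch's t-test"
else:
    suggestion = "Student's t-test"

print(f"Shapiro p-values: {shapiro_a.pvalue:.3f}, {shapiro_b.pvalue:.3f}")
print(f"Levene p-value:   {levene.pvalue:.4f}")
print(f"Suggested test:   {suggestion}")
```

Choosing a test based on preliminary tests affects the overall error rate, so some analysts simply default to Welch's t-test; the sketch shows the mechanics rather than endorsing either practice.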

### **7. Multiple Comparisons and Corrections**
- **Bonferroni Correction**
  - **Importance:** Controls family-wise error rate when testing multiple customer group comparisons
  - **Interpretation:** Divides α by the number of tests; conservative, but bounds the probability of making even one false positive across the whole family of comparisons
- **False Discovery Rate (FDR) Control**
  - **Importance:** Less conservative approach to multiple testing correction for exploratory customer analysis
  - **Interpretation:** Controls expected proportion of false discoveries; better power than Bonferroni for many comparisons
- **Holm-Bonferroni Sequential Method**
  - **Importance:** Step-down procedure that's less conservative than standard Bonferroni
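  - **Interpretation:** Orders the m p-values from smallest to largest and compares the k-th to α/(m − k + 1); stops at the first non-significant result; controls the same family-wise error rate as Bonferroni while rejecting at least as many hypotheses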
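
The three corrections can be compared side by side in a few lines of plain Python. This is a sketch with hypothetical function names and made-up p-values; the FDR procedure shown is Benjamini-Hochberg, which is one common choice and an assumption here, since the outline does not name a specific method.

```python
def bonferroni(pvals, alpha=0.05):
    """Reject when p <= alpha / m."""
    m = len(pvals)
    return [p <= alpha / m for p in pvals]

def holm(pvals, alpha=0.05):
    """Step-down: compare the k-th smallest p-value to alpha / (m - k + 1)."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    reject = [False] * m
    for rank, i in enumerate(order):  # rank is 0-based, so divisor is m - rank
        if pvals[i] <= alpha / (m - rank):
            reject[i] = True
        else:
            break  # stop at the first non-significant result
    return reject

def benjamini_hochberg(pvals, alpha=0.05):
    """Step-up FDR control: reject the k smallest, k = max{k: p_(k) <= k/m * alpha}."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    cutoff = 0
    for rank, i in enumerate(order, start=1):
        if pvals[i] <= rank / m * alpha:
            cutoff = rank
    reject = [False] * m
    for i in order[:cutoff]:
        reject[i] = True
    return reject

# Made-up p-values from five hypothetical group comparisons.
pvals = [0.001, 0.012, 0.020, 0.035, 0.300]
print("Bonferroni:", bonferroni(pvals))
print("Holm:      ", holm(pvals))
print("BH (FDR):  ", benjamini_hochberg(pvals))
```

On these values Bonferroni rejects one hypothesis, Holm two, and Benjamini-Hochberg four, which illustrates the ordering in power described above.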

### **8. Bayesian Two-Sample Analysis**
- **Bayesian T-Test**
  - **Importance:** Provides probability statements about customer group differences using prior information
  - **Interpretation:** Bayes factors quantify evidence for/against differences; credible intervals show parameter uncertainty
- **Bayesian Estimation with Uncertainty**
  - **Importance:** Estimates group differences with full uncertainty quantification
  - **Interpretation:** Posterior distributions show all plausible parameter values; incorporates prior business knowledge
- **Model Comparison and Selection**
  - **Importance:** Compares different models for customer group differences using Bayesian criteria
  - **Interpretation:** Model probabilities guide selection; accounts for model uncertainty in conclusions
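
As a simplified illustration of Bayesian estimation for two groups, the sketch below draws from the posterior of each group mean under a normal model with the standard noninformative prior p(μ, σ²) ∝ 1/σ², for which the posterior of the mean is a shifted, scaled t distribution. This is a deliberately simple stand-in, not a Bayes-factor t-test and not necessarily the model used in the notebook; the function name and data are hypothetical.

```python
import numpy as np

def posterior_mean_draws(x, n_draws, rng):
    """Posterior draws of a group mean: mean + s/sqrt(n) * t_{n-1}."""
    x = np.asarray(x, float)
    n = x.size
    scale = x.std(ddof=1) / np.sqrt(n)
    return x.mean() + scale * rng.standard_t(n - 1, size=n_draws)

rng = np.random.default_rng(11)

# Synthetic spending for two hypothetical customer groups.
group_a = rng.normal(60, 10, 100)
group_b = rng.normal(50, 10, 100)

# Groups are modeled independently, each with its own variance.
draws_a = posterior_mean_draws(group_a, 20000, rng)
draws_b = posterior_mean_draws(group_b, 20000, rng)
diff = draws_a - draws_b

lo, hi = np.quantile(diff, [0.025, 0.975])
prob_positive = (diff > 0).mean()

print(f"Posterior mean difference: {diff.mean():.2f}")
print(f"95% credible interval:     [{lo:.2f}, {hi:.2f}]")
print(f"P(mean_a > mean_b | data): {prob_positive:.3f}")
```

Informative priors that encode business knowledge would require a fuller model, typically fitted with a probabilistic programming library.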

### **9. Equivalence and Non-Inferiority Testing**
- **Two One-Sided Tests (TOST)**
  - **Importance:** Tests whether customer groups are practically equivalent within specified bounds
  - **Interpretation:** Demonstrates similarity rather than just absence of difference; important for business equivalence claims
- **Confidence Interval Approach to Equivalence**
  - **Importance:** Uses confidence intervals to assess practical equivalence of customer groups
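  - **Interpretation:** If the (1 − 2α) confidence interval for the difference (a 90% interval for α = 0.05) lies entirely within the equivalence bounds, the groups are practically equivalent; this matches the TOST decision and is visual and intuitive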
- **Non-Inferiority Testing**
  - **Importance:** Tests whether one customer group is not meaningfully worse than another
  - **Interpretation:** Important for cost-benefit decisions; shows acceptable performance within tolerance limits
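
A TOST procedure can be assembled from two one-sided Welch t-tests, as in the hedged sketch below. The helper name `tost_ind` and the ±3 equivalence bounds are hypothetical; in practice the bounds must come from business judgment about what difference is negligible. The `alternative` argument requires SciPy 1.6 or later.

```python
import numpy as np
from scipy import stats

def tost_ind(x, y, low, high, equal_var=False):
    """TOST p-value for H1: low < mean(x) - mean(y) < high."""
    x, y = np.asarray(x, float), np.asarray(y, float)
    # Lower test. H0: diff <= low, H1: diff > low (shift x down by low).
    p_lower = stats.ttest_ind(
        x - low, y, equal_var=equal_var, alternative="greater"
    ).pvalue
    # Upper test. H0: diff >= high, H1: diff < high (shift x down by high).
    p_upper = stats.ttest_ind(
        x - high, y, equal_var=equal_var, alternative="less"
    ).pvalue
    # Equivalence is supported only if BOTH one-sided tests reject.
    return max(p_lower, p_upper)

# Synthetic groups with nearly identical means (illustrative only).
rng = np.random.default_rng(5)
group_a = rng.normal(50.0, 10, 300)
group_b = rng.normal(50.5, 10, 300)

p_equiv = tost_ind(group_a, group_b, low=-3.0, high=3.0)
print(f"TOST p-value for bounds (-3, 3): {p_equiv:.4f}")
```

A non-inferiority test is the one-sided half of this procedure: only the test against the lower (or upper) bound is run.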

### **10. Robust and Resistant Methods**
- **Median-Based Tests**
  - **Importance:** Focuses on central tendency measures less affected by customer outliers
  - **Interpretation:** Mood's median test, median confidence intervals; robust to extreme values and skewness
- **Quantile-Based Comparisons**
  - **Importance:** Compares customer groups across different parts of the distribution
  - **Interpretation:** Reveals differences in distribution shape, not just central tendency; comprehensive group comparison
- **Winsorized and Trimmed Comparisons**
  - **Importance:** Reduces outlier influence while retaining more data than median-only approaches
  - **Interpretation:** Balances robustness with information retention; shows differences for typical customers
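
The robust comparisons in this section might look like the following with `scipy.stats` and NumPy. The data are synthetic and skewed on purpose, and the 20% trimming proportion and the chosen quantile levels are illustrative assumptions.

```python
import numpy as np
from scipy import stats

# Skewed synthetic spending data with different spreads (illustrative only).
rng = np.random.default_rng(3)
group_a = rng.lognormal(3.0, 0.6, 200)
group_b = rng.lognormal(3.1, 0.9, 200)

# Mood's median test: counts of values above/below the grand median per group.
stat, p_median, grand_median, table = stats.median_test(group_a, group_b)

# 20% trimmed means: drop the lowest and highest 20% before averaging.
trim_a = stats.trim_mean(group_a, proportiontocut=0.2)
trim_b = stats.trim_mean(group_b, proportiontocut=0.2)

# Quantile differences: where in the distribution do the groups diverge?
qs = [0.10, 0.25, 0.50, 0.75, 0.90]
quantile_diff = np.quantile(group_b, qs) - np.quantile(group_a, qs)

print(f"Mood's median test p = {p_median:.4f} (grand median {grand_median:.2f})")
print(f"Trimmed means: A = {trim_a:.2f}, B = {trim_b:.2f}")
for q, d in zip(qs, quantile_diff):
    print(f"  q{int(q * 100):02d}: B - A = {d:7.2f}")
```

Quantile differences that grow toward the upper tail point to a difference in spread or skew rather than a uniform shift between groups.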

### **11. Specialized Customer Applications**
- **Gender-Based Spending Analysis**
  - **Importance:** Tests whether male and female customers have significantly different spending patterns
  - **Interpretation:** Guides gender-targeted marketing strategies; identifies product preferences by gender
- **Age Group Comparisons**
  - **Importance:** Compares customer characteristics across different age categories
  - **Interpretation:** Reveals life-stage effects on customer behavior; guides age-targeted product development
- **Geographic Region Analysis**
  - **Importance:** Tests customer characteristic differences across geographic markets
  - **Interpretation:** Identifies regional preferences and behaviors; guides market-specific strategies

### **12. Time Series and Longitudinal Comparisons**
- **Before-After Intervention Analysis**
  - **Importance:** Tests effectiveness of marketing campaigns or policy changes on customer behavior
  - **Interpretation:** Paired tests control for customer-specific factors; shows intervention effectiveness
- **Seasonal Group Comparisons**
  - **Importance:** Compares customer behavior between different seasons or time periods
  - **Interpretation:** Identifies temporal patterns in customer segments; guides seasonal marketing strategies
- **Cohort Analysis Comparisons**
  - **Importance:** Compares customer characteristics across different acquisition cohorts
  - **Interpretation:** Shows evolution of customer base; identifies changing customer profiles over time

### **13. Advanced Statistical Methods**
- **Randomization and Permutation Tests**
  - **Importance:** Exact tests that don't rely on distributional assumptions
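  - **Interpretation:** Provides exact p-values when all reassignments are enumerated, which is feasible mainly for small samples; larger samples use a random subset of reassignments and yield a Monte Carlo approximation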
- **Bootstrap Confidence Intervals**
  - **Importance:** Non-parametric confidence intervals for group differences
  - **Interpretation:** Robust to distributional assumptions; provides uncertainty quantification without normality
- **Jackknife Methods for Group Comparisons**
  - **Importance:** Assesses stability of group difference estimates
  - **Interpretation:** Shows sensitivity to individual observations; identifies influential customers
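
A leave-one-out jackknife for the difference in group means can be sketched in a few lines of NumPy. The function name `jackknife_diff` is hypothetical, and the tiny arrays are made up so that one extreme customer is easy to spot.

```python
import numpy as np

def jackknife_diff(x, y):
    """Leave-one-out estimates of mean(x) - mean(y) and each point's influence."""
    x, y = np.asarray(x, float), np.asarray(y, float)
    full = x.mean() - y.mean()
    # Mean of x with observation i removed is (sum(x) - x[i]) / (n - 1).
    loo_x = (x.sum() - x) / (x.size - 1) - y.mean()
    loo_y = x.mean() - (y.sum() - y) / (y.size - 1)
    loo = np.concatenate([loo_x, loo_y])
    # Influence: how much the estimate moves when that customer is dropped.
    influence = loo - full
    return full, loo, influence

# Made-up data: the fourth customer in group x is an extreme spender.
x = [1, 2, 3, 100]
y = [1, 2, 3, 4]
full, loo, influence = jackknife_diff(x, y)
most_influential = int(np.argmax(np.abs(influence)))

print(f"Full-sample difference: {full:.2f}")
print(f"Leave-one-out range:    [{loo.min():.2f}, {loo.max():.2f}]")
print(f"Most influential index: {most_influential}")
```

Indices in the returned arrays run through group `x` first and then group `y`, so an index below `len(x)` refers to a customer in the first group.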

### **14. Business Applications and Decision Support**
- **Customer Segmentation Validation**
  - **Importance:** Tests whether proposed customer segments have significantly different characteristics
  - **Interpretation:** Validates segmentation schemes; ensures segments are meaningfully different for business purposes
- **A/B Testing for Customer Interventions**
  - **Importance:** Compares customer responses between control and treatment groups
  - **Interpretation:** Guides marketing decisions; shows intervention effectiveness with proper statistical rigor
- **Market Research and Survey Analysis**
  - **Importance:** Analyzes customer survey responses across different demographic groups
  - **Interpretation:** Identifies customer preferences and attitudes; guides product development and positioning
- **Quality Control and Customer Satisfaction**
  - **Importance:** Compares customer satisfaction metrics across different service channels or time periods
  - **Interpretation:** Identifies service quality issues; guides operational improvements and resource allocation

---

## **📊 Expected Outcomes**

- **Hypothesis Validation:** Rigorous testing of business hypotheses about customer group differences
- **Effect Quantification:** Clear understanding of practical significance through effect size measures
- **Decision Support:** Statistical evidence to guide customer targeting and segmentation strategies
- **Risk Management:** Proper error control through multiple comparison corrections and power analysis
- **Business Intelligence:** Translation of statistical results into actionable business insights
- **Methodological Rigor:** Appropriate test selection based on data characteristics and business context

This comprehensive two-sample testing framework provides essential statistical tools for comparing customer groups, enabling evidence-based decision-making, hypothesis validation, and strategic customer analysis through rigorous statistical methodology and business-focused interpretation.
