In [None]:
'''
Chi Square Test
Association between Device Type and Customer Satisfaction
Background:
Mizzare Corporation has collected data on customer satisfaction levels for two types of smart home devices: Smart Thermostats and Smart Lights. They want to determine if there's a significant association between the type of device purchased and the customer's satisfaction level.
Data Provided:
The data is summarized in a contingency table showing the counts of customers in each satisfaction level for both types of devices:
Satisfaction        Smart Thermostat    Smart Light    Total
Very Satisfied                    50             70      120
Satisfied                         80            100      180
Neutral                           60             90      150
Unsatisfied                       30             50       80
Very Unsatisfied                  20             50       70
Total                            240            360      600
Objective:
To use the Chi-Square test for independence to determine if there's a significant association between the type of smart home device purchased (Smart Thermostats vs. Smart Lights) and the customer satisfaction level.
Assignment Tasks:
1. State the Hypotheses:
2. Compute the Chi-Square Statistic:
3. Determine the Critical Value:
Use a significance level (alpha) of 0.05 and the degrees of freedom, which for a test of independence is (number of rows - 1) x (number of columns - 1); here (5 - 1) x (2 - 1) = 4.
4. Make a Decision:
Compare the Chi-Square statistic with the critical value to decide whether to reject the null hypothesis.
Submission Guidelines:
•	Provide a detailed report of your analysis in a Python file, covering each step outlined in the assignment tasks.
•	Include all calculations, the Chi-Square statistic, the critical value, and your conclusion.
'''
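
As a rough illustration of how the degrees of freedom and the expected counts follow from the table in the brief, the sketch below derives both by hand. It assumes NumPy is available; the variable names (observed, row_totals, and so on) are illustrative and not part of the assignment.

```python
import numpy as np

# Observed counts from the assignment table
# (rows: satisfaction levels; columns: Smart Thermostat, Smart Light).
observed = np.array([[50, 70],
                     [80, 100],
                     [60, 90],
                     [30, 50],
                     [20, 50]])

n_rows, n_cols = observed.shape

# For a test of independence: dof = (rows - 1) * (columns - 1).
dof = (n_rows - 1) * (n_cols - 1)

# Expected count per cell under H0: row total * column total / grand total.
row_totals = observed.sum(axis=1, keepdims=True)   # shape (5, 1)
col_totals = observed.sum(axis=0, keepdims=True)   # shape (1, 2)
grand_total = observed.sum()
expected = row_totals * col_totals / grand_total   # broadcasts to shape (5, 2)

print(f"Degrees of freedom: {dof}")
print("Expected counts:")
print(expected)
```

For example, the expected count for Very Satisfied thermostat buyers is 120 * 240 / 600 = 48.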

In [1]:
import pandas as pd
import scipy.stats as stats

# Step 1: State the Hypotheses
print("1. Hypotheses:")
print("H0 (Null Hypothesis): There is no association between device type and customer satisfaction.")
print("H1 (Alternative Hypothesis): There is an association between device type and customer satisfaction.\n")

# Step 2: Construct the Contingency Table
data = [[50, 70],     # Very Satisfied
        [80, 100],    # Satisfied
        [60, 90],     # Neutral
        [30, 50],     # Unsatisfied
        [20, 50]]     # Very Unsatisfied

rows = ["Very Satisfied", "Satisfied", "Neutral", "Unsatisfied", "Very Unsatisfied"]
cols = ["Smart Thermostat", "Smart Light"]

df = pd.DataFrame(data, index=rows, columns=cols)

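# Step 3: Compute the Chi-Square Statistic
# chi2_contingency returns the statistic, p-value, degrees of freedom, and
# expected frequencies. Its default Yates correction only applies when dof == 1,
# so it has no effect on this 5x2 table.
chi2_stat, p_value, dof, expected = stats.chi2_contingency(df)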

print("2. Chi-Square Statistic Calculation:")
print(f"Chi-Square Statistic: {chi2_stat:.2f}")
print(f"Degrees of Freedom: {dof}")
print("\nExpected Frequencies Table:")
print(pd.DataFrame(expected, index=rows, columns=cols).round(2))
print(f"\nP-Value: {p_value:.4f}\n")

# Step 4: Determine the Critical Value
alpha = 0.05
critical_value = stats.chi2.ppf(1 - alpha, df=dof)
print("3. Critical Value:")
print(f"Chi-Square Critical Value at α = {alpha} with {dof} degrees of freedom: {critical_value:.2f}\n")

# Step 5: Make a Decision
print("4. Decision:")
if chi2_stat > critical_value:
    print("Reject the null hypothesis – there is a significant association between device type and customer satisfaction.\n")
else:
    print("Fail to reject the null hypothesis – no significant association found.\n")

# Step 6: Conclusion
print("5. Conclusion:")
if chi2_stat > critical_value:
    print("Conclusion: There is statistically significant evidence at the 5% level that the type of device purchased is associated with customer satisfaction level.")
else:
    print("Conclusion: There is insufficient statistical evidence to suggest an association between the type of device and customer satisfaction level.")


1. Hypotheses:
H0 (Null Hypothesis): There is no association between device type and customer satisfaction.
H1 (Alternative Hypothesis): There is an association between device type and customer satisfaction.

2. Chi-Square Statistic Calculation:
Chi-Square Statistic: 5.64
Degrees of Freedom: 4

Expected Frequencies Table:
                  Smart Thermostat  Smart Light
Very Satisfied                48.0         72.0
Satisfied                     72.0        108.0
Neutral                       60.0         90.0
Unsatisfied                   32.0         48.0
Very Unsatisfied              28.0         42.0

P-Value: 0.2278

3. Critical Value:
Chi-Square Critical Value at α = 0.05 with 4 degrees of freedom: 9.49

4. Decision:
Fail to reject the null hypothesis – no significant association found.

5. Conclusion:
Conclusion: There is insufficient statistical evidence to suggest an association between the type of device and customer satisfaction level.
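
The reported statistic can be cross-checked without chi2_contingency. The sketch below is a minimal hand computation, assuming NumPy and SciPy are installed; names such as chi2_manual and p_manual are illustrative. It also checks that comparing the p-value with alpha leads to the same decision as comparing the statistic with the critical value.

```python
import numpy as np
from scipy import stats

# Observed counts (rows: satisfaction levels; columns: thermostat, light).
observed = np.array([[50, 70], [80, 100], [60, 90], [30, 50], [20, 50]])

# Expected counts under independence: row total * column total / grand total.
expected = (observed.sum(axis=1, keepdims=True)
            * observed.sum(axis=0, keepdims=True)
            / observed.sum())

# Pearson chi-square statistic: sum over all cells of (O - E)^2 / E.
chi2_manual = ((observed - expected) ** 2 / expected).sum()
dof = (observed.shape[0] - 1) * (observed.shape[1] - 1)

# Upper-tail probability of the chi-square distribution at the statistic.
p_manual = stats.chi2.sf(chi2_manual, df=dof)

alpha = 0.05
critical_value = stats.chi2.ppf(1 - alpha, df=dof)

# The two decision rules are equivalent for the same alpha and dof.
reject_by_critical = chi2_manual > critical_value
reject_by_p = p_manual < alpha

print(f"Manual chi-square: {chi2_manual:.2f}, p-value: {p_manual:.4f}")
print(f"Reject H0 (critical value rule): {reject_by_critical}")
print(f"Reject H0 (p-value rule): {reject_by_p}")
```

Run as written, this should reproduce the statistic of 5.64 and the p-value of 0.2278 reported above, with both rules failing to reject the null hypothesis.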
