## ***CHI-SQUARE TEST***
**Association between Device Type and Customer Satisfaction**

**Background:**

Mizzare Corporation has collected customer satisfaction data for two types of smart home devices: Smart Thermostats and Smart Lights. The company wants to determine whether there is a significant association between the type of device purchased and the customer's satisfaction level.

**Data Provided:**

The data is summarized in a contingency table showing the counts of customers in each satisfaction level for both types of devices:

| Satisfaction | Smart Thermostat | Smart Light | Total |
|---|---:|---:|---:|
| Very Satisfied | 50 | 70 | 120 |
| Satisfied | 80 | 100 | 180 |
| Neutral | 60 | 90 | 150 |
| Unsatisfied | 30 | 50 | 80 |
| Very Unsatisfied | 20 | 50 | 70 |
| Total | 240 | 360 | 600 |

**Objective:**

To use the Chi-Square test for independence to determine if there's a significant association between the type of smart home device purchased (Smart Thermostats vs. Smart Lights) and the customer satisfaction level.

**Assignment Tasks:**

**1. State the Hypotheses:**

**2. Compute the Chi-Square Statistic:**

**3. Determine the Critical Value:**

Use a significance level (alpha) of 0.05 and degrees of freedom equal to (number of rows - 1) * (number of columns - 1), which for this table is (5 - 1) * (2 - 1) = 4.

**4. Make a Decision:**

Compare the Chi-Square statistic with the critical value to decide whether to reject the null hypothesis.

**Submission Guidelines:**

- Provide a detailed report of your analysis, including each step outlined in the assignment tasks, in a Python file.
- Include all calculations, the Chi-Square statistic, the critical value, and your conclusion.


```python
import numpy as np
from scipy.stats import chi2

# Contingency table
observed = np.array([[50, 70], [80, 100], [60, 90], [30, 50], [20, 50]])

# Calculate expected frequencies
total_rows = observed.sum(axis=1)
total_cols = observed.sum(axis=0)
total = observed.sum()
expected = np.outer(total_rows, total_cols) / total

# Calculate Chi-Square statistic
chi2_stat = np.sum((observed - expected) ** 2 / expected)

# Determine degrees of freedom
df = (observed.shape[0] - 1) * (observed.shape[1] - 1)

# Determine critical value
alpha = 0.05
critical_value = chi2.ppf(1 - alpha, df)

# Make a decision
if chi2_stat > critical_value:
    print("Reject the null hypothesis. There is a significant association between device type and customer satisfaction.")
else:
    print("Fail to reject the null hypothesis. There is no significant association between device type and customer satisfaction.")

print(f"Chi-Square statistic: {chi2_stat}")
print(f"Critical value: {critical_value}")
print(f"Degrees of freedom: {df}")
```

Output:

```
Fail to reject the null hypothesis. There is no significant association between device type and customer satisfaction.
Chi-Square statistic: 5.638227513227513
Critical value: 9.487729036781154
Degrees of freedom: 4
```


**Step 1: State the Hypotheses**

The null hypothesis (H0) is that there is no association between the type of smart home device purchased and the customer satisfaction level, that is, the two variables are independent. The alternative hypothesis (H1) is that there is an association between the type of smart home device purchased and the customer satisfaction level.

**Step 2: Compute the Chi-Square Statistic**

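We calculate the expected frequency for each cell from the marginal totals as (row total * column total) / grand total. Then we compute the Chi-Square statistic by summing over all 10 cells: Σ [(observed - expected)^2 / expected]. For this table the statistic is approximately 5.64.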
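To make the per-cell arithmetic visible, the following sketch recomputes the expected counts and each cell's contribution to the statistic. The `levels` list and the `contributions` name are introduced here for illustration only. The printed smallest expected count relates to a commonly cited rule of thumb, assumed here, that expected counts should be at least 5 for the Chi-Square approximation to be reliable.

```python
import numpy as np

# Observed counts from the contingency table
# (rows: satisfaction levels; columns: Smart Thermostat, Smart Light).
observed = np.array([[50, 70], [80, 100], [60, 90], [30, 50], [20, 50]])
levels = ["Very Satisfied", "Satisfied", "Neutral", "Unsatisfied", "Very Unsatisfied"]

row_totals = observed.sum(axis=1)
col_totals = observed.sum(axis=0)
grand_total = observed.sum()

# Expected count for each cell: row total * column total / grand total.
expected = np.outer(row_totals, col_totals) / grand_total

# Contribution of each cell to the Chi-Square statistic.
contributions = (observed - expected) ** 2 / expected

for level, exp_row, contrib_row in zip(levels, expected, contributions):
    print(f"{level:<17} expected={exp_row}  contribution={np.round(contrib_row, 3)}")

print(f"Chi-Square statistic: {contributions.sum():.4f}")
print(f"Smallest expected count: {expected.min():.0f}")
```

For example, the expected count for Very Satisfied thermostat customers is 120 * 240 / 600 = 48. The Very Unsatisfied row contributes the most to the total, since that is where the observed counts differ most from the expected ones.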

**Step 3: Determine the Critical Value**

We determine the degrees of freedom (df) as (number of rows - 1) * (number of columns - 1), which gives (5 - 1) * (2 - 1) = 4 for this table. Then we use the Chi-Square distribution to find the critical value for the given significance level (alpha = 0.05) and degrees of freedom, which is approximately 9.49.
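As a cross-check on the manual calculation, the statistic and degrees of freedom can also be obtained from `scipy.stats.chi2_contingency`. This is a sketch of an alternative route, not the method required by the assignment. `correction=False` is passed so that no continuity correction is involved; SciPy applies that correction only when there is one degree of freedom, so it would not affect this 5 x 2 table either way.

```python
import numpy as np
from scipy.stats import chi2, chi2_contingency

# Same observed counts as in the contingency table.
observed = np.array([[50, 70], [80, 100], [60, 90], [30, 50], [20, 50]])

# chi2_contingency returns the statistic, p-value, degrees of freedom,
# and the table of expected frequencies.
stat, p_value, dof, expected = chi2_contingency(observed, correction=False)

# Critical value at alpha = 0.05 for the reported degrees of freedom.
alpha = 0.05
critical_value = chi2.ppf(1 - alpha, dof)

print(f"Chi-Square statistic: {stat:.4f}")
print(f"Degrees of freedom: {dof}")
print(f"Critical value: {critical_value:.4f}")
```

The values should agree with the manual computation up to floating-point rounding.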

**Step 4: Make a Decision**

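We compare the calculated Chi-Square statistic with the critical value. If the Chi-Square statistic is greater than the critical value, we reject the null hypothesis and conclude that there is a significant association between the type of smart home device purchased and the customer satisfaction level. Otherwise, we fail to reject the null hypothesis. Here the statistic (about 5.64) is smaller than the critical value (about 9.49), so we fail to reject the null hypothesis: at the 0.05 significance level, the data do not provide sufficient evidence of an association between device type and customer satisfaction.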
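The same decision can be reached through a p-value instead of a critical value. The sketch below is an equivalent alternative to the comparison described above, not part of the original assignment steps. It takes the statistic and degrees of freedom reported in the output and uses `chi2.sf`, the survival function, to get the upper-tail probability.

```python
from scipy.stats import chi2

# Values reported by the analysis above.
chi2_stat = 5.638227513227513
df = 4
alpha = 0.05

# p-value: probability of a statistic at least this large under H0.
p_value = chi2.sf(chi2_stat, df)

# Reject H0 only if the p-value falls below the significance level.
reject = bool(p_value < alpha)

print(f"p-value: {p_value:.4f}")
print("Reject H0" if reject else "Fail to reject H0")
```

Rejecting when the p-value is below alpha is equivalent to rejecting when the statistic exceeds the critical value, so both approaches give the same conclusion.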