# Hypothesis Testing and Chi-Square Test Assessments

## Part 1: Hypothesis Testing

### Background:
Bombay Hospitality Ltd. operates a franchise model. The weekly operating cost is given by:

\[ W = 1000 + 5X \]

where `X` is the number of units produced. A sample of 25 restaurants has a mean weekly cost of Rs. 3050. We test whether the weekly cost has increased above the value the model predicts.

- Mean of units (X): 600
- Std Dev of X: 25
- Sample size (n): 25
- Sample mean cost (x̄): 3050

### Step 1: Hypotheses
- **Null Hypothesis (H₀)**: µ = 4000, the weekly cost the model predicts at the mean of X, 1000 + 5(600) (no increase in cost)
- **Alternative Hypothesis (H₁)**: µ > 4000 (cost has increased)

### Step 2: Test Statistic Calculation
Using a one-sample Z-test, with σ = 5 × 25 = 125 because W = 1000 + 5X scales the standard deviation of X by 5:

\[ Z = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}} \]

```python
import math
from scipy.stats import norm

# Given data
sample_mean = 3050
theoretical_mean = 1000 + 5 * 600  # model cost at the mean of X: 4000
sigma = 5 * 25  # SD of W = 5 * SD of X = 125
n = 25
alpha = 0.05

# Z-test calculation
standard_error = sigma / math.sqrt(n)
z_score = (sample_mean - theoretical_mean) / standard_error

# Critical value for right-tailed test
z_critical = norm.ppf(1 - alpha)

# Output results
print(f"Z-Score: {z_score:.2f}")
print(f"Critical Value (Z): {z_critical:.3f}")
if z_score > z_critical:
    print("Reject the null hypothesis: Cost is significantly higher.")
else:
    print("Fail to reject the null hypothesis: No significant increase in cost.")
```

### Step 3: Critical Value
From the Z-table for α = 0.05 (right-tailed), the critical value is Z ≈ 1.645.

### Step 4: Decision
Since Z = (3050 − 4000) / (125 / √25) = −950 / 25 = −38 < 1.645, we **fail to reject** the null hypothesis.

### Step 5: Conclusion
The sample mean (Rs. 3050) lies well below the model's Rs. 4000, so there is no evidence that the actual weekly operating cost is higher than the model suggests.
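
The same decision can also be read off a p-value rather than a critical value. The sketch below is illustrative and not part of the original assessment: the helper name `right_tailed_z_test` is hypothetical, and it assumes, as the Z-test in this part does, that the standard deviation of the cost is known (125, from 5 × 25).

```python
import math
from scipy.stats import norm


def right_tailed_z_test(sample_mean, mu0, sigma, n):
    """Return (z, p_value) for H1: mu > mu0 with known sigma (hypothetical helper)."""
    z = (sample_mean - mu0) / (sigma / math.sqrt(n))
    # norm.sf(z) is the upper-tail probability P(Z >= z)
    p_value = norm.sf(z)
    return z, p_value


z, p_value = right_tailed_z_test(sample_mean=3050, mu0=4000, sigma=125, n=25)
print(f"z = {z:.2f}, p-value = {p_value:.4f}")
```

Because the sample mean lies below the hypothesized mean, the right-tailed p-value is essentially 1, far above α = 0.05, which matches the critical-value decision.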

## Part 2: Chi-Square Test for Independence

### Background:
Mizzare Corporation wants to test if there's a significant association between the **type of smart device** and **customer satisfaction level**.

Contingency Table:

| Satisfaction       | Smart Thermostat | Smart Light | Total |
|--------------------|------------------|-------------|--------|
| Very Satisfied     | 50               | 70          | 120    |
| Satisfied          | 80               | 100         | 180    |
| Neutral            | 60               | 90          | 150    |
| Unsatisfied        | 30               | 50          | 80     |
| Very Unsatisfied   | 20               | 50          | 70     |
| Total              | 240              | 360         | 600    |

### Step 1: Hypotheses
- **Null Hypothesis (H₀)**: Satisfaction is independent of device type
- **Alternative Hypothesis (H₁)**: Satisfaction depends on device type
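
Under H₀, the expected count in each cell is (row total × column total) / grand total. A minimal sketch of that calculation, using the counts from the contingency table; the variable names are chosen here for illustration:

```python
import numpy as np

# Observed counts (rows: satisfaction level, columns: Thermostat, Light)
observed = np.array([
    [50, 70],
    [80, 100],
    [60, 90],
    [30, 50],
    [20, 50],
])

row_totals = observed.sum(axis=1)
col_totals = observed.sum(axis=0)
grand_total = observed.sum()

# Under independence: E[i, j] = row_total[i] * col_total[j] / grand_total
expected = np.outer(row_totals, col_totals) / grand_total
print(expected)
```

Every expected count here is at least 28, comfortably above the common rule of thumb of 5 per cell for the chi-square approximation.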

### Step 2: Compute Chi-Square Statistic

```python
import numpy as np
import pandas as pd
from scipy.stats import chi2_contingency, chi2

# Observed frequency table (rows: satisfaction level, columns: device type)
observed = np.array([
    [50, 70],
    [80, 100],
    [60, 90],
    [30, 50],
    [20, 50]
])

# Chi-square test of independence
chi2_stat, p_value, dof, expected = chi2_contingency(observed)

# Critical value
alpha = 0.05
critical_value = chi2.ppf(1 - alpha, df=dof)

# Display results
print(f"Chi-Square Statistic: {chi2_stat:.4f}")
print(f"Degrees of Freedom: {dof}")
print(f"Critical Value at alpha=0.05: {critical_value:.4f}")
print(f"P-Value: {p_value:.4f}")
print("\nExpected Frequencies:")
print(pd.DataFrame(expected, columns=["Smart Thermostat", "Smart Light"]))

if chi2_stat > critical_value:
    print("\nConclusion: Reject the null hypothesis. There is a significant association between device type and customer satisfaction.")
else:
    print("\nConclusion: Fail to reject the null hypothesis. There is no significant association between device type and customer satisfaction.")
```

### Step 3: Critical Value
Degrees of freedom = (rows − 1)(columns − 1) = (5 − 1)(2 − 1) = 4. The critical value from the χ² table at α = 0.05 is **9.488**.

### Step 4: Conclusion
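The computed statistic is χ² ≈ 5.638 (p ≈ 0.23), which is below the critical value of 9.488, so we **fail to reject** the null hypothesis. The data show no significant association between device type and customer satisfaction at the 5% level.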
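
As a cross-check on the library result, the Pearson statistic can be rebuilt by hand as the sum of (O − E)² / E over all cells. This is a minimal sketch, with the names `contributions`, `chi2_manual`, and `p_manual` chosen here for illustration:

```python
import numpy as np
from scipy.stats import chi2

observed = np.array([[50, 70], [80, 100], [60, 90], [30, 50], [20, 50]])

# Expected counts under independence
expected = np.outer(observed.sum(axis=1), observed.sum(axis=0)) / observed.sum()

# Pearson statistic: sum over cells of (O - E)^2 / E
contributions = (observed - expected) ** 2 / expected
chi2_manual = contributions.sum()

# Degrees of freedom and upper-tail p-value
dof = (observed.shape[0] - 1) * (observed.shape[1] - 1)
p_manual = chi2.sf(chi2_manual, dof)

print(f"chi2 = {chi2_manual:.4f}, dof = {dof}, p = {p_manual:.4f}")
print("Per-row contributions:", contributions.sum(axis=1).round(4))
```

The per-row sums show that the Very Unsatisfied row contributes most to the statistic, yet the total still falls short of the critical value.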