## Chi-Square Test

## 1. State the Hypothesis

In [7]:
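import pandas as pd
from scipy.stats import chi2_contingency, chi2

# H0: Device type and customer satisfaction level are independent (no association).
# H1: Device type and customer satisfaction level are associated.

# Observed counts of customers at each satisfaction level, by device type.
data = {
    'Smart Thermostat': [50, 80, 60, 30, 20],
    'Smart Light': [70, 100, 90, 50, 50]
}
index = ['Very Satisfied', 'Satisfied', 'Neutral', 'Unsatisfied', 'Very Unsatisfied']
df = pd.DataFrame(data, index=index)

print("Contingency Table:")
print(df)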

Contingency Table:
                  Smart Thermostat  Smart Light
Very Satisfied                  50           70
Satisfied                       80          100
Neutral                         60           90
Unsatisfied                     30           50
Very Unsatisfied                20           50


## 2. Chi-Square Statistic

In [10]:
chi2_stat, p_val, dof, expected = chi2_contingency(df)

print("\nChi-Square Statistic:", round(chi2_stat, 3))
print("Degrees of Freedom:", dof)
print("P-value:", round(p_val, 4))
print("\nExpected Frequencies:")
print(pd.DataFrame(expected, index=index, columns=df.columns))


Chi-Square Statistic: 5.638
Degrees of Freedom: 4
P-value: 0.2278

Expected Frequencies:
                  Smart Thermostat  Smart Light
Very Satisfied                48.0         72.0
Satisfied                     72.0        108.0
Neutral                       60.0         90.0
Unsatisfied                   32.0         48.0
Very Unsatisfied              28.0         42.0
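
The expected frequencies and the statistic above can be reproduced by hand, which shows what `chi2_contingency` computes. The following is a minimal sketch that assumes NumPy is available (the notebook itself imports only pandas and SciPy); the variable names are illustrative and not part of the original analysis.

```python
import numpy as np

# Observed counts, copied from the contingency table above
# (rows: satisfaction levels; columns: Smart Thermostat, Smart Light).
observed = np.array([
    [50, 70],
    [80, 100],
    [60, 90],
    [30, 50],
    [20, 50],
])

row_totals = observed.sum(axis=1, keepdims=True)  # shape (5, 1)
col_totals = observed.sum(axis=0, keepdims=True)  # shape (1, 2)
grand_total = observed.sum()

# Expected count under independence: row total * column total / grand total.
expected_manual = row_totals * col_totals / grand_total

# Pearson chi-square statistic: sum over all cells of (O - E)^2 / E.
chi2_manual = ((observed - expected_manual) ** 2 / expected_manual).sum()

# Degrees of freedom: (number of rows - 1) * (number of columns - 1).
dof_manual = (observed.shape[0] - 1) * (observed.shape[1] - 1)

print("Expected frequencies:\n", expected_manual)
print("Chi-square statistic:", round(chi2_manual, 3))
print("Degrees of freedom:", dof_manual)
```

By default `chi2_contingency` applies Yates' continuity correction only when there is one degree of freedom, so for this 5 × 2 table the manual value should agree with the SciPy result.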


## 3. Critical Value

In [13]:
alpha = 0.05  # Significance level
critical_value = chi2.ppf(1 - alpha, dof)

print("\nCritical Value (α = 0.05):", round(critical_value, 3))


Critical Value (α = 0.05): 9.488


## 4. Make a Decision

In [16]:
if chi2_stat > critical_value:
    print("\n🚨 Reject the Null Hypothesis: There is a significant association between device type and satisfaction level.")
else:
    print("\n✅ Fail to Reject the Null Hypothesis: No significant association between device type and satisfaction level.")


✅ Fail to Reject the Null Hypothesis: No significant association between device type and satisfaction level.
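
The decision above compares the statistic with the critical value. An equivalent rule compares the p-value with α, and the two should always agree. The sketch below illustrates this using the rounded statistic printed earlier; the names `reject_by_p_value` and `reject_by_critical_value` are illustrative only.

```python
from scipy.stats import chi2

# Values reported by the earlier cells (statistic rounded as printed).
chi2_stat = 5.638
dof = 4
alpha = 0.05

# Right-tail probability of the chi-square distribution at the observed statistic.
p_val = chi2.sf(chi2_stat, dof)
critical_value = chi2.ppf(1 - alpha, dof)

# Two equivalent decision rules.
reject_by_p_value = p_val < alpha
reject_by_critical_value = chi2_stat > critical_value

print("P-value:", round(p_val, 4))
print("Reject H0 (p-value rule):", reject_by_p_value)
print("Reject H0 (critical-value rule):", reject_by_critical_value)
```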


## 5. Conclusion

Conclusion for the Chi-Square Test
We performed a Chi-Square Test of Independence on customer satisfaction levels across two device types (Smart Thermostat and Smart Light), comparing the observed frequencies with those expected under independence.

- Chi-Square Statistic = 5.638
- Degrees of Freedom = 4
- Critical Value (at α = 0.05) = 9.488
- P-value = 0.2278

🎯 Interpretation:
Since the calculated Chi-Square statistic (5.638) is less than the critical value (9.488), or equivalently the p-value (0.2278) is greater than α = 0.05, we fail to reject the null hypothesis.

📌 Final Conclusion:
At the 5% significance level, there is no statistically significant association between the type of smart home device (Smart Thermostat or Smart Light) and the customer satisfaction level.

In simpler terms, the data are consistent with customer satisfaction being independent of the type of device purchased. Failing to reject the null hypothesis does not prove independence; it means this sample gives insufficient evidence of an association.












## Hypothesis Test

## 1. State the Hypothesis

In [21]:
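sample_mean = 4050  # x̄, mean weekly cost observed in the sample
theoretical_mean = 1000 + 5 * 600  # μ = 4000, from W = 1000 + 5X with mean X = 600
std_deviation = 5 * 25  # σ = 125, i.e. 5 × (standard deviation of X = 25)
sample_size = 25  # n, number of restaurants sampled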

print("Step 1: Hypothesis Setup")
print("H₀: μ = 4000 (The model is accurate)")
print("H₁: μ > 4000 (The operating cost is higher than the model predicts)")


Step 1: Hypothesis Setup
H₀: μ = 4000 (The model is accurate)
H₁: μ > 4000 (The operating cost is higher than the model predicts)
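
The values of μ and σ used in the cell above follow from applying the rules for a linear transformation of a random variable to the cost model W = 1000 + 5X. This derivation assumes, as the constants in the code suggest, that X (units per week) has mean 600 and standard deviation 25:

```latex
\begin{aligned}
\mu_W    &= E[1000 + 5X] = 1000 + 5\,E[X] = 1000 + 5(600) = 4000, \\
\sigma_W &= \sqrt{\operatorname{Var}(1000 + 5X)} = 5\,\sigma_X = 5(25) = 125.
\end{aligned}
```

The additive constant 1000 shifts the mean but does not affect the standard deviation. The sample size n = 25 is a separate quantity that happens to share its value with the standard deviation of X.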


## 2. Calculate the Test Statistic

In [24]:
import math
numerator = sample_mean - theoretical_mean
denominator = std_deviation / math.sqrt(sample_size)
z_statistic = numerator / denominator

print("\nStep 2: Test Statistic Calculation")
print(f"Z-statistic = ({sample_mean} - {theoretical_mean}) / ({std_deviation} / √{sample_size})")
print(f"Z-statistic = {z_statistic:.4f}")



Step 2: Test Statistic Calculation
Z-statistic = (4050 - 4000) / (125 / √25)
Z-statistic = 2.0000
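
The calculation above divides the gap between the sample mean and the hypothesized mean by the standard error σ/√n. As a minimal sketch, the same step can be wrapped in a small helper; the function name `one_sample_z` is hypothetical and does not come from the notebook or from SciPy.

```python
import math


def one_sample_z(sample_mean, hypothesized_mean, population_std, sample_size):
    """Return (standard_error, z) for a one-sample Z-test with known sigma."""
    standard_error = population_std / math.sqrt(sample_size)
    z = (sample_mean - hypothesized_mean) / standard_error
    return standard_error, z


# Same inputs as the cell above: x̄ = 4050, μ = 4000, σ = 125, n = 25.
standard_error, z = one_sample_z(4050, 4000, 125, 25)
print(f"Standard error = {standard_error:.1f}")
print(f"Z = {z:.4f}")
```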


## 3. Critical Value

In [27]:
from scipy.stats import norm

# Step 3: Determine the critical value for right-tailed test at α = 0.05
alpha = 0.05
z_critical = norm.ppf(1 - alpha)  # For right-tailed test

print("\nStep 3: Critical Value")
print(f"Critical Z-value at α = {alpha} is: {z_critical:.4f}")



Step 3: Critical Value
Critical Z-value at α = 0.05 is: 1.6449


## 4. Make a Decision

In [30]:
# Step 4: Make a Decision

print("\nStep 4: Decision")
if z_statistic >= z_critical:
    print(f"Since Z = {z_statistic:.4f} ≥ Critical Z = {z_critical:.4f}, we REJECT the null hypothesis.")
else:
    print(f"Since Z = {z_statistic:.4f} < Critical Z = {z_critical:.4f}, we FAIL TO REJECT the null hypothesis.")



Step 4: Decision
Since Z = 2.0000 ≥ Critical Z = 1.6449, we REJECT the null hypothesis.
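
The same decision can be expressed with a p-value, which also shows how sensitive the result is to the chosen significance level. The sketch below is illustrative: the notebook tests only at α = 0.05, and the other α values are added here as an assumption, for comparison.

```python
from scipy.stats import norm

z_statistic = 2.0  # value computed in Step 2

# Right-tailed p-value: P(Z >= z) when the null hypothesis is true.
p_value = norm.sf(z_statistic)
print(f"P-value = {p_value:.4f}")

# Decision at several significance levels (only 0.05 is used in the notebook).
decisions = {}
for alpha in (0.10, 0.05, 0.01):
    decisions[alpha] = bool(p_value < alpha)
    outcome = "reject H0" if decisions[alpha] else "fail to reject H0"
    print(f"alpha = {alpha:.2f}: critical Z = {norm.ppf(1 - alpha):.4f} -> {outcome}")
```

With this p-value the null hypothesis is rejected at α = 0.10 and α = 0.05 but not at α = 0.01, so the conclusion depends on the significance level chosen before the test.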


## 5. Conclusion

In [33]:
# Step 5: Conclusion

print("\nStep 5: Conclusion")
print("There is sufficient statistical evidence at the 5% significance level")
print("to support the restaurant owners' claim that the actual weekly operating costs")
print("are higher than what the cost model (W = $1000 + $5X) suggests.")



Step 5: Conclusion
There is sufficient statistical evidence at the 5% significance level
to support the restaurant owners' claim that the actual weekly operating costs
are higher than what the cost model (W = $1000 + $5X) suggests.


## Overall Conclusion and Analysis

✅ Overall Conclusion (In Simple Words)
The test asked whether the actual weekly cost of running a restaurant is higher than the company’s cost model predicts.

- The company’s model says the weekly cost should be $1000 plus $5 per unit produced (W = $1000 + $5X).
- For 600 units, the predicted cost is $4000.
- Data from 25 restaurants gave an average actual cost of $4050.

Using hypothesis testing:

- Z-statistic = 2.0
- Critical Z-value (right-tailed, at 5% significance) = 1.645

🎯 Since 2.0 > 1.645:
We reject the null hypothesis, so the analysis supports the restaurant owners’ claim.

✅ Final Statement:
At the 5% significance level, there is sufficient evidence that the actual operating costs are higher than what the company's model predicts.
This suggests that the current model understates costs and may need to be revised.











