# Chi-Squared Test

### Null Hypothesis

There is no association between a country's GDP level and its refugee count level.
(Equivalently, GDP level and refugee count level are statistically independent.)
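
For reference, the null hypothesis is tested with the standard Pearson chi-squared statistic, which compares the observed count in each cell with the count expected under independence. The symbols below are notation introduced here for illustration and do not appear in the notebook: $O_{ij}$ is the observed count in row $i$ and column $j$, $E_{ij}$ is the expected count, $R_i$ and $C_j$ are the row and column totals, $N$ is the total number of observations, and $r$ and $c$ are the numbers of rows and columns.

$$
E_{ij} = \frac{R_i \, C_j}{N}, \qquad
\chi^2 = \sum_{i=1}^{r} \sum_{j=1}^{c} \frac{(O_{ij} - E_{ij})^2}{E_{ij}}, \qquad
\text{df} = (r - 1)(c - 1)
$$

With three categories per variable, $r = c = 3$, so the test has $(3 - 1)(3 - 1) = 4$ degrees of freedom.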

### Variables in the Test

Variable 1 (GDP Levels): Categorized into Low, Medium, and High.

Variable 2 (Refugee Count Levels): Categorized into Low, Medium, and High.
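
As a minimal sketch of how quantile-based binning produces these three levels, the example below applies `pd.qcut` to a small made-up series. The values and the names `values` and `levels` are hypothetical and are not taken from `combined.csv`.

```python
import pandas as pd

# Hypothetical toy values for illustration only (not from combined.csv).
values = pd.Series([1, 2, 3, 4, 5, 6, 7, 8, 9])

# Split at the 1/3 and 2/3 quantiles so each level holds about a third of the rows.
levels = pd.qcut(values, 3, labels=['Low', 'Medium', 'High'])

print(levels.value_counts().sort_index())
```

Because the cut points are sample quantiles, "Low", "Medium", and "High" are relative to the countries in the dataset rather than fixed thresholds, and each level ends up with roughly the same number of rows.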

In [9]:
import pandas as pd
import numpy as np
from scipy.stats import chi2_contingency

Significance level: alpha = 0.05

In [10]:
# Load Dataset
combined_df = pd.read_csv('combined.csv')

# Step 1: Categorize GDP and Refugee Counts into Low, Medium, and High
combined_df['gdp_category'] = pd.qcut(combined_df['gdp'], 3, labels=['Low', 'Medium', 'High'])
combined_df['refugee_category'] = pd.qcut(combined_df['total_refugees'], 3, labels=['Low', 'Medium', 'High'])

# Step 2: Create a Contingency Table
contingency_table = pd.crosstab(combined_df['gdp_category'], combined_df['refugee_category'])

# Step 3: Perform Chi-Squared Test
chi2, p, dof, expected = chi2_contingency(contingency_table)

# Step 4: Display Results
print("Contingency Table:")
print(contingency_table)

print("\nChi-Squared Statistic:", chi2)
print("P-Value:", p)
print("Degrees of Freedom:", dof)

if p < 0.05:
    print("\nResult: There is a statistically significant relationship between GDP levels and refugee counts.")
else:
    print("\nResult: There is no statistically significant relationship between GDP levels and refugee counts.")

Contingency Table:
refugee_category  Low  Medium  High
gdp_category                       
Low                39      20    18
Medium             33      29    14
High                5      31    41

Chi-Squared Statistic: 45.57048184746975
P-Value: 3.025541767903403e-09
Degrees of Freedom: 4

Result: There is a statistically significant relationship between GDP levels and refugee counts.


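The p-value (about 3.0e-09) is far below the 0.05 significance level, so we reject the null hypothesis: there is a statistically significant association between GDP levels and refugee count levels.

This means GDP levels and refugee count levels are not independent in this dataset. In the contingency table, high-GDP countries fall mostly in the Medium and High refugee categories (31 and 41 of 77), while low-GDP countries fall most often in the Low category (39 of 77). The test establishes an association only; it does not identify its cause.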
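
A significant p-value does not say how strong the association is or whether the test's assumptions hold. The sketch below re-enters the observed counts from the contingency table printed above, then checks the smallest expected count, the critical value at alpha = 0.05, and Cramér's V as an effect size. These follow-up checks and the names `observed`, `min_expected`, `critical_value`, and `cramers_v` are additions for illustration, not part of the original analysis.

```python
import numpy as np
from scipy import stats

# Observed counts copied from the printed contingency table
# (rows: GDP Low/Medium/High; columns: refugee Low/Medium/High).
observed = np.array([
    [39, 20, 18],
    [33, 29, 14],
    [5, 31, 41],
])

stat, p_value, dof, expected = stats.chi2_contingency(observed)

# Common rule of thumb: the chi-squared approximation is considered
# reasonable when every expected count is at least 5.
min_expected = expected.min()

# Critical value of the chi-squared distribution at alpha = 0.05.
critical_value = stats.chi2.ppf(0.95, dof)

# Cramér's V: effect size between 0 (no association) and 1 (perfect association).
n = observed.sum()
cramers_v = np.sqrt(stat / (n * (min(observed.shape) - 1)))

print(f"Minimum expected count: {min_expected:.2f}")
print(f"Critical value (alpha=0.05, dof={dof}): {critical_value:.3f}")
print(f"Chi-squared statistic: {stat:.3f}")
print(f"Cramér's V: {cramers_v:.3f}")
```

Rejecting the null hypothesis corresponds to the statistic exceeding the critical value, which is equivalent to `p < 0.05`. Cramér's V adds a sense of magnitude that the p-value alone does not give.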