To test whether the average happiness score of the United States (the "Life Ladder" column) differed between Obama's and Biden's presidencies, we run a two-sample t-test on the available data. The null hypothesis is that the two period means are equal; we use Welch's version of the test, which does not assume equal variances.
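
For reference, the hypotheses and the test statistic can be written out as follows. This is a sketch in notation introduced here for illustration (the subscripts $O$ and $B$ for the Obama and Biden periods, $\bar{x}$ for sample means, $s^2$ for sample variances, $n$ for the number of yearly observations); it describes the standard Welch test that `ttest_ind(..., equal_var=False)` performs.

$$
H_0:\ \mu_O = \mu_B \qquad \text{vs.} \qquad H_1:\ \mu_O \neq \mu_B
$$

$$
t = \frac{\bar{x}_O - \bar{x}_B}{\sqrt{\dfrac{s_O^2}{n_O} + \dfrac{s_B^2}{n_B}}},
\qquad
\nu \approx \frac{\left(\dfrac{s_O^2}{n_O} + \dfrac{s_B^2}{n_B}\right)^2}{\dfrac{\left(s_O^2/n_O\right)^2}{n_O - 1} + \dfrac{\left(s_B^2/n_B\right)^2}{n_B - 1}}
$$

The two-sided p-value is then taken from a t-distribution with $\nu$ degrees of freedom.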

In [2]:
import pandas as pd
from scipy.stats import ttest_ind

# Load the data
data = pd.read_csv("../../Dataset/csv_format/combined.csv")

# Extract data for the United States
us_data = data[data["Country name"] == "United States"]

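# Define the periods for Obama's and Biden's presidencies by calendar year.
# Note: 2017 is counted in the Obama period although his term ended on January 20, 2017.
# The Biden period is open-ended: every year from 2021 onward that appears in the data.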
obama_period = (us_data["year"] >= 2009) & (us_data["year"] <= 2017)
biden_period = us_data["year"] >= 2021

# Extract happiness scores for the respective periods
obama_scores = us_data.loc[obama_period, "Life Ladder"]
biden_scores = us_data.loc[biden_period, "Life Ladder"]

# Perform Welch's two-sample t-test (equal_var=False: variances are not assumed equal)
t_statistic, p_value = ttest_ind(obama_scores, biden_scores, equal_var=False)

# Print the results
print(f"Mean happiness score during Obama's presidency: {obama_scores.mean()}")
print(f"Mean happiness score during Biden's presidency: {biden_scores.mean()}")
print(f"t-statistic: {t_statistic}")
print(f"p-value: {p_value}")

# Interpret the results
alpha = 0.05  # Significance level
if p_value < alpha:
    print("The happiness scores in the United States are significantly different during Obama's and Biden's presidencies.")
else:
    print("The happiness scores in the United States are not significantly different during Obama's and Biden's presidencies.")

Mean happiness score during Obama's presidency: 7.058111111111111
Mean happiness score during Biden's presidency: 6.826
t-statistic: 1.6351784592031084
p-value: 0.30530269002168847
The happiness scores in the United States are not significantly different during Obama's and Biden's presidencies.
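
To make the result easier to interpret, the sketch below recomputes Welch's t-statistic and the Welch-Satterthwaite degrees of freedom by hand using only the standard library. The helper name `welch_t` and the two sample lists are hypothetical and chosen for illustration; they are not values from `combined.csv`.

```python
import math
from statistics import mean, variance


def welch_t(a, b):
    """Return Welch's t-statistic and Welch-Satterthwaite degrees of freedom."""
    na, nb = len(a), len(b)
    # Sample variance (n - 1 denominator) of each group, divided by its size
    va = variance(a) / na
    vb = variance(b) / nb
    # Difference in means scaled by the combined standard error
    t = (mean(a) - mean(b)) / math.sqrt(va + vb)
    # Approximate degrees of freedom for unequal variances
    df = (va + vb) ** 2 / (va ** 2 / (na - 1) + vb ** 2 / (nb - 1))
    return t, df


# Hypothetical yearly scores for illustration only (not from the dataset)
group_a = [7.1, 7.0, 7.2, 6.9, 7.1]
group_b = [6.8, 7.0]

t, df = welch_t(group_a, group_b)
print(f"t = {t:.3f}, df = {df:.2f}")
```

When one period contains only a handful of yearly rows, the degrees of freedom stay close to the size of the smaller group minus one, so the test has little power. A non-significant p-value in that situation is weak evidence for "no difference" rather than proof of it.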
