# My question is whether passengers who had family members aboard were more or less likely to survive than passengers who did not.
# An observational study is the better fit here, specifically a retrospective study, because the outcomes have already occurred.
# I will use a census, since answering this question requires data from every passenger.
# For each passenger, a simple logical check assigns them to one of two groups, depending on whether they had family aboard.
# The check uses the SibSp (siblings/spouses aboard) and Parch (parents/children aboard) columns: a passenger has family aboard if either value is greater than 0.
# The Survived column (1 = survived, 0 = did not survive) then shows which group was more likely to survive.

In [19]:
import pandas as pd

titanic_df = pd.read_csv("train.csv")

# Flag passengers with at least one sibling, spouse, parent, or child aboard.
has_family = (titanic_df['SibSp'] > 0) | (titanic_df['Parch'] > 0)

# Survived is coded 0/1, so the mean of each group is its survival rate.
survival_rate_w_family = titanic_df.loc[has_family, 'Survived'].mean()
survival_rate_wo_family = titanic_df.loc[~has_family, 'Survived'].mean()

print("Survival rate with family members aboard:", survival_rate_w_family)
print("Survival rate without family members aboard:", survival_rate_wo_family)

Survival rate with family members aboard: 0.5056497175141242
Survival rate without family members aboard: 0.30353817504655495
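
# To put the two printed rates in perspective, the short sketch below computes their absolute difference and their ratio.
# The two values are copied from the output above; the variable names are only illustrative.

```python
# Survival rates copied from the printed output above.
rate_w_family = 0.5056497175141242
rate_wo_family = 0.30353817504655495

# Absolute gap between the two groups, in proportion points.
rate_difference = rate_w_family - rate_wo_family

# How many times higher the survival rate is for passengers with family aboard.
rate_ratio = rate_w_family / rate_wo_family

print(f"Difference in survival rates: {rate_difference:.3f}")
print(f"Ratio of survival rates: {rate_ratio:.2f}")
```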


# Conclusion: Passengers with family members aboard were more likely to survive.
# Specifically, about 51% of passengers with family members aboard survived.
# Meanwhile, about 30% of passengers without family members aboard survived.
# Our results may be biased because we used the Kaggle dataset, which may contain more data about rich and well-known passengers.
# It also may not contain data about stowaways or crew members.
# It is unknown whether rich or poor passengers were more likely to bring their families aboard the Titanic, so wealth could be confounded with having family aboard.
# Either way, this potential bias makes our conclusions less reliable.
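
# One way to probe the wealth concern is to break the comparison down by ticket class.
# The sketch below assumes the Pclass column (1 = first class, 3 = third class) is a reasonable stand-in for wealth, which is only a rough proxy.
# The function names and the FamilyGroup column are hypothetical helpers, not part of the dataset.

```python
import pandas as pd


def family_share_by_class(df: pd.DataFrame) -> pd.Series:
    """Share of passengers in each ticket class with at least one relative aboard."""
    has_family = (df["SibSp"] > 0) | (df["Parch"] > 0)
    return has_family.groupby(df["Pclass"]).mean()


def survival_by_class_and_family(df: pd.DataFrame) -> pd.DataFrame:
    """Survival rate for each ticket class, split by family status."""
    has_family = (df["SibSp"] > 0) | (df["Parch"] > 0)
    # Readable labels instead of True/False for the resulting column names.
    group = has_family.map({True: "family", False: "alone"})
    return (
        df.assign(FamilyGroup=group)
        .groupby(["Pclass", "FamilyGroup"])["Survived"]
        .mean()
        .unstack()
    )


# Usage with the DataFrame loaded earlier in the notebook:
# print(family_share_by_class(titanic_df))
# print(survival_by_class_and_family(titanic_df))
```

# If the gap between the two groups persists within each class, wealth alone is less likely to explain it; if it shrinks or disappears, class is probably driving part of the difference.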