# T-tests for survey 1 and survey 2

In this notebook, a two-sample t-test (also called the independent samples t-test) is used to compare responses to the corresponding questions in survey 1 and survey 2. For each question, the t-statistic and p-value are calculated with the SciPy library to decide whether the null hypothesis can be rejected at the 0.05 significance level.


**Null Hypothesis (H0)**: There is no difference between the mean responses (the proportion of correct answers) of the two surveys.

**Alternative Hypothesis (H1)**: There is a difference between the mean responses (the proportion of correct answers) of the two surveys.
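To make the test statistic concrete, the sketch below computes the pooled-variance t-statistic by hand for the Q1 data, using only the standard library. The helper name `pooled_t_statistic` is hypothetical and introduced for illustration. It assumes the default behaviour of `scipy.stats.ttest_ind` (`equal_var=True`), which pools the two sample variances.

```python
import math

# Q1 responses as used in this notebook: 1 = correct, 0 = incorrect
survey1 = [1] * 8 + [0] * 13   # 21 responses
survey2 = [1] * 6 + [0] * 16   # 22 responses


def pooled_t_statistic(a, b):
    """Student's two-sample t-statistic with a pooled variance estimate."""
    n1, n2 = len(a), len(b)
    mean1, mean2 = sum(a) / n1, sum(b) / n2
    # Unbiased sample variances (divide by n - 1)
    var1 = sum((x - mean1) ** 2 for x in a) / (n1 - 1)
    var2 = sum((x - mean2) ** 2 for x in b) / (n2 - 1)
    # The pooled variance weights each sample by its degrees of freedom
    pooled_var = ((n1 - 1) * var1 + (n2 - 1) * var2) / (n1 + n2 - 2)
    standard_error = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / standard_error


t_manual = pooled_t_statistic(survey1, survey2)
print(f"Manual t-statistic for Q1: {t_manual:.4f}")
```

The value should agree with the t-statistic that SciPy reports for Q1, up to floating-point rounding.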



The code follows the example in the SciPy documentation for `scipy.stats.ttest_ind`:

URL: https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.ttest_ind.html

## Q1 in survey 1 and survey 2

In [15]:
from scipy import stats

# 1 = correct answer and 0 = incorrect answer in this survey data
survey1_data = [1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0] # 21 responses
survey2_data = [1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0] #22 responses

# Perform two-sample t-test
t_statistic, p_value = stats.ttest_ind(survey1_data, survey2_data)

# Print the t-statistic and p-value
print(f"T-statistic: {t_statistic}, P-value: {p_value}")

if p_value < 0.05:
    print("P-value is less than 0.05, so we reject the null hypothesis and thus there is a statistically significant difference between the responses of the two surveys")
else:
    print("P-value is greater than 0.05, so we fail to reject the null hypothesis and thus there is not enough evidence to suggest a significant difference between the responses of the two surveys")

T-statistic: 0.7442032693390465, P-value: 0.46099772264804917
P-value is greater than 0.05, so we fail to reject the null hypothesis and thus there is not enough evidence to suggest a significant difference between the responses of the two surveys
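
As an optional sensitivity check, the sketch below reruns the Q1 comparison with `equal_var=False`, which makes `scipy.stats.ttest_ind` perform Welch's t-test instead of pooling the variances. This is not part of the original analysis. It only illustrates how to check whether the conclusion depends on the equal-variance assumption.

```python
from scipy import stats

# Q1 responses: 1 = correct, 0 = incorrect
survey1_data = [1] * 8 + [0] * 13  # survey 1: 8 correct out of 21
survey2_data = [1] * 6 + [0] * 16  # survey 2: 6 correct out of 22

# Default: Student's t-test with pooled variance (equal_var=True)
pooled = stats.ttest_ind(survey1_data, survey2_data)
# Welch's t-test: does not assume equal population variances
welch = stats.ttest_ind(survey1_data, survey2_data, equal_var=False)

print(f"Pooled: t = {pooled.statistic:.4f}, p = {pooled.pvalue:.4f}")
print(f"Welch:  t = {welch.statistic:.4f}, p = {welch.pvalue:.4f}")
```

With two samples of nearly equal size and similar variance, the two variants are expected to give very similar results.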


## Q2 in survey 1 and survey 2

In [16]:
from scipy import stats

# 1 = correct answer and 0 = incorrect answer in this survey data
survey1_data = [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0] # 21 responses
survey2_data = [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0] #22 responses

# Perform two-sample t-test
t_statistic, p_value = stats.ttest_ind(survey1_data, survey2_data)

# Print the t-statistic and p-value
print(f"T-statistic: {t_statistic}, P-value: {p_value}")

if p_value < 0.05:
    print("P-value is less than 0.05, so we reject the null hypothesis and thus there is a statistically significant difference between the responses of the two surveys")
else:
    print("P-value is greater than 0.05, so we fail to reject the null hypothesis and thus there is not enough evidence to suggest a significant difference between the responses of the two surveys")

T-statistic: 0.2898034202909603, P-value: 0.7734272413616794
P-value is greater than 0.05, so we fail to reject the null hypothesis and thus there is not enough evidence to suggest a significant difference between the responses of the two surveys


## Q3 in survey 1 and survey 2

In [17]:
from scipy import stats

# 1 = correct answer and 0 = incorrect answer in this survey data
survey1_data = [1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0] # 21 responses
survey2_data = [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0] # 22 responses

# Perform two-sample t-test
t_statistic, p_value = stats.ttest_ind(survey1_data, survey2_data)

# Print the t-statistic and p-value
print(f"T-statistic: {t_statistic}, P-value: {p_value}")

if p_value < 0.05:
    print("P-value is less than 0.05, so we reject the null hypothesis and thus there is a statistically significant difference between the responses of the two surveys")
else:
    print("P-value is greater than 0.05, so we fail to reject the null hypothesis and thus there is not enough evidence to suggest a significant difference between the responses of the two surveys")

T-statistic: -0.7996657148680572, P-value: 0.4285133759134584
P-value is greater than 0.05, so we fail to reject the null hypothesis and thus there is not enough evidence to suggest a significant difference between the responses of the two surveys


## Q4 in survey 1 and survey 2

In [18]:
from scipy import stats

# 1 = correct answer and 0 = incorrect answer in this survey data
survey1_data = [1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0] # 21 responses
survey2_data = [1, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0] # 22 responses

# Perform two-sample t-test
t_statistic, p_value = stats.ttest_ind(survey1_data, survey2_data)

# Print the t-statistic and p-value
print(f"T-statistic: {t_statistic}, P-value: {p_value}")

if p_value < 0.05:
    print("P-value is less than 0.05, so we reject the null hypothesis and thus there is a statistically significant difference between the responses of the two surveys")
else:
    print("P-value is greater than 0.05, so we fail to reject the null hypothesis and thus there is not enough evidence to suggest a significant difference between the responses of the two surveys")

T-statistic: -0.1842820116776752, P-value: 0.8547006953874694
P-value is greater than 0.05, so we fail to reject the null hypothesis and thus there is not enough evidence to suggest a significant difference between the responses of the two surveys


## Q5 in survey 1 and survey 2

In [19]:
from scipy import stats

# 1 = correct answer and 0 = incorrect answer in this survey data
survey1_data = [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0] # 21 responses
survey2_data = [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0] # 22 responses

# Perform two-sample t-test
t_statistic, p_value = stats.ttest_ind(survey1_data, survey2_data)

# Print the t-statistic and p-value
print(f"T-statistic: {t_statistic}, P-value: {p_value}")

if p_value < 0.05:
    print("P-value is less than 0.05, so we reject the null hypothesis and thus there is a statistically significant difference between the responses of the two surveys")
else:
    print("P-value is greater than 0.05, so we fail to reject the null hypothesis and thus there is not enough evidence to suggest a significant difference between the responses of the two surveys")

T-statistic: 0.7728439070175043, P-value: 0.4440475533464042
P-value is greater than 0.05, so we fail to reject the null hypothesis and thus there is not enough evidence to suggest a significant difference between the responses of the two surveys
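
The five cells above repeat the same steps, so the comparison can also be run in a single loop. The sketch below is one possible way to do that. The `counts` dictionary and the `to_responses` helper are hypothetical names introduced here, and the counts are tallied from the response lists in the cells above (number of correct answers, total responses).

```python
from scipy import stats

# (correct answers, total responses) for survey 1 and survey 2,
# tallied from the response lists in the cells above
counts = {
    "Q1": ((8, 21), (6, 22)),
    "Q2": ((17, 21), (17, 22)),
    "Q3": ((7, 21), (10, 22)),
    "Q4": ((8, 21), (9, 22)),
    "Q5": ((13, 21), (11, 22)),
}


def to_responses(correct, total):
    """Rebuild the 0/1 response list: 1 = correct, 0 = incorrect."""
    return [1] * correct + [0] * (total - correct)


results = {}
for question, ((c1, n1), (c2, n2)) in counts.items():
    t_statistic, p_value = stats.ttest_ind(
        to_responses(c1, n1), to_responses(c2, n2)
    )
    results[question] = (t_statistic, p_value)
    verdict = "reject H0" if p_value < 0.05 else "fail to reject H0"
    print(f"{question}: t = {t_statistic:.4f}, p = {p_value:.4f} -> {verdict}")
```

Each line of output should match the per-question results reported above, since the order of responses within a list does not affect the t-test.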
