In [5]:
#1.
#Key factor for testable ideas: whether the idea can be quantified and measured using data.
#Statistical tests require variables that can be observed and analyzed to determine relationships, so abstract
#or unmeasurable concepts cannot be examined statistically.

#Criteria for a good null hypothesis: A good null hypothesis is one that is specific, testable, and falsifiable. 
#It should state that there is no effect or no difference between groups or variables, allowing the researcher to test
#if the data provide evidence to reject this assumption.

#Difference between null and alternative hypotheses:
#The null hypothesis (H₀) asserts that there is no relationship or effect, and it is the default assumption
#that researchers seek to challenge.
#The alternative hypothesis (H₁) proposes that there is a relationship or effect, suggesting
#an outcome different from the null hypothesis.

In [4]:
#2.
#x̄ (Sample Mean): This is the average value from the sample, the small group we actually measure. 
#It's a statistic representing just the sample.

#μ (Population Mean): This is the true average of the entire population, 
#the large group we’re really interested in. Since we can't measure everyone, we estimate this based on the sample.

#μ₀ (Hypothesized Population Mean): This is the specific value that the null hypothesis assigns to the population mean,
#often based on previous research or assumptions. We test whether our data provide evidence against this value.
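
#A minimal sketch of how the three symbols show up in a calculation. The sample values and mu_0 below are
#made up for illustration (they are not from any dataset in this notebook). Note that μ itself never appears
#in the code: it is unknown, and x̄ is our estimate of it.

```python
import numpy as np

# Hypothetical sample of 8 measurements (invented for illustration only)
sample = np.array([5.1, 4.8, 5.6, 5.0, 5.3, 4.9, 5.4, 5.2])
mu_0 = 5.0  # hypothetical value of the population mean under the null hypothesis

x_bar = sample.mean()        # x̄: the sample mean, computed directly from the data
s = sample.std(ddof=1)       # sample standard deviation (n - 1 in the denominator)
n = sample.size

# One-sample t statistic: how many standard errors x̄ lies from μ₀
t_statistic = (x_bar - mu_0) / (s / np.sqrt(n))
print(f"x_bar = {x_bar}, mu_0 = {mu_0}, t = {t_statistic}")
```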

In [None]:
#3.
#When calculating a p-value, we assume that the null hypothesis is true.
#If the null hypothesis (which usually states that there is no effect or difference) is true, then any differences we see in the data
#are due to random variation. The p-value measures how unusual our results are under this "null world."
#A very small p-value means that data at least as extreme as ours would rarely occur if the null hypothesis were true,
#which gives us reason to doubt the null hypothesis.

In [None]:
#4.
#A smaller p-value makes the null hypothesis seem less believable because it shows that the observed data would be very unlikely
#if the null hypothesis were true.

#In hypothesis testing, the null hypothesis usually states that there is no effect or difference. The p-value measures the chance 
#of getting a result as extreme as (or more extreme than) the one from your sample, assuming the null hypothesis is true.
#A smaller p-value means your result is rare if the null hypothesis is correct.

In [34]:
#5.
#Null Hypothesis (H0): Couples have no head tilt preference (50% right, 50% left).
#Observed Data: 80 out of 124 couples (64.5%) tilted their heads to the right.
#Simulation: Perform a large number of simulated trials (e.g., 10,000), where each trial represents flipping a fair coin 124 times
#(simulating 124 couples). Each trial counts how many couples tilt their heads right (i.e., heads in a coin flip).
#Calculate the p-value: The p-value is the proportion of simulated trials that result in 80 or more "heads" (right tilts) out of 124 flips.
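
#The simulation described above could be sketched as follows. The seed and the variable names are my own choices,
#and each "trial of 124 fair coin flips" is drawn as a single binomial count rather than flip by flip.

```python
import numpy as np

rng = np.random.default_rng(130)  # arbitrary seed, chosen only for reproducibility

n_couples = 124          # couples observed
observed_right = 80      # couples who tilted right
n_simulations = 10_000   # number of simulated "null world" trials

# Each entry is the number of right tilts in one trial of 124 fair coin flips
simulated_right = rng.binomial(n=n_couples, p=0.5, size=n_simulations)

# Proportion of trials with 80 or more right tilts (a one-sided p-value)
p_value = np.mean(simulated_right >= observed_right)
print(f"Simulated p-value: {p_value}")
```

#Because 80 right tilts is far above the 62 expected under H0, only a tiny fraction of trials should reach it.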


In [35]:
#6.
A smaller p-value does not definitively prove that the null hypothesis (H0) is false. It simply indicates stronger evidence against H0.
A p-value quantifies the probability of observing the test results, or something more extreme, under the assumption that H0 is true.
A very small p-value (e.g., p<0.05) means the observed data would be unusual if H0 were true, but it does not "prove" anything with certainty [5].

Regarding Fido’s innocence or guilt:

A p-value cannot definitively prove Fido’s innocence or guilt. Even if the p-value is very low or very high, statistical evidence only provides degrees of confidence, not absolute proof.
P-values only inform us about the likelihood of the evidence under the assumption of innocence (null hypothesis), but they cannot prove it outright [6].
Thus, p-values help guide decisions, but they do not definitively prove hypotheses.

In [None]:
#7.
Code Changes:

Identify the part of the code that computes the p-value for the two-tailed test. This typically involves a function call like stats.ttest_ind() or similar.
For a one-tailed test, modify this to compute the p-value based on the specific direction of interest. For example, if you are testing whether one group has a significantly higher mean than another, you would use:

p_value = 1 - stats.t.cdf(t_statistic, df)  # for upper tail

Here, t_statistic is your test statistic and df is the degrees of freedom.

Interpretation Changes:

In a one-tailed test, you are testing for the possibility of the relationship in one direction only (e.g., whether the treatment is greater than control), which typically gives more statistical power compared to a two-tailed test that considers both directions.
The null hypothesis (H0) remains the same, but the alternative hypothesis (H1) is now directional (H1: mean treatment > mean control, for example).

Expectations on p-value:

The p-value for a one-tailed test is expected to be smaller than that for a two-tailed test when the effect is in the specified direction. This is because the one-tailed test counts only the probability in one tail of the distribution; for a symmetric test statistic such as t, the one-tailed p-value is half the two-tailed p-value. Putting the entire alpha level (significance level) on one side therefore increases the likelihood of a statistically significant result if the effect exists in that direction [3].
This adjustment in the approach allows for a more focused test, enhancing the power to detect an effect when one is expected in a specific direction. If the effect lies in the opposite direction, however, the one-tailed p-value will be large.
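
As a rough illustration of the relationship between the two p-values, here is a small sketch. The values of t_statistic and df are hypothetical, picked only to show the arithmetic; stats.t.sf is SciPy's survival function, equivalent to 1 - stats.t.cdf.

```python
import scipy.stats as stats

t_statistic = 2.1  # hypothetical test statistic (not computed from real data)
df = 30            # hypothetical degrees of freedom

# Two-tailed: probability of a statistic at least this extreme in either direction
p_two_tailed = 2 * stats.t.sf(abs(t_statistic), df)

# One-tailed (upper): probability of a statistic at least this large
p_one_tailed = stats.t.sf(t_statistic, df)

print(f"Two-tailed p-value: {p_two_tailed}")
print(f"One-tailed p-value: {p_one_tailed}")
```

With a positive test statistic, the upper-tail p-value is exactly half of the two-tailed one, since the t distribution is symmetric.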

In [37]:
import numpy as np
import scipy.stats as stats

# Set random seed for reproducibility
np.random.seed(42)

# Parameters
n = 80  # sample size
p_observed = 49 / n  # observed proportion
p_null = 0.5  # null hypothesis proportion

# Calculate the test statistic
z = (p_observed - p_null) / np.sqrt((p_null * (1 - p_null)) / n)

# Calculate the p-value for a two-tailed test
p_value = 2 * (1 - stats.norm.cdf(abs(z)))
print(f"Z-test statistic: {z}, P-value: {p_value}")


Z-test statistic: 2.0124611797498115, P-value: 0.044171344908442434
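
# One way to sanity-check the z-test result is to simulate the null hypothesis directly. This is a sketch, not part
# of the original analysis: the seed and the number of simulations are my own choices, and "as extreme" is taken
# to mean a count at least as far from 40 (the expected value under H0) as the observed 49, in either direction.

```python
import numpy as np

rng = np.random.default_rng(42)  # arbitrary seed for reproducibility

n = 80                   # sample size
observed = 49            # observed number of successes
p_null = 0.5             # null hypothesis proportion
n_simulations = 100_000  # number of simulated samples under H0

# Number of successes in each simulated sample of size n under H0
simulated = rng.binomial(n=n, p=p_null, size=n_simulations)

# Two-tailed: count samples at least as far from the expected value as the observed one
expected = n * p_null
p_value_sim = np.mean(np.abs(simulated - expected) >= abs(observed - expected))
print(f"Simulated two-tailed p-value: {p_value_sim}")
```

# The simulated value tends to come out somewhat above the z-test's 0.044, because the z-test uses a normal
# approximation without a continuity correction.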


In [38]:
#Here is the chatbot record: https://chatgpt.com/share/670e07f0-e3cc-800e-83e5-f17549029867