Q1: Difference between t-test and z-test:
- **T-Test:** A t-test is used to compare the means of two groups to determine if there is a statistically significant difference between them. It is typically used when the sample size is small, and the population standard deviation is unknown. The t-test is based on the t-distribution.

  Example Scenario: You want to test if a new drug is effective in reducing blood pressure. You have a small sample of 30 patients, and you measure their blood pressure before and after taking the drug. Because each patient is measured twice, you use a paired t-test, which compares the mean of the before-and-after differences against zero.

- **Z-Test:** A z-test is used when you have a larger sample size (typically n > 30) and know the population standard deviation. It is used to compare sample data to a known population or to compare two large samples. The z-test is based on the standard normal distribution.

  Example Scenario: You are analyzing the weights of a population of adult males. You have a sample of 100 individuals, and you know the population standard deviation. You use a z-test to determine if the sample mean differs significantly from the population mean.

Q2: One-Tailed vs. Two-Tailed Tests:
- **One-Tailed Test:** In a one-tailed test, you are interested in the possibility of an effect in only one direction, either greater than or less than a certain value. It is used when you have a specific hypothesis about the direction of the effect.

  Example: Testing whether a new fertilizer increases crop yield. You are only interested in whether it increases yield, not decreases it.

- **Two-Tailed Test:** In a two-tailed test, you are interested in the possibility of an effect in either direction, greater than or less than a certain value. It is used when you want to determine if there is any significant difference, regardless of the direction.

  Example: Testing whether a new drug affects heart rate. You want to know if it can increase or decrease heart rate.
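
To make the distinction concrete, here is a minimal sketch that turns one test statistic into one-tailed and two-tailed p-values. It assumes a z statistic (standard normal reference distribution); the function name `p_values` and the value 1.8 are hypothetical, chosen only for illustration.

```python
from statistics import NormalDist

def p_values(z: float) -> dict[str, float]:
    """Return one-tailed and two-tailed p-values for a z statistic."""
    std_normal = NormalDist()            # mean 0, standard deviation 1
    upper = 1.0 - std_normal.cdf(z)      # H1: the effect is greater than the null value
    lower = std_normal.cdf(z)            # H1: the effect is less than the null value
    two_sided = 2.0 * min(upper, lower)  # H1: the effect differs in either direction
    return {"greater": upper, "less": lower, "two_sided": two_sided}

result = p_values(1.8)  # hypothetical test statistic
print(result)
```

With this statistic the upper-tail p-value is about 0.036 and the two-tailed p-value is about 0.072, so at α = 0.05 the one-tailed test rejects the null hypothesis while the two-tailed test does not.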

Q3: Type 1 and Type 2 Errors:
- **Type 1 Error (False Positive):** This occurs when you reject a true null hypothesis. In other words, you conclude that there is an effect or difference when there isn't one.

  Example: You mistakenly conclude that a person is guilty of a crime when they are actually innocent (a wrongful conviction).

- **Type 2 Error (False Negative):** This occurs when you fail to reject a false null hypothesis. In other words, you conclude that there is no effect or difference when there is one.

  Example: You fail to detect a serious disease in a patient even though they have it, leading to a missed diagnosis.
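
Both error rates can be estimated by simulation. The sketch below is illustrative only: it assumes normally distributed data with a known standard deviation of 1, a two-tailed z-test of H0: μ = 0, and a hypothetical true effect of 0.3 for the Type 2 case. All names and numbers are assumptions, not part of the questions above.

```python
import random
from statistics import NormalDist

def rejection_rate(true_mean, n=30, sigma=1.0, alpha=0.05, trials=2000, seed=0):
    """Fraction of simulated two-tailed z-tests of H0: mu = 0 that reject H0."""
    rng = random.Random(seed)                      # fixed seed for repeatability
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)   # about 1.96 for alpha = 0.05
    rejections = 0
    for _ in range(trials):
        sample = [rng.gauss(true_mean, sigma) for _ in range(n)]
        z = (sum(sample) / n) / (sigma / n ** 0.5)
        if abs(z) > z_crit:
            rejections += 1
    return rejections / trials

# H0 is true here, so every rejection is a Type 1 error.
type1_rate = rejection_rate(true_mean=0.0)
# H0 is false here, so every failure to reject is a Type 2 error.
type2_rate = 1 - rejection_rate(true_mean=0.3)
print(type1_rate, type2_rate)
```

The Type 1 rate should land near the chosen α, while the Type 2 rate depends on the effect size and the sample size.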

Q4: Bayes's Theorem:
Bayes's Theorem is a mathematical formula used to update the probability for a hypothesis as new evidence becomes available. It is particularly useful in conditional probability and Bayesian statistics. The formula is:

\[P(A|B) = \frac{P(B|A) \cdot P(A)}{P(B)}\]

Example: Suppose you want to find the probability of a patient having a disease (A) given a positive test result (B). You know the probability of a positive test result if the patient has the disease (P(B|A)), the prior probability of the patient having the disease (P(A)), and the overall probability of a positive test result (P(B)). Bayes's Theorem helps you update the probability of the patient having the disease after the test result.

Q5: Confidence Interval:
A confidence interval is a range of values around a sample statistic that is used to estimate a population parameter with a certain level of confidence. It provides a range of values within which the true parameter is likely to fall. The formula to calculate a confidence interval for the population mean (μ) is:

\[CI = \bar{X} \pm Z \cdot \frac{s}{\sqrt{n}}\]

Where:
- \(\bar{X}\) is the sample mean.
- Z is the Z-score associated with the desired confidence level (e.g., 1.96 for a 95% confidence level).
- s is the sample standard deviation.
- n is the sample size.

Example: You have a sample of 100 students, and you want to estimate the average height of all students in your school with a 95% confidence level. You calculate the confidence interval using the formula and find that the height is between 160 cm and 170 cm.

Q6: Bayes's Theorem Example:
Let's say you want to calculate the probability of a patient having a rare disease given some prior information and a new diagnostic test.

- Prior Probability: The probability of the patient having the disease before any test information (P(A) = 0.01, which is 1%).
- Test Sensitivity: The probability of a positive test result when the patient has the disease (P(B|A) = 0.95, which is 95%).
- Test Specificity: The probability of a negative test result when the patient doesn't have the disease (P(not B|not A) = 0.90, which is 90%).
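- Test Positivity: The probability of a positive test result in general, from the law of total probability: P(B) = P(B|A) · P(A) + P(B|not A) · P(not A) = 0.95 × 0.01 + 0.10 × 0.99 = 0.1085, which is 10.85%.

Now, you can use Bayes's Theorem to find the probability of the patient having the disease after the positive test:

\[P(A|B) = \frac{P(B|A) \cdot P(A)}{P(B)}\]
\[P(A|B) = \frac{0.95 \cdot 0.01}{0.1085}\]
\[P(A|B) \approx 0.0876\]

So, given the positive test result, there is approximately an 8.8% probability that the patient has the disease. The probability stays low because the disease is rare: false positives among the 99% of people without the disease outnumber true positives among the 1% who have it.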
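
As a cross-check, the sketch below derives P(B) from the prior, sensitivity, and specificity via the law of total probability instead of taking it as given, then applies Bayes's Theorem. The function name `posterior` is an illustrative choice.

```python
def posterior(prior: float, sensitivity: float, specificity: float) -> float:
    """P(disease | positive test) via Bayes's Theorem."""
    false_positive_rate = 1.0 - specificity  # P(B | not A)
    # Law of total probability: P(B) = P(B|A) P(A) + P(B|not A) P(not A)
    p_positive = sensitivity * prior + false_positive_rate * (1.0 - prior)
    return sensitivity * prior / p_positive

p = posterior(prior=0.01, sensitivity=0.95, specificity=0.90)
print(round(p, 4))  # 0.0876
```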

Q7: Calculate the 95% Confidence Interval:
To calculate the 95% confidence interval for a sample with a mean of 50 and a standard deviation of 5, you can use the formula:

\[CI = \bar{X} \pm Z \cdot \frac{s}{\sqrt{n}}\]

Where:
- \(\bar{X}\) is the sample mean (50).
- Z is the Z-score for a 95% confidence level (1.96).
- s is the sample standard deviation (5).
- n is the sample size (you didn't provide it, so I'll assume n = 30).

Now, plug in the values:

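\[CI = 50 \pm 1.96 \cdot \frac{5}{\sqrt{30}}\]

Calculate the values:

\[CI \approx 50 \pm 1.96 \cdot 0.9129 \approx 50 \pm 1.79\]

This gives a 95% confidence interval of approximately 48.21 to 51.79.

Interpretation: With 95% confidence, the true population mean is estimated to be between 48.21 and 51.79 based on the sample mean of 50, a standard deviation of 5, and the assumed sample size of 30. Because s is estimated from only 30 observations, using the t critical value (about 2.045 for 29 degrees of freedom) instead of 1.96 would give a slightly wider interval.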
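
The same arithmetic as a short sketch, assuming n = 30 as above and the Z-based formula; `mean_confidence_interval` is a hypothetical helper name.

```python
import math

def mean_confidence_interval(mean, sd, n, z=1.96):
    """Z-based confidence interval for a mean: mean +/- z * sd / sqrt(n)."""
    margin = z * sd / math.sqrt(n)
    return mean - margin, mean + margin

low, high = mean_confidence_interval(50, 5, 30)  # n = 30 is an assumed sample size
print(f"{low:.2f} to {high:.2f}")  # 48.21 to 51.79
```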

Q8: Margin of Error and Sample Size:
The margin of error (MOE) is the half-width of a confidence interval, so it measures how precise the estimate is. It is determined by the confidence level, standard deviation, and sample size. The formula for the MOE is:

\[MOE = Z \cdot \frac{s}{\sqrt{n}}\]

Where Z is the critical value associated with the desired confidence level. A larger sample size (n) leads to a smaller MOE. Because n enters through √n, quadrupling the sample size halves the MOE. In other words, as you increase the sample size, the confidence interval becomes narrower and the estimate more precise.

Example: In a political poll, a sample of 1000 respondents results in a smaller MOE than a poll with 100 respondents, by a factor of √10 ≈ 3.16 when the standard deviation and confidence level are the same. A larger sample provides a more precise estimate of the population parameter.
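
A minimal sketch of how the MOE shrinks with sample size. It assumes a 95% confidence level and a standard deviation of 0.5, the worst case for a yes/no poll question; both values and the helper name are assumptions made for illustration.

```python
import math

def margin_of_error(sd, n, z=1.96):
    """Half-width of a Z-based confidence interval."""
    return z * sd / math.sqrt(n)

# Hypothetical poll sizes; sd = 0.5 is the largest possible for a proportion.
for n in (100, 1000, 10000):
    print(n, round(margin_of_error(0.5, n), 4))
```

Each tenfold increase in n divides the MOE by √10, so going from 100 to 10000 respondents cuts it by a factor of 10.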

Q9: Calculate Z-Score:
To calculate the Z-score for a data point with a value of 75, a population mean of 70, and a population standard deviation of 5, use the formula:

\[Z = \frac{X - \mu}{\sigma}\]

Where:
- X is the data point (75).
- μ is the population mean (70).
- σ is the population standard deviation (5).

Plug in the values:

\[Z = \frac{75 - 70}{5}\]
\[Z = \frac{5}{5}\]
\[Z = 1.0\]

Interpretation: A Z-score of 1.0 means that the data point is 1 standard deviation above the mean in the population.
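
The calculation can be sketched in a few lines. The percentile step goes beyond the question and assumes the population is normally distributed.

```python
from statistics import NormalDist

def z_score(x, mu, sigma):
    """Number of standard deviations that x lies above the mean."""
    return (x - mu) / sigma

z = z_score(75, 70, 5)
# Assumption: normally distributed population, so cdf(z) is the share below x.
percentile = NormalDist().cdf(z)
print(z, round(percentile, 4))  # 1.0 0.8413
```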

Q10: Hypothesis Test with a t-test:
To conduct a hypothesis test for the effectiveness of the weight loss drug at a 95% confidence level (significance level α = 0.05) using a t-test, you need more information, specifically the hypothesized population mean (null hypothesis) and the alternative hypothesis. For this example, let's assume μ is the mean weight change:

- Null Hypothesis (H0): The drug has no effect on mean weight (μ = 0).
- Alternative Hypothesis (H1): The drug has an effect on mean weight (μ ≠ 0).

You can perform a two-tailed t-test: compute the t-statistic from the sample mean, standard deviation, and sample size, then compare it to the critical t-value at α = 0.05 with n − 1 degrees of freedom. If the t-statistic falls in the rejection region, you can reject the null hypothesis and conclude that the drug has a statistically significant effect.

Q11: Calculate the 95% Confidence Interval for Proportion:
To calculate the 95% confidence interval for the proportion of people satisfied with their job, use the formula for a confidence interval for a proportion:

\[CI = \hat{p} \pm Z \cdot \sqrt{\frac{\hat{p}(1 - \hat{p})}{n}}\]

Where:
- \(\hat{p}\) is the sample proportion (0.65 for 65%).
- Z is the Z-score for a 95% confidence level (approximately 1.96).
- n is the sample size (500).

Plug in the values:

\[CI = 0.65 \pm 1.96 \cdot \sqrt{\frac{0.65(1 - 0.65)}{500}}\]

Calculate the values:

\[CI \approx 0.65 \pm 1.96 \cdot 0.0213 \approx 0.65 \pm 0.0418\]

This gives a 95% confidence interval of approximately 0.6082 to 0.6918.

Interpretation: With 95% confidence, the true proportion of people satisfied with their job is estimated to be between 60.82% and 69.18% based on the sample of 500 people.
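
A short sketch of the same normal-approximation interval; the helper name is hypothetical.

```python
import math

def proportion_confidence_interval(p_hat, n, z=1.96):
    """Normal-approximation confidence interval for a proportion."""
    standard_error = math.sqrt(p_hat * (1 - p_hat) / n)
    margin = z * standard_error
    return p_hat - margin, p_hat + margin

low, high = proportion_confidence_interval(0.65, 500)
print(f"{low:.4f} to {high:.4f}")  # 0.6082 to 0.6918
```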

Q12: Hypothesis Test for Two Teaching Methods:
To test whether the two teaching methods have a significant difference in student performance, you can use a two-sample t-test. Here's how you conduct the test:

- Null Hypothesis (H0): The two teaching methods produce the same mean student performance (μA = μB).
- Alternative Hypothesis (H1): The two teaching methods produce different mean student performance (μA ≠ μB).

Given:
- Sample A: mean (X̄A) = 85, standard deviation (SA) = 6, sample size (nA) = ?
- Sample B: mean (X̄B) = 82, standard deviation (SB) = 5, sample size (nB) = ?
- Significance level (α) = 0.01.

You need to find the sample sizes for both groups, and then calculate the t-statistic using the formula:

\[t = \frac{\bar{X}_A - \bar{X}_B}{\sqrt{\frac{S_A^2}{n_A} + \frac{S_B^2}{n_B}}}\]

To find the degrees of freedom (df), use:

\[df = \frac{\left(\frac{S_A^2}{n_A} + \frac{S_B^2}{n_B}\right)^2}{\frac{\left(\frac{S_A^2}{n_A}\right)^2}{n_A - 1} + \frac{\left(\frac{S_B^2}{n_B}\right)^2}{n_B - 1}}\]

With the calculated t-statistic and degrees of freedom, you can find the critical t-value for a significance level of 0.01. If the absolute t-statistic is greater than the critical t-value, you can reject the null hypothesis and conclude that the teaching methods have a significant difference in student performance.
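
The two formulas above (the unequal-variance, or Welch, form of the two-sample t-test) can be sketched as follows. The sample sizes are not given, so n = 30 per group is a purely hypothetical placeholder; substitute the real sizes before drawing any conclusion.

```python
import math

def welch_t(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """Return the t-statistic and approximate degrees of freedom."""
    var_a = sd_a ** 2 / n_a   # squared standard error of group A's mean
    var_b = sd_b ** 2 / n_b   # squared standard error of group B's mean
    t = (mean_a - mean_b) / math.sqrt(var_a + var_b)
    df = (var_a + var_b) ** 2 / (var_a ** 2 / (n_a - 1) + var_b ** 2 / (n_b - 1))
    return t, df

# Assumption: 30 students per group (the actual sample sizes are unknown).
t_stat, df = welch_t(85, 6, 30, 82, 5, 30)
print(round(t_stat, 3), round(df, 1))
```

Under that assumed sample size, t comes out near 2.10 with about 56 degrees of freedom. The decision then rests on the two-tailed critical t-value for α = 0.01 at that df, read from a t-table or statistical software.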

Q13: Calculate the 90% Confidence Interval:
To calculate the 90% confidence interval for the true population mean, you can use the formula for a confidence interval for the mean:

\[CI = \bar{X} \pm Z \cdot \frac{\sigma}{\sqrt{n}}\]

Given:
- Population mean (μ) = 60.
- Population standard deviation (σ) = 8.
- Sample mean (X̄) = 65.
- Sample size (n) = 50.
- Confidence level = 90%.

Now, calculate the critical Z-score for a 90% confidence level (Z ≈ 1.645) and use the formula:

\[CI = 65 \pm 1.645 \cdot \frac{8}{\sqrt{50}}\]

Calculate the values:

\[CI \approx 65 \pm 1.645 \cdot 1.1314 \approx 65 \pm 1.861\]

This gives a 90% confidence interval of approximately 63.139 to 66.861.

Interpretation: With 90% confidence, the true population mean is estimated to be between 63.139 and 66.861 based on the sample mean of 65 and a population standard deviation of 8.
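
Rather than looking up the critical value, it can be derived from the confidence level. The sketch below does that with the standard library's `NormalDist`; `z_interval` is a hypothetical helper name.

```python
import math
from statistics import NormalDist

def z_interval(sample_mean, sigma, n, confidence):
    """Confidence interval for a mean when the population sd is known."""
    # For 90% confidence this is the 95th percentile, about 1.645.
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    margin = z * sigma / math.sqrt(n)
    return sample_mean - margin, sample_mean + margin

low, high = z_interval(65, 8, 50, 0.90)
print(f"{low:.2f} to {high:.2f}")  # 63.14 to 66.86
```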

Q14: Hypothesis Test for Caffeine Effects on Reaction Time:
To test if caffeine has a significant effect on reaction time at a 90% confidence level using a t-test, you need to set up the hypotheses:

- Null Hypothesis (H0): Caffeine has no effect on mean reaction time (μ = 0).
- Alternative Hypothesis (H1): Caffeine has an effect on mean reaction time (μ ≠ 0).

Given:
- Sample mean (X̄) = 0.25 seconds.
- Sample standard deviation (s) = 0.05 seconds.
- Sample size (n) = 30.
- Significance level (α) = 0.10 (90% confidence level).

Calculate the t-statistic using the formula:

\[t = \frac{\bar{X} - \mu}{s / \sqrt{n}}\]

Substitute the values:

\[t = \frac{0.25 - 0}{0.05 / \sqrt{30}} \approx \frac{0.25}{0.00913} \approx 27.39\]

Compare this to the critical t-value for a two-tailed test at a 90% confidence level with 29 degrees of freedom, which is approximately 1.699. Because the absolute t-statistic (27.39) is far greater than the critical t-value, you can reject the null hypothesis and conclude that caffeine has a statistically significant effect on reaction time.
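
A minimal sketch of the one-sample t-test decision. The standard library has no t-distribution, so the critical value is hard-coded here as a t-table lookup (two-tailed, α = 0.10, df = 29); the helper name and the constant are illustrative choices.

```python
import math

def one_sample_t(sample_mean, sd, n, mu0=0.0):
    """t-statistic for H0: population mean equals mu0."""
    return (sample_mean - mu0) / (sd / math.sqrt(n))

T_CRIT = 1.699  # assumed t-table value: two-tailed, alpha = 0.10, df = 29

t_stat = one_sample_t(0.25, 0.05, 30)
reject_null = abs(t_stat) > T_CRIT
print(round(t_stat, 2), reject_null)  # 27.39 True
```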