# Background
In quality control, especially for high-value items, destructive sampling is a necessary but costly way to verify product quality. Because the test that determines whether an item meets the quality standard destroys the item, cost constraints keep sample sizes small.
# Scenario
A manufacturer of print-heads for personal computers wants to estimate the mean durability of its print-heads, measured as the number of characters printed before failure. Because the test is destructive, the manufacturer runs the study on a small sample of print-heads.
# Data
A total of 15 print-heads were randomly selected and tested until failure. The durability of each print-head (in millions of characters) was recorded as follows:
1.13, 1.55, 1.43, 0.92, 1.25, 1.36, 1.32, 0.85, 1.07, 1.48, 1.20, 1.33, 1.18, 1.22, 1.29


# a. Build 99% Confidence Interval Using Sample Standard Deviation
Assuming the sample is representative of the population, construct a 99% confidence interval for the mean number of characters printed before the print-head fails using the sample standard deviation. Explain the steps you take and the rationale behind using the t-distribution for this task.


In [2]:
# Part (a): 99% confidence interval for the mean, using the sample standard deviation

import numpy as np
from scipy import stats

# Sample data
sample_data = np.array([1.13, 1.55, 1.43, 0.92, 1.25, 1.36, 1.32, 0.85, 1.07, 1.48, 1.20, 1.33, 1.18, 1.22, 1.29])

# Calculate the sample mean and the sample standard deviation
# (ddof=1 applies the n - 1 divisor; np.std defaults to ddof=0, the population formula)
sample_mean = np.mean(sample_data)
sample_std = np.std(sample_data, ddof=1)

# Calculate the sample size
sample_size = len(sample_data)

# Calculate the standard error of the mean
sem = sample_std / np.sqrt(sample_size)

# Calculate the two-sided critical value for a 99% confidence interval with n - 1 = 14 degrees of freedom
t_critical = stats.t.ppf(1 - 0.01/2, sample_size - 1)

# Calculate the confidence interval
ci_lower = sample_mean - t_critical * sem
ci_upper = sample_mean + t_critical * sem

# Print the confidence interval
print(f"99% confidence interval: ({ci_lower:.2f}, {ci_upper:.2f})")

99% confidence interval: (1.09, 1.39)


# Explanation:

1. We first calculate the sample mean and the sample standard deviation s from the provided sample data.
2. We then estimate the standard error of the mean (SEM), the standard deviation of the sampling distribution of the mean, as s / sqrt(n).
3. We use the t-distribution with n - 1 = 14 degrees of freedom to find the critical value for a 99% confidence interval. The t-distribution is appropriate because the population standard deviation is unknown and has to be estimated from a small sample (n = 15 < 30); its heavier tails account for the extra uncertainty in that estimate. This relies on durability being approximately normally distributed.
4. The critical value is multiplied by the SEM to obtain the margin of error.
5. The margin of error is subtracted from and added to the sample mean to obtain the lower and upper bounds of the confidence interval.
6. The confidence interval is then printed.
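
As a cross-check on the steps above, the same interval can be obtained in one call. This is a minimal sketch, assuming SciPy's `stats.t.interval` and `stats.sem` are available; the variable names `n` and `standard_error` are introduced here for illustration only. Note that `stats.sem` divides by n - 1 by default, so it is based on the sample standard deviation.

```python
import numpy as np
from scipy import stats

sample_data = np.array([1.13, 1.55, 1.43, 0.92, 1.25, 1.36, 1.32, 0.85,
                        1.07, 1.48, 1.20, 1.33, 1.18, 1.22, 1.29])
n = len(sample_data)

# Standard error of the mean; stats.sem uses ddof=1 (the n - 1 divisor) by default
standard_error = stats.sem(sample_data)

# The confidence level is passed positionally because its keyword name
# differs between SciPy versions
ci_lower, ci_upper = stats.t.interval(0.99, df=n - 1,
                                      loc=np.mean(sample_data),
                                      scale=standard_error)

print(f"99% confidence interval: ({ci_lower:.2f}, {ci_upper:.2f})")
```

If the manual calculation and this call disagree, the usual cause is the divisor used for the standard deviation (n versus n - 1).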

# b. Build 99% Confidence Interval Using Known Population Standard Deviation
If it were known that the population standard deviation is 0.2 million characters, construct a 99% confidence interval for the mean number of characters printed before failure.


In [4]:
# Part (b): 99% confidence interval for the mean, using the known population standard deviation (0.2 million characters)

# Sample data
sample_data = np.array([1.13, 1.55, 1.43, 0.92, 1.25, 1.36, 1.32, 0.85, 1.07, 1.48, 1.20, 1.33, 1.18, 1.22, 1.29])

# Calculate the sample mean; the population standard deviation is given, not estimated
sample_mean = np.mean(sample_data)
population_std = 0.2

# Calculate the sample size
sample_size = len(sample_data)

# Calculate the standard error of the mean from the known population standard deviation
sem = population_std / np.sqrt(sample_size)

# Calculate the two-sided critical value for a 99% confidence interval from the standard normal distribution
z_critical = stats.norm.ppf(1 - 0.01/2)

# Calculate the confidence interval
ci_lower = sample_mean - z_critical * sem
ci_upper = sample_mean + z_critical * sem

# Print the confidence interval
print(f"99% confidence interval: ({ci_lower:.2f}, {ci_upper:.2f})")

99% confidence interval: (1.11, 1.37)


# Explanation:

1. We calculate the sample mean from the provided sample data. The standard deviation is not estimated from the sample, because the population value (0.2 million characters) is given.
2. We then calculate the standard error of the mean (SEM) as the population standard deviation divided by sqrt(n), that is, 0.2 / sqrt(15).
3. We use the standard normal (z) distribution to calculate the critical value for a 99% confidence interval. The t-distribution is not needed here: it compensates for the uncertainty of estimating the standard deviation from the sample, and in this part the standard deviation is known. Because the sample is small (n = 15), this relies on durability being approximately normally distributed.
4. The critical value is multiplied by the SEM to obtain the margin of error.
5. The margin of error is subtracted from and added to the sample mean to obtain the lower and upper bounds of the confidence interval.
6. The confidence interval is then printed.
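
When the population standard deviation is known, the usual choice of critical value is the standard normal quantile rather than a t quantile. The following is a minimal sketch of that calculation, assuming SciPy's `stats.norm.interval`; the names `population_std`, `n`, and `standard_error` are introduced here for illustration. It also prints both critical values to show how much wider a t-based interval would be at this sample size.

```python
import numpy as np
from scipy import stats

sample_data = np.array([1.13, 1.55, 1.43, 0.92, 1.25, 1.36, 1.32, 0.85,
                        1.07, 1.48, 1.20, 1.33, 1.18, 1.22, 1.29])
population_std = 0.2  # known population standard deviation, from the problem statement
n = len(sample_data)

# Standard error based on the known population standard deviation
standard_error = population_std / np.sqrt(n)

# z-based interval; the confidence level is passed positionally
ci_lower, ci_upper = stats.norm.interval(0.99,
                                         loc=np.mean(sample_data),
                                         scale=standard_error)
print(f"99% confidence interval: ({ci_lower:.2f}, {ci_upper:.2f})")

# Two-sided 99% critical values: normal versus t with n - 1 degrees of freedom
z_critical = stats.norm.ppf(0.995)
t_critical = stats.t.ppf(0.995, n - 1)
print(f"z critical: {z_critical:.3f}, t critical (df={n - 1}): {t_critical:.3f}")
```

The t critical value is larger than the z critical value, so applying it with a known standard deviation would overstate the width of the interval.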
