**1.) Table 1 shows a frequency distribution of grades on a final examination in college algebra. Find the quartiles of the distribution.**

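The table lists the classes from the highest grades to the lowest, so we first reverse it to run from lowest to highest. The quartile positions n/4, 2n/4, and 3n/4 each fall inside a class, so we interpolate linearly within that class to estimate the quartile values.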

In [32]:
import numpy as np

grades = ["30-39", "40-49", "50-59", "60-69", "70-79", "80-89", "90-100"]
frequencies = [1, 3, 11, 21, 43, 32, 9]

cumulative_frequencies = np.cumsum(frequencies)
n = sum(frequencies)

Q1_position = (n * 1) / 4
Q2_position = (n * 2) / 4
Q3_position = (n * 3) / 4

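def find_quartile(position):
    # Locate the first class whose cumulative frequency reaches the position.
    for i, cf in enumerate(cumulative_frequencies):
        if position <= cf:
            class_interval = grades[i]
            # Lower class limit, e.g. 60 for "60-69".
            lower_bound = int(class_interval.split('-')[0])
            frequency = frequencies[i]
            # Cumulative frequency of all classes below this one.
            if i == 0:
                cumulative_frequency_before = 0
            else:
                cumulative_frequency_before = cumulative_frequencies[i - 1]
            # Interpolate linearly within the class, taking the class width as 10.
            return lower_bound + ((position - cumulative_frequency_before) / frequency) * (10)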

Q1 = find_quartile(Q1_position)
Q2 = find_quartile(Q2_position)
Q3 = find_quartile(Q3_position)

print(f'Quartile 1: {Q1}\nQuartile 2: {Q2}\nQuartile 3: {Q3}\n')

Quartile 1: 67.14285714285714
Quartile 2: 75.5813953488372
Quartile 3: 83.4375
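
The interpolation above starts from the lower class limit (60 for the class 60-69). A common textbook convention starts from the lower class boundary instead (59.5 for that class), on the assumption that grades are recorded as whole numbers. The sketch below shows that variant for comparison; the function name `grouped_quantile` and the `gap` parameter are illustrative choices, not part of the original solution.

```python
# Class limits and frequencies, ordered from the lowest class to the highest.
classes = [(30, 39), (40, 49), (50, 59), (60, 69), (70, 79), (80, 89), (90, 100)]
frequencies = [1, 3, 11, 21, 43, 32, 9]


def grouped_quantile(position, classes, frequencies, gap=1.0):
    """Interpolate a quantile from grouped data using class boundaries.

    Assumption: grades are whole numbers, so each class is widened by
    gap / 2 on both sides (60-69 becomes 59.5-69.5).
    """
    cumulative_before = 0
    for (low, high), freq in zip(classes, frequencies):
        if position <= cumulative_before + freq:
            lower_boundary = low - gap / 2
            width = (high + gap / 2) - lower_boundary
            return lower_boundary + (position - cumulative_before) / freq * width
        cumulative_before += freq
    raise ValueError("position exceeds the total frequency")


n = sum(frequencies)
q1, q2, q3 = (grouped_quantile(n * k / 4, classes, frequencies) for k in (1, 2, 3))
print(f"Q1 = {q1:.2f}, Q2 = {q2:.2f}, Q3 = {q3:.2f}")
```

Under this convention each quartile comes out 0.5 lower than the values printed above (roughly 66.6, 75.1, and 82.9), because only the starting point of the interpolation changes.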



**2.) On a final examination in statistics, the mean grade of a group of 150 students was 78 and the standard deviation was 8.0. In algebra, however, the mean final grade of the group was 73 and the standard deviation was 7.6. In which subject was there the greater (a) absolute dispersion and (b) relative dispersion?**

In [8]:
# Both exams were taken by the same group of 150 students.
mean_stats, std_stats, n_stats = 78, 8.0, 150
mean_algebra, std_algebra, n_algebra = 73, 7.6, 150

print(f'Absolute Dispersion of Statistics Data: {std_stats}\nAbsolute Dispersion of Algebra Data: {std_algebra}\n')

rel_dispersion_stats = (std_stats / mean_stats) * 100
rel_dispersion_algebra = (std_algebra / mean_algebra) * 100
print(f'Relative Dispersion of Statistics Data: {rel_dispersion_stats}\nRelative Dispersion of Algebra Data: {rel_dispersion_algebra}')

Absolute Dispersion of Statistics Data: 8.0
Absolute Dispersion of Algebra Data: 7.6

Relative Dispersion of Statistics Data: 10.256410256410255
Relative Dispersion of Algebra Data: 10.410958904109588
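
As a worked equation, the relative dispersion computed above is the coefficient of variation. The symbol $V$ and the subscripts are notation introduced here for illustration:

$$ V = \frac{s}{\bar{X}} \times 100\% $$
$$ V_{\text{statistics}} = \frac{8.0}{78} \times 100\% \approx 10.26\%; \quad V_{\text{algebra}} = \frac{7.6}{73} \times 100\% \approx 10.41\% $$

So (a) the absolute dispersion is greater in statistics (8.0 > 7.6), while (b) the relative dispersion is greater in algebra (10.41% > 10.26%).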


**3.) Prove that the mean and standard deviation of a set of standard scores are equal to 0 and 1, respectively. Use the following problem to illustrate this: Convert the set 6, 2, 8, 7, 5 into standard scores.**

Consider the set {6, 2, 8, 7, 5}. We will first get the mean and standard deviation of the set so we can convert the data into z-scores.

The **mean**, or first moment, is the sum of the scores divided by the number of entries:
$$ E[X] = \frac{6+2+8+7+5}{5} = \frac{28}{5} $$

To get the standard deviation, we can use the moment formula:
$$ \sigma = \sqrt{E[X^2] - E[X]^2} $$
$$ \sigma = \sqrt{\frac{6^2+2^2+8^2+7^2+5^2}{5} - \left(\frac{28}{5}\right)^2} $$
$$ \sigma = \sqrt{\frac{178}{5} - \frac{784}{25}} = \sqrt{\frac{106}{25}} $$
$$ \sigma = \frac{\sqrt{106}}{5} $$

Now, we can get the z-scores for each entry:
$$ Z = \frac{X - E[X]}{\sigma} $$
$$ Z_1 = \frac{6 - \frac{28}{5}}{\frac{\sqrt{106}}{5}};  Z_2 = \frac{2 - \frac{28}{5}}{\frac{\sqrt{106}}{5}};  Z_3 = \frac{8 - \frac{28}{5}}{\frac{\sqrt{106}}{5}};  Z_4 = \frac{7 - \frac{28}{5}}{\frac{\sqrt{106}}{5}};  Z_5 = \frac{5 - \frac{28}{5}}{\frac{\sqrt{106}}{5}} $$
$$ Z_1 = \frac{\sqrt{106}}{53}; Z_2 = \frac{-9\sqrt{106}}{53}; Z_3 = \frac{6\sqrt{106}}{53}; Z_4 = \frac{7\sqrt{106}}{106}; Z_5 = \frac{-3\sqrt{106}}{106} $$
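
A quick numerical check of these closed forms can be sketched with NumPy. It assumes the population standard deviation (`ddof=0`, dividing by N), which matches the moment formula used above; the variable names are illustrative.

```python
import math
import numpy as np

data = np.array([6, 2, 8, 7, 5], dtype=float)

mu = data.mean()            # 28/5
sigma = data.std(ddof=0)    # sqrt(106)/5, population standard deviation

z_scores = (data - mu) / sigma

# Closed forms taken from the derivation above.
root = math.sqrt(106)
closed_forms = np.array([root / 53, -9 * root / 53, 6 * root / 53,
                         7 * root / 106, -3 * root / 106])

for x, z, exact in zip(data, z_scores, closed_forms):
    print(f"x = {x:.0f}: z = {z:+.4f} (closed form {exact:+.4f})")

print("all match:", np.allclose(z_scores, closed_forms))
```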

We will now get the mean of the obtained standard scores:
$$ \mu = \frac{\frac{\sqrt{106}}{53} + \frac{-9\sqrt{106}}{53} + \frac{6\sqrt{106}}{53} + \frac{7\sqrt{106}}{106} + \frac{-3\sqrt{106}}{106}}{5} $$
$$ \mu = 0 $$

For the standard deviation:
$$ \sigma = \sqrt{E[Z^2] - E[Z]^2} $$
$$ \sigma = \sqrt{\frac{\left(\frac{\sqrt{106}}{53}\right)^2 + \left(\frac{-9\sqrt{106}}{53}\right)^2 + \left(\frac{6\sqrt{106}}{53}\right)^2 + \left(\frac{7\sqrt{106}}{106}\right)^2 + \left(\frac{-3\sqrt{106}}{106}\right)^2}{5} - 0^2} $$
$$ \sigma = \sqrt{1 - 0} $$
$$ \sigma = 1 $$
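
The numbers above illustrate the result for one data set. A sketch of the general argument follows, assuming a set $x_1, \dots, x_N$ with mean $\mu$ and population standard deviation $\sigma > 0$; the symbols $z_i$, $\bar{z}$, and $\sigma_z$ are notation introduced here.

$$ z_i = \frac{x_i - \mu}{\sigma} $$
$$ \bar{z} = \frac{1}{N}\sum_{i=1}^{N} z_i = \frac{1}{\sigma}\left(\frac{1}{N}\sum_{i=1}^{N} x_i - \mu\right) = \frac{\mu - \mu}{\sigma} = 0 $$
$$ \sigma_z^2 = \frac{1}{N}\sum_{i=1}^{N} (z_i - \bar{z})^2 = \frac{1}{\sigma^2} \cdot \frac{1}{N}\sum_{i=1}^{N} (x_i - \mu)^2 = \frac{\sigma^2}{\sigma^2} = 1 $$

Hence any set of standard scores has mean 0 and standard deviation 1, provided the original values are not all equal.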

**4.) Three masses are measured as 20.48, 35.97, and 62.34 g, with standard deviations of 0.21, 0.46, and 0.54 g, respectively. Find the (a) mean and (b) standard deviation of the sum of the masses.**

In [22]:
masses = [20.48, 35.97, 62.34] 
std_devs = [0.21, 0.46, 0.54]

mean_sum = sum(masses)
print(f'The mean of the sum of the masses is {mean_sum}')

The mean of the sum of the masses is 118.79


In [23]:
variance_sum = sum([sd**2 for sd in std_devs])
sdev_sum = (variance_sum)**(1/2)

print(f'The standard deviation of the sum of the masses is {sdev_sum}')

The standard deviation of the sum of the masses is 0.7397972695272672
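
The calculation above adds the variances, which holds when the three measurements are independent. That independence is an assumption here, since the problem does not state it. Written out as a worked equation:

$$ \sigma_{\text{sum}} = \sqrt{\sigma_1^2 + \sigma_2^2 + \sigma_3^2} = \sqrt{0.21^2 + 0.46^2 + 0.54^2} $$
$$ \sigma_{\text{sum}} = \sqrt{0.0441 + 0.2116 + 0.2916} = \sqrt{0.5473} \approx 0.74 \text{ g} $$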


**5.) Find 𝜇 and 𝜎^2. Give the 25 (with replacement) possible samples of size 2, their means, and their probabilities.**

In [29]:
x_values = [6, 9, 12, 15, 18]
probabilities = [0.1, 0.2, 0.4, 0.2, 0.1]

mean_x = sum([x * p for x, p in zip(x_values, probabilities)])
variance_x = sum([((x - mean_x) ** 2) * p for x, p in zip(x_values, probabilities)])

print(f'The mean of the credit hour distribution is {mean_x}')
print(f'The variance of the credit hour distribution is {variance_x}\n')

The mean of the credit hour distribution is 12.0
The variance of the credit hour distribution is 10.8
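
As a cross-check, the same mean and variance can be recomputed with exact rational arithmetic and the shortcut formula E[X^2] - E[X]^2. This is a sketch using Python's standard `fractions` module; writing the probabilities as tenths is an assumption based on the decimal values in the cell above.

```python
from fractions import Fraction

x_values = [6, 9, 12, 15, 18]
# Same probabilities as above, written as exact fractions.
probabilities = [Fraction(1, 10), Fraction(2, 10), Fraction(4, 10),
                 Fraction(2, 10), Fraction(1, 10)]

# A valid probability distribution must sum to exactly 1.
assert sum(probabilities) == 1

mean_x = sum(x * p for x, p in zip(x_values, probabilities))
second_moment = sum(x**2 * p for x, p in zip(x_values, probabilities))
variance_x = second_moment - mean_x**2

print(f"mean = {mean_x} = {float(mean_x)}")
print(f"variance = {variance_x} = {float(variance_x)}")
```

Exact fractions avoid the floating-point rounding that appears when decimal probabilities such as 0.1 are multiplied together.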



In [30]:
samples = []
for x1 in x_values:
    for x2 in x_values:
        samples.append((x1, x2))

# Calculate the mean of each sample
sample_means = [(s[0] + s[1]) / 2 for s in samples]

# Calculate the probability of each sample (product of individual probabilities)
sample_probs = [
    probabilities[x_values.index(s[0])] * probabilities[x_values.index(s[1])]
    for s in samples
]

# Display the results
for sample, mean, prob in zip(samples, sample_means, sample_probs):
    print(f"Sample: {sample}, Mean: {mean}, Probability: {prob}")


Sample: (6, 6), Mean: 6.0, Probability: 0.010000000000000002
Sample: (6, 9), Mean: 7.5, Probability: 0.020000000000000004
Sample: (6, 12), Mean: 9.0, Probability: 0.04000000000000001
Sample: (6, 15), Mean: 10.5, Probability: 0.020000000000000004
Sample: (6, 18), Mean: 12.0, Probability: 0.010000000000000002
Sample: (9, 6), Mean: 7.5, Probability: 0.020000000000000004
Sample: (9, 9), Mean: 9.0, Probability: 0.04000000000000001
Sample: (9, 12), Mean: 10.5, Probability: 0.08000000000000002
Sample: (9, 15), Mean: 12.0, Probability: 0.04000000000000001
Sample: (9, 18), Mean: 13.5, Probability: 0.020000000000000004
Sample: (12, 6), Mean: 9.0, Probability: 0.04000000000000001
Sample: (12, 9), Mean: 10.5, Probability: 0.08000000000000002
Sample: (12, 12), Mean: 12.0, Probability: 0.16000000000000003
Sample: (12, 15), Mean: 13.5, Probability: 0.08000000000000002
Sample: (12, 18), Mean: 15.0, Probability: 0.04000000000000001
Sample: (15, 6), Mean: 10.5, Probability: 0.020000000000000004
Sample: 