
## Task 1 Answers:
### Question 1
**Expected Value of a Binomial Distribution**

We aim to show that when $n$ independent trials are performed and $Y_i$ counts how many of them fall in category $i$, the expected value of $Y_i$ is $E[Y_i] = n p_i$, where:

* $n$ is the total number of trials conducted.
* $p_i$ is the probability that any single trial falls in category $i$ (a "success" for that category).

**Step-by-Step Breakdown:**

1. **Expected Value of a Single Count ($Y_i$):**
   - To find the expected value of the count $Y_i$ (the number of trials that fall in category $i$), we weight every possible value $y_i$ by its probability $P(Y_i = y_i)$:

     $$E[Y_i] = \sum_{y_i=0}^{n} y_i \, P(Y_i = y_i)$$

2. **Probability of Each Outcome ($Y_i$):**
   - Each trial either falls in category $i$ (with probability $p_i$) or does not (with probability $1 - p_i$), so $Y_i$ follows a binomial distribution:

     $$P(Y_i = y_i) = \binom{n}{y_i} p_i^{y_i} (1 - p_i)^{n - y_i}$$

     - $\binom{n}{y_i}$: the number of ways to achieve $y_i$ successes out of $n$ trials.
     - $p_i^{y_i}$: the probability of success in each trial raised to the power of the number of successes ($y_i$).
     - $(1 - p_i)^{n - y_i}$: the probability of failure in each trial raised to the power of the number of failures ($n - y_i$).

3. **Expected Value of a Single Count (Formula):**
   - Combining the probability and outcome for each possible value of $Y_i$, we get the formula for the expected value:

     $$E[Y_i] = \sum_{y_i=0}^{n} y_i \binom{n}{y_i} p_i^{y_i} (1 - p_i)^{n - y_i}$$

4. **Expected Value of a Single Count (Result):**
   - The $y_i = 0$ term is zero, and for $y_i \geq 1$ the identity $y_i \binom{n}{y_i} = n \binom{n-1}{y_i-1}$ lets us factor out $n p_i$. The remaining sum is the total probability of a binomial distribution with $n - 1$ trials, which equals 1. Therefore, the expected value of each count is $E[Y_i] = n p_i$.

5. **Expected Value of the Sum ($Y$):**
   - Let $Y = Y_1 + Y_2 + \dots + Y_k$ be the sum of the counts over all $k$ categories. By linearity of expectation, the expected value of a sum is the sum of the individual expected values. This holds even though the counts are not independent (they must add up to $n$).

6. **Expected Value of the Sum (Formula):**
   - The expected value of the sum ($Y$) is expressed as:

     $$E[Y] = E[Y_1] + E[Y_2] + \dots + E[Y_k]$$

7. **Expected Value of the Sum (Result):**
   - Substituting the expected value of each count ($E[Y_i] = n p_i$):

     $$E[Y] = n p_1 + n p_2 + \dots + n p_k$$

   - Factoring out $n$, we get:

     $$E[Y] = n (p_1 + p_2 + \dots + p_k)$$

   - Since the category probabilities sum to 1, this simplifies to:

     $$E[Y] = n \cdot 1 = n$$

   - This agrees with the fact that every trial falls in exactly one category, so the counts always total $n$.

**Conclusion:**

Therefore, in a series of $n$ independent trials, the expected number of trials that fall in category $i$ is $E[Y_i] = n p_i$, and these expected counts sum to $n$.
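The result $E[Y_i] = n p_i$ can be checked numerically by evaluating the defining sum directly. The sketch below is only an illustration: the values of `n` and `probs` are hypothetical and do not come from the exercise, and `expected_count` is a helper name introduced here.

```python
from math import comb


def expected_count(n: int, p: float) -> float:
    """Evaluate sum_{y=0}^{n} y * C(n, y) * p^y * (1 - p)^(n - y) term by term."""
    return sum(y * comb(n, y) * p**y * (1 - p) ** (n - y) for y in range(n + 1))


# Hypothetical example values, chosen only for illustration.
n = 10
probs = [0.2, 0.3, 0.5]  # category probabilities, summing to 1

means = [expected_count(n, p) for p in probs]
for p, mean in zip(probs, means):
    print(f"p_i = {p}: direct sum = {mean:.6f}, n * p_i = {n * p:.6f}")

# The expected counts over all categories should add up to n.
print(f"sum of expected counts = {sum(means):.6f}")
```

Each direct sum should match $n p_i$ up to floating-point rounding, and the expected counts should total $n$.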


### Question 2:

Given the data:

$$
\begin{array}{lc}
\text{Age} & \text{Proportion} \\
\hline
\text{18-24} & 0.18 \\
\text{25-34} & 0.23 \\
\text{35-44} & 0.16 \\
\text{45-64} & 0.27 \\
\text{65-100} & 0.16 \\
\end{array}
$$

If 500 adults are sampled randomly, we want to find the probability that the sample contains 100 persons between 18 and 24, 200 between 25 and 34, and 200 between 45 and 64.

**Multinomial Distribution for Sample Composition**

Imagine you're drawing a random sample of people from a larger population with different age groups. The multinomial probability formula gives the probability of getting a specific distribution of ages in your sample.

**The Formula Explained:**

The formula looks like this:

$$P(y_1, y_2, y_3, y_4, y_5) = \frac{n!}{y_1! \, y_2! \, y_3! \, y_4! \, y_5!} \, p_1^{y_1} p_2^{y_2} p_3^{y_3} p_4^{y_4} p_5^{y_5}$$

* $P(y_1, y_2, y_3, y_4, y_5)$: the probability of getting a specific distribution of individuals across the age groups. For example, $P(100, 200, 0, 200, 0)$ is the probability of having 100 people in the 18-24 age group, 200 each in the 25-34 and 45-64 groups, and none in the 35-44 and 65-100 groups.
* $n$: the total number of people in the sample (here, 500).
* $y_i$: the number of sampled individuals in the $i$-th age group (e.g., $y_1$ is the number of people in the 18-24 age group).
* $p_i$: the proportion of people in the $i$-th age group in the entire population (e.g., $p_1 = 0.18$ is the proportion of people aged 18-24).
* $!$: the factorial. For example, $5! = 5 \times 4 \times 3 \times 2 \times 1 = 120$.

For the sample in this question, $n = 500$ and $(y_1, y_2, y_3, y_4, y_5) = (100, 200, 0, 200, 0)$, so the required probability is:

$$P(100, 200, 0, 200, 0) = \frac{500!}{100! \, 200! \, 0! \, 200! \, 0!} (0.18)^{100} (0.23)^{200} (0.16)^{0} (0.27)^{200} (0.16)^{0}$$

**Expected Number in an Age Group:**

 We can also calculate the expected number of people in a specific age group in a random sample. This is simply the total sample size (n) multiplied by the proportion ($p_i$) of that age group in the population.

 For example, if you're drawing a sample of 500 people and the proportion of people aged 65 and above in the population is 0.16, you would expect to have around:

$E[Y_5] = n p_5 = 500 \times 0.16 = 80$ people aged 65 and above.
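
The multinomial probability for the requested sample involves factorials as large as $500!$, so it is easier to evaluate on a log scale. The following is a minimal sketch, assuming the age groups are ordered as in the table; `multinomial_log_pmf` is a helper name introduced here, and `math.lgamma(k + 1)` is used as $\ln(k!)$.

```python
from math import exp, lgamma, log


def multinomial_log_pmf(counts, probs):
    """Natural log of the multinomial probability of `counts` given `probs`."""
    n = sum(counts)
    # lgamma(k + 1) equals ln(k!), which avoids computing huge factorials.
    log_coef = lgamma(n + 1) - sum(lgamma(y + 1) for y in counts)
    # A category with a zero count contributes p^0 = 1, so it is skipped.
    log_prob = sum(y * log(p) for y, p in zip(counts, probs) if y > 0)
    return log_coef + log_prob


probs = [0.18, 0.23, 0.16, 0.27, 0.16]  # proportions from the table
counts = [100, 200, 0, 200, 0]          # 18-24, 25-34, 35-44, 45-64, 65-100

log_p = multinomial_log_pmf(counts, probs)
print(f"ln P = {log_p:.3f}")
print(f"P    = {exp(log_p):.3e}")
```

The probability comes out vanishingly small, which is plausible: the expected counts are 90, 115, 80, 135, and 80, so a sample with two empty age groups is far from typical.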
## Task 2 Answers:

### Question 1: Probability Distribution

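The probability distribution for an experiment consisting of $n$ independent trials, each resulting in success ($S$) or failure ($F$), can be derived as follows:

Any particular sequence containing $y$ successes and $n - y$ failures has probability $p^y q^{n-y}$, because the trials are independent and the success probability is the same on every trial. There are $\binom{n}{y}$ such sequences, one for each choice of which trials are the successes. Adding their probabilities gives the binomial probability formula: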

$$P(Y=y) = \binom{n}{y} p^y q^{n-y}$$

where:

* $p$ is the probability of success on a single trial.
* $q = (1 - p)$ is the probability of failure on a single trial.
* $n$ is the total number of trials.
* $y$ is the number of successes observed.

This formula represents the experiment's probability distribution.
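
The counting argument behind the binomial formula can be checked by brute force for a small number of trials. The sketch below enumerates every success/failure sequence and groups the probabilities by the number of successes; the values `n = 4` and `p = 0.3` are hypothetical, and both function names are introduced here for illustration.

```python
from itertools import product
from math import comb


def pmf_by_enumeration(n: int, p: float) -> list:
    """Add up the probabilities of all 2^n S/F sequences, grouped by success count."""
    q = 1 - p
    totals = [0.0] * (n + 1)
    for seq in product("SF", repeat=n):
        y = seq.count("S")                # number of successes in this sequence
        totals[y] += p**y * q ** (n - y)  # probability of this exact sequence
    return totals


def pmf_by_formula(n: int, p: float) -> list:
    """Binomial formula C(n, y) * p^y * q^(n - y) for y = 0, ..., n."""
    q = 1 - p
    return [comb(n, y) * p**y * q ** (n - y) for y in range(n + 1)]


n, p = 4, 0.3  # hypothetical example values
enumerated = pmf_by_enumeration(n, p)
formula = pmf_by_formula(n, p)
for y, (a, b) in enumerate(zip(enumerated, formula)):
    print(f"y = {y}: enumeration = {a:.6f}, formula = {b:.6f}")
```

The two columns should agree up to rounding, and the probabilities should sum to 1.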

### Question 2: Expectation of the Distribution

We want to show that the expectation of this probability distribution is $E[Y] = \sum_y y \cdot P(Y=y) = np$.

By definition, the expectation of $Y$ is:

$$E[Y] = \sum_{y=0}^{n} y \cdot P(Y=y)$$

Substituting the binomial probability formula, we get:

$$E[Y] = \sum_{y=0}^{n} y \cdot \binom{n}{y} p^y q^{n-y}$$

Using the identity $y \binom{n}{y} = n \binom{n-1}{y-1}$ and the binomial theorem, this summation simplifies to $np$, proving that $E[Y] = np$.
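
Written out line by line, the simplification might look like this. The index $z = y - 1$ is introduced here only to shift the sum and is not part of the original notation:

$$
\begin{aligned}
E[Y] &= \sum_{y=0}^{n} y \binom{n}{y} p^y q^{n-y} \\
&= \sum_{y=1}^{n} n \binom{n-1}{y-1} p^y q^{n-y} && \text{(the } y = 0 \text{ term is zero; } y \tbinom{n}{y} = n \tbinom{n-1}{y-1}\text{)} \\
&= np \sum_{z=0}^{n-1} \binom{n-1}{z} p^{z} q^{(n-1)-z} && \text{(substitute } z = y - 1\text{)} \\
&= np \, (p + q)^{n-1} && \text{(binomial theorem)} \\
&= np && \text{(since } p + q = 1\text{)}
\end{aligned}
$$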

### Question 3: Probability of at Least 9 Recoveries

If the medication is worthless, each person still recovers with probability $p = 0.3$, so the number of recoveries $Y$ among the $n = 10$ people in the experiment is binomial. We need the probability of at least 9 recoveries:

$$P(Y \geq 9) = P(Y = 9) + P(Y = 10)$$

First, calculate $P(Y = 9)$ and $P(Y = 10)$:

$$P(Y = 9) = \binom{10}{9} (0.3)^9 (0.7)^1 \approx 0.00013778$$
$$P(Y = 10) = (0.3)^{10} \approx 0.0000059049$$

Therefore:

$$P(Y \geq 9) \approx 0.00013778 + 0.0000059049 \approx 0.00014369$$
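
The two binomial terms are easy to mistype by hand, so a short numerical check can help. This is a minimal sketch using only the standard library; `binomial_pmf` is a helper name introduced here.

```python
from math import comb


def binomial_pmf(y: int, n: int, p: float) -> float:
    """P(Y = y) for a binomial distribution with n trials and success probability p."""
    return comb(n, y) * p**y * (1 - p) ** (n - y)


n, p = 10, 0.3  # 10 people, 30% recovery chance each

p9 = binomial_pmf(9, n, p)
p10 = binomial_pmf(10, n, p)
tail = p9 + p10

print(f"P(Y = 9)  = {p9:.10f}")
print(f"P(Y = 10) = {p10:.10f}")
print(f"P(Y >= 9) = {tail:.10f}")
```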

### Question 4: Probability of at Least One Defective Fuse

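A lot contains 5000 fuses with a 5% defect rate ($p = 0.05$), and we test 5 fuses ($n = 5$). Because the sample is tiny relative to the lot, the draws can be treated as independent and the number of defective fuses $Y$ as binomial. We need $P(Y \geq 1)$, which can be found using the complement rule: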

$$P(Y \geq 1) = 1 - P(Y = 0)$$

First, compute $P(Y = 0)$:

$$P(Y = 0) = \binom{5}{0} (0.05)^0 (0.95)^5 \approx 0.77378$$

Therefore:

$$P(Y \geq 1) = 1 - 0.77378 \approx 0.22622$$

The probability of observing at least one defective fuse in a sample of 5 is approximately 0.22622 (22.62%).
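
Since the fuses are drawn without replacement, the binomial model is an approximation. The sketch below compares it with the exact hypergeometric probability, under the assumption that exactly 5% of the lot (250 fuses) is defective; that count is inferred from the stated defect rate rather than given directly.

```python
from math import comb

lot_size = 5000
defective = 250  # assumption: exactly 5% of the 5000 fuses are defective
sample = 5

# Binomial model: each tested fuse is good with probability 0.95, independently.
p_none_binomial = 0.95**sample

# Exact model (sampling without replacement): all 5 tested fuses come from the good ones.
p_none_exact = comb(lot_size - defective, sample) / comb(lot_size, sample)

print(f"binomial: P(Y >= 1) = {1 - p_none_binomial:.5f}")
print(f"exact:    P(Y >= 1) = {1 - p_none_exact:.5f}")
```

The two answers should differ only slightly, which supports using the binomial formula for a lot this large.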

### Task 3 Answers:
#### Question 1:

We will derive the expected value (mean) $E[Y]$ of a Poisson random variable $Y$ with parameter $\lambda$.

**Poisson Probability Mass Function:**

The Poisson distribution describes the probability of getting a specific number of events (e.g., successes) in a fixed interval. The probability mass function (PMF) defines the probability $p(y)$ of each possible outcome ($y$):

\begin{equation}
p(y) = \frac{\lambda^y}{y!} e^{-\lambda}, \quad y = 0, 1, 2, \ldots
\end{equation}

where:

* $y$ is the number of events (e.g., successes)
* $\lambda$ is the parameter of the distribution
* $e$ is the mathematical constant (approximately 2.718)

**Expected Value Formula:**

The expected value, denoted by $E[Y]$, represents the average outcome we expect to see over many trials from this distribution. It's calculated by summing the product of each possible outcome ($y$) and its corresponding probability ($p(y)$):

\begin{equation}
E[Y] = \sum_{y=0}^{\infty} y \cdot p(y)
\end{equation}

**Deriving the Expected Value:**

We substitute the Poisson PMF into the expected value formula:

\begin{equation}
E[Y] = \sum_{y=0}^{\infty} y \cdot \frac{\lambda^y}{y!} e^{-\lambda}
\end{equation}

The $y = 0$ term of the sum is zero, so the summation can start at $y = 1$. For $y \geq 1$ we can rewrite $y \cdot \frac{\lambda^y}{y!}$ as $\lambda \cdot \frac{\lambda^{y-1}}{(y-1)!}$, factor out $\lambda e^{-\lambda}$, and shift the index with the substitution $z = y - 1$.

These transformations lead to:

\begin{align*}
E[Y] &= \sum_{y=0}^{\infty} y \cdot \frac{\lambda^y}{y!} e^{-\lambda} \\
&= \sum_{y=1}^{\infty} \lambda \cdot \frac{\lambda^{y-1}}{(y-1)!} e^{-\lambda} && \text{(the } y = 0 \text{ term is zero)} \\
&= \lambda e^{-\lambda} \sum_{y=1}^{\infty} \frac{\lambda^{y-1}}{(y-1)!} \\
&= \lambda e^{-\lambda} \sum_{z=0}^{\infty} \frac{\lambda^z}{z!} && \text{(substitute } z = y - 1\text{)}
\end{align*}

**Recognizing the Summation:**

The resulting summation $\sum_{z=0}^{\infty} \frac{\lambda^z}{z!}$ is the Taylor series expansion of $e^{\lambda}$.

**Conclusion:**

Since the summation represents $e^{\lambda}$, we can substitute:

\begin{equation}
E[Y] = \lambda e^{-\lambda} \cdot e^{\lambda} = \lambda
\end{equation}

Therefore, the expected value of a Poisson random variable with parameter $\lambda$ is simply $\lambda$ itself.
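
The result $E[Y] = \lambda$ can be checked numerically by truncating the infinite sum. This is a minimal sketch: `poisson_mean` and its `terms` cutoff are introduced here for illustration, and the cutoff of 200 terms is an assumption that is adequate only for moderate values of $\lambda$.

```python
from math import exp


def poisson_mean(lam: float, terms: int = 200) -> float:
    """Approximate sum_{y >= 0} y * p(y) using the first `terms` terms."""
    pmf = exp(-lam)  # p(0) = e^(-lambda)
    total = 0.0
    for y in range(terms):
        total += y * pmf
        pmf *= lam / (y + 1)  # p(y + 1) = p(y) * lambda / (y + 1)
    return total


for lam in (0.5, 5.0, 12.0):
    print(f"lambda = {lam}: truncated sum = {poisson_mean(lam):.10f}")
```

Updating the PMF recursively avoids computing large factorials and powers separately.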




#### Question 2:

In this problem, we are given that the mean density of seedlings is $(\lambda = 5)$ seedlings per square yard. A forester randomly locates ten 1-square-yard sampling regions. Our objective is to find the probability that none of these regions will contain seedlings.

Assuming the numbers of seedlings in different regions are independent, we can model the number of seedlings in each 1-square-yard region as a Poisson random variable with $\lambda = 5$.


The probability of finding zero seedlings in a single region can be calculated using the Poisson probability mass function (PMF):

$$P(Y = 0) = \frac{\lambda^0}{0!} e^{-\lambda} = e^{-\lambda}$$

In this case, $\lambda = 5$, so the probability of no seedlings in a single region is:

$$P(Y = 0) = e^{-5}$$

**Probability of No Seedlings (All Regions):**

Since we have ten independent regions, and we're interested in the probability that none of them have seedlings, we consider the probability for a single region (no seedlings) raised to the power of ten (number of regions).

This gives us the probability that none of the ten regions contain seedlings:

$$P(\text{none of the regions contain seedlings}) = (e^{-5})^{10} = e^{-50}$$

**Conclusion:**

Therefore, the probability of finding no seedlings in any of the ten 1-square-yard sampling regions is:

$$\boxed{e^{-50} \approx 1.93 \times 10^{-22}}$$
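
As a quick numerical check, the sketch below evaluates the answer two ways: as the single-region probability raised to the tenth power, and as the zero-count probability of one Poisson variable with mean $10 \times 5 = 50$. The second route relies on the fact that a sum of independent Poisson counts is itself Poisson, which is not stated in the solution above.

```python
from math import exp

lam = 5       # mean seedlings per square yard
regions = 10  # number of independent 1-square-yard regions

p_single = exp(-lam)              # P(no seedlings in one region)
p_all = p_single**regions         # all ten regions empty, by independence
p_pooled = exp(-lam * regions)    # P(total count = 0) with mean 50

print(f"per region:       {p_single:.6f}")
print(f"all ten regions:  {p_all:.4e}")
print(f"pooled Poisson:   {p_pooled:.4e}")
```

Both routes give the same value, confirming that an empty sample of ten regions would be extraordinarily unlikely at this seedling density.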


