[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://githubtocolab.com/CU-Denver-MathStats-OER/Statistical-Theory/blob/main/Chap4/14-Estimation-MOM.ipynb) <nbsp>

Estimation is generally the process of predicting the value(s) of unknown population parameters using data collected from a sample. Different statistical methods can be applied to the same data set in pursuit of the same goal: an accurate estimate of a parameter. We explored the method of [maximum likelihood estimation (MLE)](13-Estimation-MLE.qmd). With MLE, we find the most likely value of a parameter, $\theta$. That is, we find the value of $\theta$ that maximizes a likelihood function, which gives the probability (or, for continuous data, the density) of randomly selecting the observed sample data.

We now explore another estimation method called the <span style="color:dodgerblue">**method of moments**</span> (commonly abbreviated as <span style="color:dodgerblue">**MoM**</span>). Both MLE and MoM estimates are used to answer the same question: what is the value of an unknown population parameter, $\theta$? MoM estimates approach the question with a different objective than MLE. We find the values for the population parameters such that certain properties of the population (for example $\mu_X=E(X)$ and $\sigma^2_X = \mbox{Var}(X)$) are equal to the corresponding statistics (such as the sample mean, $\bar{x}$, and sample variance, $s^2$) that we calculate from a randomly selected sample.


# ggg Building a Model for Bear Cub Weight zzz

From the code cell above, we have generated the following sample of cub birth weights (in ounces) that are stored in `cub$wt`,

$$x = (9.71, 7.77, 8.47, 7.35, 7.83, 9.06, 8.66, 8.74, 10.82, 8.27 ).$$

Our goal is to find the "best" description of the distribution of birth weights in the population (all black bear cubs). The interpretation of "best" depends on the context of the question and can mean different things to different statisticians.
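The sample statistics that a moment-based approach relies on can be computed directly from the ten weights listed above. The following is a minimal sketch, assuming the values are retyped into a standalone vector named `wt` (a hypothetical stand-in for `cub$wt`, so the chunk runs without the earlier code cell).

```{r}
# Sample of cub birth weights (ounces), copied from the values listed above.
# `wt` is an illustrative standalone vector, not the `cub` data frame itself.
wt <- c(9.71, 7.77, 8.47, 7.35, 7.83, 9.06, 8.66, 8.74, 10.82, 8.27)

n <- length(wt)    # sample size
xbar <- mean(wt)   # sample mean (first sample moment)
s2 <- var(wt)      # sample variance (divides by n - 1)
m2 <- mean(wt^2)   # second sample moment (average of squared weights)

c(n = n, mean = xbar, var = s2, m2 = m2)
```

Note that `m2 - xbar^2` equals `s2 * (n - 1) / n`, so the second sample moment and the sample variance carry the same information once the sample mean is known.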


## ggg Question 1 zzz

Which of the models labeled 1-4 in the plot above do you believe best fits the sample of cub birth weights?


#### ggg Solution to Question 1a zzz

What type of continuous distribution best matches the graph you selected? Explain why in terms of birth weights of black bear cubs this distribution is reasonable and makes practical sense.


- *Hint: See the [appendix of common continuous random variables section](08-Common-Continuous-Distributions.qmd#sec-append) for some options.*


#### ggg Solution to Question 1b zzz

```{r}
# be sure you have already run the first code cell and 
# stored sample weights to variable `wt` in data frame `cub`


```

## ggg Identifying Key Properties for Our Model zzz

Let $X$ be a random variable with pdf $f(x)$. For a positive integer $k$, the <span style="color:dodgerblue">**k^th^ theoretical moment of $X$**</span> is $\color{dodgerblue}{\mu_k = E \left( X^k \right) }$.

$$\color{dodgerblue}{\boxed{\mu_k = E \left( X^k \right) = \int_{-\infty}^{\infty} x^kf(x) \, dx \ \ \ \mbox{(for continuous)} \qquad \mbox{or} \qquad  \mu_k = E \left( X^k \right) = \sum_x x^kp(x) \ \ \ \mbox{(for discrete)}}}$$

- The  <span style="color:dodgerblue">**first moment**</span> is $\color{dodgerblue}{\mu_1 = E \left( X \right) }$.
    - $\mu_1$ is the <span style="color:dodgerblue">**mean**</span>. 

- The  <span style="color:tomato">**second moment**</span> is $\color{tomato}{\mu_2 = E \left( X^2 \right) }$.
    - $\mu_2$ is related *(but not equal)* to the <span style="color:tomato">**variance**</span>. 
    - If we can find $\mbox{Var}(X)$ and have computed the first theoretical moment, $\mu_1$, we have:

$${\color{tomato}{\mu_{2}}} = \mbox{Var}(X) + \mu_1^2 \qquad \mbox{since} \qquad \mbox{Var}(X) = E \big( (X-\mu_1)^2 \big) = {\color{tomato}{E(X^2)}} - \mu_1^2 = {\color{tomato}{\mu_{2}}} - \mu_1^2.$$

- The  <span style="color:mediumseagreen">**third moment**</span> is $\color{mediumseagreen}{\mu_3 = E \left( X^3 \right) }$.
  - $\mu_3$ is related to the <span style="color:mediumseagreen">**skewness**</span> of $X$, which is defined as the standardized third central moment, $E \big( (X-\mu_1)^3 \big) / \sigma^3$, where $\sigma^2 = \mbox{Var}(X)$.


![Diva Jain, [CC BY-SA 4.0](https://creativecommons.org/licenses/by-sa/4.0), via [Wikimedia Commons](https://commons.wikimedia.org/wiki/File:Relationship_between_mean_and_median_under_different_skewness.png)](https://upload.wikimedia.org/wikipedia/commons/c/cc/Relationship_between_mean_and_median_under_different_skewness.png){fig-align="center" width=80% fig-alt="Comparing Plots of Skewness"}


- The  <span style="color:mediumpurple">**fourth moment**</span> is $\color{mediumpurple}{\mu_4 = E \left( X^4 \right) }$.
  - $\mu_4$ is related to the <span style="color:mediumpurple">**kurtosis**</span> of $X$, which is defined as the standardized fourth central moment, $E \big( (X-\mu_1)^4 \big) / \sigma^4$, where $\sigma^2 = \mbox{Var}(X)$.
    - Informally, the kurtosis measures how "peaky" or flat the distribution is.
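To make the moment definitions above concrete, the sketch below evaluates them numerically with R's `integrate()`. The choice of distribution, $X \sim \mbox{Exp}(\lambda = 2)$, and the helper name `theo_moment` are assumptions made only for illustration. Skewness is computed here as the third central moment divided by $\sigma^3$.

```{r}
# Assumption for illustration only: X ~ Exp(rate = 2),
# so f(x) = 2 * exp(-2 * x) for x > 0.
rate <- 2

# k-th theoretical moment E(X^k), found by numerically integrating x^k * f(x)
theo_moment <- function(k) {
  integrate(function(x) x^k * dexp(x, rate = rate),
            lower = 0, upper = Inf)$value
}

mu1 <- theo_moment(1)   # first moment (mean)
mu2 <- theo_moment(2)   # second moment
var_x <- mu2 - mu1^2    # variance recovered from the first two moments

# third central moment, then standardized by sigma^3
mu3_central <- integrate(function(x) (x - mu1)^3 * dexp(x, rate = rate),
                         lower = 0, upper = Inf)$value
skew <- mu3_central / var_x^(3 / 2)

c(mu1 = mu1, mu2 = mu2, var = var_x, skewness = skew)
```

For an exponential distribution with rate 2, theory gives $\mu_1 = 0.5$, $\mu_2 = 0.5$, $\mbox{Var}(X) = 0.25$, and skewness $2$, so the numerical values should agree with these up to integration error.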


## ggg Sample Moments zzz

Let $X$ be a random variable with pdf $\displaystyle f(x; \lambda, \delta)=\lambda e^{-\lambda(x-\delta)}$ for $x > \delta$ with parameters $\lambda, \delta >0$.


### ggg Question 2a zzz

$$\mu_1 = E(X) = \int_0^{\infty} \left(   ?? \right)  \, dx$$
$$\mu_2 = E(X^2) = \int_0^{\infty} \left(   ?? \right)  \, dx$$


### ggg Question 2b zzz

<br>  
<br>  
<br>  


### ggg Question 2c zzz

<br>  
<br>  
<br>  



# ggg Defining the Method of Moments Estimate zzz

## ggg Question 3 zzz

<br>  
<br>  
<br>  




## ggg Question 4 zzz

<br>  
<br>  
<br>  


## ggg Question 5 zzz

<br>
<br>
<br>

## ggg Question 6 zzz

Derive the following general formulas for the MoM estimates of $\alpha$ and $\beta$:

$$\hat{\alpha}_{\rm{MoM}} = M_1 - \sqrt{3} \left( \sqrt{ M_2 - M_1^2} \right)$$
$$\hat{\beta}_{\rm{MoM}} = M_1 + \sqrt{3} \left( \sqrt{ M_2 - M_1^2} \right)$$

where $M_1=\bar{x} = \frac{1}{n}\sum_{i=1}^n x_i$ and $M_2= \frac{1}{n} \sum_{i=1}^n x_i^2$ denote the first and second sample moments, respectively. Find the MoM estimates of $\alpha$ and $\beta$. 
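As a sanity check on the formulas stated above (not a derivation), the sketch below applies them to a small made-up sample. The function name `mom_unif` and the vector `x_demo` are hypothetical and are not the sample from the earlier questions.

```{r}
# MoM estimates for Unif(alpha, beta), using the formulas stated above
mom_unif <- function(x) {
  n <- length(x)
  m1 <- sum(x) / n                 # first sample moment
  m2 <- sum(x^2) / n               # second sample moment
  k <- sqrt(3) * sqrt(m2 - m1^2)   # half-width of the fitted interval
  c(alpha = m1 - k, beta = m1 + k)
}

x_demo <- c(2, 4, 6, 8)   # hypothetical sample, for illustration only
est <- mom_unif(x_demo)
est

# The fitted uniform distribution reproduces the first two sample moments:
fit_mean <- (est[["alpha"]] + est[["beta"]]) / 2      # mean of Unif(alpha, beta)
fit_var <- (est[["beta"]] - est[["alpha"]])^2 / 12    # variance of Unif(alpha, beta)
c(fit_mean = fit_mean, m1 = mean(x_demo),
  fit_var = fit_var, m2_minus_m1sq = mean(x_demo^2) - mean(x_demo)^2)
```

The last line compares the mean and variance of the fitted uniform distribution with $M_1$ and $M_2 - M_1^2$; the pairs should match, which is the defining property of the MoM estimates.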


#### ggg Solution to Question 6a zzz

Verify your solution to [Question 4] using the formulas for the MoM estimates for $\alpha$ and $\beta$ in [Question 6a].

-   Complete and run the partially completed R code cell below.


#### ggg Solution to Question 6b zzz

The code below generates sampling distributions for MoM estimates for the parameters $\alpha$ and $\beta$ for random variable $X \sim \mbox{Unif}(\alpha, \beta)$ using sample size $n=4$.

- A total of 1,000 random samples each of size $n$ are generated in the for loop.
- The $\hat{\alpha}_{\rm{MoM}}$ values are stored in the vector `mom.alpha`.
- The $\hat{\beta}_{\rm{MoM}}$ values are stored in the vector `mom.beta`.


```{r}
#| eval: true
#############################
# do not edit
# run the code cell as is
#############################
n <- 4  # sample size

mom.alpha <- numeric(1000)
mom.beta <- numeric(1000)

for (i in 1:1000)
{
  x.temp <- runif(n, 0, 11)  # generate random sample
  m1 <- sum(x.temp)/n  # first sample moment
  m2 <- sum(x.temp^2)/n  # second sample moment
  k <- sqrt(3) * sqrt(m2 - m1^2)  # compute sqrt(3)*sqrt(m2 - m1^2)
  mom.alpha[i] <- m1 - k  # enter formula for MoM estimate for alpha
  mom.beta[i] <- m1 + k  # enter formula for MoM estimate for beta
}
```

The distribution of $\hat{\alpha}_{\rm{MoM}}$ values generated by the code above is plotted in the histogram below.

- A <span style="color:dodgerblue">blue vertical line</span> is drawn at the actual value of $\color{dodgerblue}{\alpha=0}$.
- A <span style="color:tomato">red vertical line</span> is drawn at the value of
$\color{tomato}{\hat{\alpha}_{\rm{MoM}}=-0.797}$ we found for the sample in [Question 4].


```{r}
#| eval: true
#############################
# do not edit
# run the code cell as is
#############################
hist(mom.alpha, 
     breaks = 20,
     xlab = "MoM for alpha",
     main = "Dist. of MoM's for alpha")
abline(v = 0, col = "dodgerblue", lwd = 2)  # plot at actual value of alpha
abline(v = -0.797, col = "tomato", lwd = 2)  # plot at estimated value of alpha
```

The distribution of $\hat{\beta}_{\rm{MoM}}$ values generated by the code above is plotted in the histogram below.

- A <span style="color:dodgerblue">blue vertical line</span> is drawn at the actual value of $\color{dodgerblue}{\beta=11}$.
- A <span style="color:tomato">red vertical line</span> is drawn at the value of
$\color{tomato}{\hat{\beta}_{\rm{MoM}}=11.297}$ we found for the sample in [Question 4].


```{r}
#| eval: true
#############################
# do not edit
# run the code cell as is
#############################
hist(mom.beta, 
     breaks = 20,
     xlab = "MoM for beta",
     main = "Dist. of MoM's for beta")
abline(v = 11, col = "dodgerblue", lwd = 2)  # plot at actual value of beta
abline(v = 11.297, col = "tomato", lwd = 2)  # plot at estimated value of beta
```
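The simulation above fixes the sample size at $n=4$. To try other sample sizes, the same steps could be wrapped in a function. The sketch below is one possible way to do that; the name `sim_mom_unif` and its arguments are hypothetical, and the defaults `a = 0` and `b = 11` simply mirror the values used in the loop above.

```{r}
# Simulate MoM estimates for Unif(a, b) from `reps` samples of size `n`.
# Returns a matrix with one row per sample and columns "alpha" and "beta".
sim_mom_unif <- function(n, reps = 1000, a = 0, b = 11) {
  est <- replicate(reps, {
    x <- runif(n, a, b)              # generate random sample
    m1 <- mean(x)                    # first sample moment
    m2 <- mean(x^2)                  # second sample moment
    k <- sqrt(3) * sqrt(m2 - m1^2)   # sqrt(3) * sqrt(m2 - m1^2)
    c(alpha = m1 - k, beta = m1 + k)
  })
  t(est)   # transpose so each row is one simulated sample
}

set.seed(1)                  # arbitrary seed, for reproducibility only
sims <- sim_mom_unif(n = 25)
dim(sims)
head(sims, 3)
```

The columns `sims[, "alpha"]` and `sims[, "beta"]` can then be passed to `hist()` in the same way as `mom.alpha` and `mom.beta`.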


### ggg Question 7a zzz

<br>  
<br>  
<br>  


### ggg Question 7b zzz

```{r}
# check whether or not each estimator is biased


```


<br>  
<br>  


### ggg Question 7c zzz

```{r}
# check which estimator is more efficient


```

<br>  
<br>  


### ggg Question 7d zzz

#### ggg Experiment with different sample sizes, $n$. zzz

- As $n$ gets larger, does the bias of each estimator appear to increase, decrease, or stay the same?

```{r}
# check for change to bias


```


<br>  
<br>  


##### ggg Exploring Efficiency of Each Estimator zzz

- Does the shape of each distribution change as $n$ gets larger?


```{r}
# check for change in shape


```

<br>  
<br>  
<br>  


---

![Creative Commons License](https://i.creativecommons.org/l/by-nc-sa/4.0/88x31.png) <nbsp>

*Statistical Methods: Exploring the Uncertain* by [Adam Spiegler](https://github.com/CU-Denver-MathStats-OER/Statistical-Theory) is licensed under a [Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License](http://creativecommons.org/licenses/by-nc-sa/4.0/).
