<table align="left" style="border-style: hidden" class="table"> <tr><td class="col-md-2"><img style="float" src="../icon.png" alt="Prob140 Logo" style="width: 120px;"/></td><td><div align="left"><h3 style="margin-top: 0;">Probability for Data Science</h3><h4 style="margin-top: 20px;">UC Berkeley, Spring 2023</h4><p>Ani Adhikari</p>CC BY-NC-SA 4.0</div></td></tr></table><!-- not in pdf -->

This content is protected and may not be shared, uploaded, or distributed.

In [None]:
# Run this cell to set up your notebook

# These lines make warnings go away
import warnings
warnings.filterwarnings('ignore')

import numpy as np
from scipy import stats
from datascience import *
from prob140 import *

# These lines do some fancy plotting magic
import matplotlib
%matplotlib inline
import matplotlib.pyplot as plt
plt.style.use('fivethirtyeight')

# Homework 10

### Instructions

Your homeworks will generally have two components: a written portion and a portion that also involves code. Written work should be completed on paper, and coding questions should be done in the notebook. Start the work for the written portions of each section on a new page. You are welcome to $\LaTeX$ your answers to the written portions, but staff will not be able to assist you with $\LaTeX$-related issues.

It is your responsibility to ensure that both components of the homework are submitted completely and properly to Gradescope. **Make sure to assign each page of your PDF to the correct question. Refer to the bottom of the notebook for submission instructions.**

Every answer should contain a calculation or reasoning. For example, a calculation such as $(1/3)(0.8) + (2/3)(0.7)$ or `sum([(1/3)*0.8, (2/3)*0.7])` is fine without further explanation or simplification. If we want you to simplify, we'll ask you to. But just ${5 \choose 2}$ by itself is not fine; write "we want any 2 out of the 5 frogs and they can appear in any order" or whatever reasoning you used. Reasoning can be brief and abbreviated, e.g. "product rule" or "not mutually exclusive."

## 1. Functions of Uniform Random Variables ##
Let $X$ and $Y$ have joint density

$$f(x, y) = 
\begin{cases}
90(y-x)^8, ~~~~ 0 < x < y < 1 \\
0 ~~~~~~~~~~~~~~~~~~~~~ \text{otherwise}
\end{cases}$$ 

In what follows, please do the calculus yourself and **show all your work**. No `SymPy`.
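
Before starting the calculus, it can be reassuring to confirm numerically that $f$ is a valid joint density. The sketch below is only a sanity check and not a substitute for the written work. It assumes `scipy.integrate.dblquad` is available; the helper name `joint_density` is introduced here purely for illustration.

```python
from scipy import integrate

def joint_density(y, x):
    """Joint density f(x, y) = 90 (y - x)^8 on 0 < x < y < 1.

    dblquad passes the inner variable first, so the argument order is (y, x).
    """
    return 90 * (y - x) ** 8

# Outer integral over x from 0 to 1; inner integral over y from x to 1
total, abs_error = integrate.dblquad(joint_density, 0, 1, lambda x: x, lambda x: 1)
print(total)
```

The printed value should be very close to 1, up to numerical integration error.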

**a)** Find $P(Y > 2X)$.

**b)** Find the marginal density of $X$ and remember to provide the possible values. Recognize the density as a member of a famous family and state its name and parameters.

**c)** Fill in the blanks (**prove your answer**):
The joint density $f$ above is the joint density of the $\underline{~~~~~~~~~~~~~~~~}$ and $\underline{~~~~~~~~~~~~~~~~}$ of ten independent uniform $(0, 1)$ random variables.

[Follow the steps in [this calculation](http://prob140.org/textbook/content/Chapter_17/04_Beta_Densities_with_Integer_Parameters.html#joint-density-of-two-order-statistics) in the textbook.]

\newpage

## 2. Peter Meets Paul ##
Peter and Paul agree to meet at a restaurant at noon. Peter arrives at a time normally distributed with mean 12:00 noon and SD 3 minutes. Paul arrives at a time normally distributed with mean 12:02 P.M. and SD 4 minutes.

Find the chances below assuming that the two arrival times are independent. 

- First, write a formula for the chance in terms of the standard normal cdf $\Phi$. 
- Then use a code cell to find the numerical value. You do not have to turn in any coding work for this question.

**a)** $P$(Peter arrives before Paul)

**b)** $P$(both men arrive within 3 minutes of noon)

**c)** $P$(the two men arrive within 3 minutes of each other)
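
As a reminder of how a formula in terms of $\Phi$ turns into a number, here is a minimal sketch using `stats.norm.cdf`. The random variable $W$ and its parameters are hypothetical and unrelated to the arrival times in this question.

```python
from scipy import stats

# Hypothetical example, unrelated to the arrival times above:
# W is normal with mean 5 and SD 2, and we want P(W < 6).
mean_w, sd_w = 5, 2

# Standardize first: P(W < 6) = Phi((6 - 5) / 2) = Phi(0.5)
via_phi = stats.norm.cdf((6 - mean_w) / sd_w)

# Equivalent call that lets SciPy do the standardizing
direct = stats.norm.cdf(6, loc=mean_w, scale=sd_w)

print(via_phi, direct)
```

Both calls return the same value; the first mirrors the written formula in terms of $\Phi$ most directly.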



In [None]:
# Calculation for a
...

In [None]:
# Calculation for b
...

In [None]:
# Calculation for c
...

\newpage

## 3. Rayleigh Facts and an Application ##

Let $R$ have the Rayleigh density given by $f(r) ~ = ~ re^{-\frac{1}{2}r^2}$ for $r > 0$.

Refer to [Section 18.1](http://prob140.org/textbook/content/Chapter_18/01_Standard_Normal_Basics.html#variance) and [Section 18.4](http://prob140.org/textbook/content/Chapter_18/04_Chi_Squared_Distributions.html#from-chi-squared-1-to-chi-squared-n) for ways in which this density arises.

The point of Parts **a** and **b** is for you *not to reinvent wheels*, and also for you to notice that math calculation can be reduced by using probability facts. The probabilistic approach to math is powerful, so please follow the approach in the instructions. We will not give credit for other approaches. 

**a)** Write $E(R)$ as an integral but don't find its value yet. Let $Z$ be standard normal. You know the numerical value of the variance of $Z$. Write the variance of $Z$ as an integral, compare with your integral for $E(R)$, and use the comparison to find the value of $E(R)$. 

**b)** Use properties of $R^2$ to find $E(R^2)$ without any further integration, and hence find $Var(R)$.

**c)** Suppose two shots are fired at a target. Assume each shot hits with independent normally distributed coordinates, with the same means and equal unit variances. Let $D$ be the distance between the points where the two shots strike. Find $E(D)$ and $Var(D)$.

[Your calculations will go faster if you remember that a normal $(0, \sigma^2)$ variable can be written as $\sigma Z$ where $Z$ is standard normal, and if you use Parts **a-b**.]

\newpage

## 4. Poisson MGF ##
Let $X$ have the Poisson $(\mu)$ distribution, and let $Y$, independent of $X$, have the Poisson $(\lambda)$ distribution.

**a)** Find the mgf of $X$.

**b)** Use the result of (a) to show that the distribution of $X+Y$ is Poisson.
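
Once the mgf argument is written, a numerical sanity check of the conclusion is possible. The sketch below is not a proof and does not replace the written work. It convolves two Poisson pmfs and compares the result with the Poisson $(\mu + \lambda)$ pmf. The parameter values and the cutoff at 40 are arbitrary choices made for illustration.

```python
import numpy as np
from scipy import stats

# Illustrative parameter values (assumptions, not part of the exercise)
mu, lam = 2.0, 3.5

# Possible values 0, 1, ..., 40
k = np.arange(41)
pmf_x = stats.poisson.pmf(k, mu)
pmf_y = stats.poisson.pmf(k, lam)

# P(X + Y = n) = sum over j of P(X = j) P(Y = n - j), for n = 0, ..., 40.
# Only indices j <= n enter each term, so these entries are unaffected by the cutoff.
pmf_sum = np.convolve(pmf_x, pmf_y)[:len(k)]

# Compare with the Poisson (mu + lam) pmf
matches = np.allclose(pmf_sum, stats.poisson.pmf(k, mu + lam))
print(matches)
```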

\newpage

## 5. Gamma Tail Bound ##

Before you do this exercise, carefully study a [relevant example](http://prob140.org/textbook/content/Chapter_19/04_Chernoff_Bound.html#application-to-the-normal-distribution) in the textbook. You will have to follow similar steps.

You will need the [mgf of the gamma distribution](http://prob140.org/textbook/content/Chapter_19/02_Moment_Generating_Functions.html#mgf-of-a-gamma-r-lambda-random-variable). Also remember that you found the gamma mean and variance in Homework 9.

Let $X$ have the gamma $(r, \lambda)$ distribution. 

**a)** Show that $P(X \ge 2E(X)) \le \left(\frac{2}{e}\right)^r$.

**b)** Find Markov's and Chebyshev's bounds on $P(X \ge 2E(X))$. 

**c) [CODE]** Fix $\lambda = 1$. Display overlaid plots of the following four graphs as functions of $r$, for $r$ in the interval $(0.5, 15)$:

- The exact tail probability $P(X \ge 2E(X))$
- The bound in Part **a**: $\left(\frac{2}{e}\right)^r$
- Chebyshev's bound on $P(X \ge 2E(X))$
- Markov's bound on $P(X \ge 2E(X))$

The code uses `plt.plot` which you have used before. The expression `stats.gamma.cdf(x, r, scale=1)` evaluates to the cdf of the gamma $(r, 1)$ distribution at the point $x$.
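
To get familiar with `stats.gamma.cdf` before using it in the plot, here is a small sketch with illustrative values (the point `x` and the array `shapes` are assumptions chosen for the example). It relies on the fact that the gamma $(1, 1)$ distribution is exponential $(1)$. In SciPy's parametrization, `scale` is the reciprocal of the rate, so a rate of $\lambda$ corresponds to `scale=1/lambda`.

```python
import numpy as np
from scipy import stats

# Gamma (1, 1) is exponential (1), so its cdf at x is 1 - e^{-x}
x = 2.0
cdf_value = stats.gamma.cdf(x, 1, scale=1)
print(cdf_value, 1 - np.exp(-x))

# The shape argument can be an array, which gives one cdf value per shape
shapes = np.array([1.0, 2.0])
cdf_values = stats.gamma.cdf(3.0, shapes, scale=1)

# The right-hand tail P(X > x) is the complement of the cdf
tail_values = 1 - cdf_values
print(tail_values)
```

Because the shape argument broadcasts over arrays, one call can evaluate the cdf for a whole range of shape parameters at once.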

In [None]:
# Answer to c
r = np.arange(0.05, 15, 0.1) 

markov_bound = ...

chebyshev_bound = ...

part_a_bound = ...

# Use as many lines as you need for the exact values
exact = ...
...

plt.plot(r, exact, lw=2, label='Exact Chance')
plt.plot(r, part_a_bound, lw=2, label='Part (a) Bound')
plt.plot(r, chebyshev_bound, lw=2, label='Chebyshev Bound')
plt.plot(r, markov_bound, lw=2, label='Markov Bound')
plt.legend()
plt.xlabel('$r$')
plt.ylim(0, 1)
plt.xlim(0, 15)
# Raw string so that the backslash in \geq is passed through to matplotlib unchanged
plt.title(r'$P(X \geq 2E(X))$ for $X$ gamma $(r, 1)$');

## Submission Instructions ##

Many assignments throughout the course will have a written portion and a code portion. Please follow the directions below to properly submit both portions.

### Written Portion ###
* Scan all the pages into a PDF. You can use any scanner or a phone using applications such as CamScanner. Please **DO NOT** simply take pictures using your phone.
* Please start a new page for each question. If you have already written multiple questions on the same page, you can crop the image in CamScanner or fold your page over (the old-fashioned way). This helps expedite grading.
* It is your responsibility to check that all the work on all the scanned pages is legible.
* If you used $\LaTeX$ to do the written portions, you do not need to do any scanning; you can just download the whole notebook as a PDF via LaTeX.

### Code Portion ###
* Save your notebook using `File > Save and Checkpoint`.
* Generate a PDF file using `File > Download As > PDF via LaTeX`. This might take a few seconds and will automatically download a PDF version of this notebook.
    * If you have issues, please post a follow-up on the general Homework 10 Ed thread.
    
### Submitting ###
* Combine the PDFs from the written and code portions into one PDF. [Here](https://smallpdf.com/merge-pdf) is a useful tool for doing so. 
* Submit the assignment to Homework 10 on Gradescope. 
* **Make sure to assign each page of your PDF to the correct question.**
* **It is your responsibility to verify that all of your work shows up in your final PDF submission.**

If you are having difficulties scanning, uploading, or submitting your work, please read the [Ed Thread](https://edstem.org/us/courses/35049/discussion/2398718) on this topic and post a follow-up on the general Homework 10 Ed thread.

## **We will not grade assignments which do not have pages selected for each question.** ##