# Matthew Joel
# Common Statistical Distributions in R

---

# Telephone Line Distribution

------------------------------------------------------------------------

Suppose a mail-order computer business has six telephone lines. Let $X$ denote the number of lines in use at a given time, and suppose the probability mass function of $X$ (stored as `pdf` in the code) is given in the accompanying table.

| $x$      | 0      | 1      | 2      | 3      | 4      | 5      | 6      |
|----------|--------|--------|--------|--------|--------|--------|--------|
| $P(X=x)$ | $0.05$ | $0.15$ | $0.20$ | $0.30$ | $0.15$ | $0.10$ | $0.05$ |




------------------------------------------------------------------------

The probability that:

<br>

i.  At most three lines are in use: $P(X \le 3) = 0.70$

<br>

ii. Fewer than three lines are in use: $P(X < 3) = P(X \le 2) = 0.40$

<br>

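iii. At least three lines are NOT in use: 0.7 (at least three of the six lines idle means $6 - X \ge 3$, i.e. $X \le 3$)




In [None]:
# Sanity check: the probabilities sum to 1
#0.05+0.15+0.20+0.30+0.15+0.10+0.05
# i. P(X <= 3)
0.05+0.15+0.20+0.30
# ii. P(X < 3)
0.05+0.15+0.20
# iii. P(6 - X >= 3) = P(X <= 3) = 1 - P(X >= 4)
1 - (0.15+0.10+0.05)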
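
As a hedged alternative to adding table entries by hand, the same event probabilities can be read off with logical indexing. The names `x`, `p`, and the three result variables are illustrative choices, not part of the original solution.

```r
# Table values: number of lines in use and their probabilities
x <- 0:6
p <- c(0.05, 0.15, 0.20, 0.30, 0.15, 0.10, 0.05)

# A valid distribution must sum to 1
stopifnot(isTRUE(all.equal(sum(p), 1)))

p_at_most_3   <- sum(p[x <= 3])      # at most three lines in use
p_fewer_3     <- sum(p[x < 3])       # fewer than three lines in use
p_idle_3_plus <- sum(p[6 - x >= 3])  # at least three of the six lines idle

c(p_at_most_3, p_fewer_3, p_idle_3_plus)
```

Writing the event as a condition on `x` keeps the translation from words to inequality visible, which is where most mistakes in this kind of problem come from.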

------------------------------------------------------------------------

The expected value, $E(X)=\mu$, of $X$, can be found as follows:

<br>

In [None]:
# Direct computation: sum of x * P(X = x)
0*0.05 + 1*0.15 + 2*0.20 + 3*0.30 + 4*0.15 + 5*0.10 + 6*0.05

# Same computation, vectorized
x <- c(0, 1, 2, 3, 4, 5, 6)
pdf <- c(0.05, 0.15, 0.20, 0.30, 0.15, 0.10, 0.05)
ex <- sum(x * pdf)
ex

------------------------------------------------------------------------

The variance, $\mbox{Var}(X)=\sigma^2$, of $X$ can be found with the shortcut formula $\mbox{Var}(X) = E(X^2) - [E(X)]^2$:

<br>

In [None]:
ex_squared <- sum(x^2 * pdf)
v_x <- ex_squared - ex^2
v_x
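
The shortcut used in the cell above should agree with the definition $\mbox{Var}(X) = \sum_x (x-\mu)^2 P(X=x)$. A small self-contained check, using the illustrative names `mu`, `var_def`, and `var_short`:

```r
x <- 0:6
p <- c(0.05, 0.15, 0.20, 0.30, 0.15, 0.10, 0.05)

mu        <- sum(x * p)             # expected value
var_def   <- sum((x - mu)^2 * p)    # definition of the variance
var_short <- sum(x^2 * p) - mu^2    # shortcut: E(X^2) - mu^2

all.equal(var_def, var_short)       # the two formulas agree
sqrt(var_def)                       # standard deviation
```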

# Playing Insurance Actuary
------------------------------------------------------------------------

Let $X$ be the damage incurred (in dollars) in a certain type of accident during a given year.
Possible $X$ values are losses of $\$0$, $\$1000$, $\$5000$, and $\$10,000$, with probabilities $0.75$, $0.12$, $0.10$, and $0.03$, respectively.
A particular company offers a $\$500$ deductible policy. This means:

-   When no claim is filed (a loss of $\$0$), no money is paid by either the driver or the insurance company.
-   If a loss of $\$1000$ is reported, then the driver pays a $\$500$ deductible and the insurance company pays the remaining $\$500$.
-   If a loss of $\$5000$ is reported, then the driver pays a $\$500$ deductible and the insurance company pays the remaining $\$4500$.
-   If a loss of $\$10,000$ is reported, then the driver pays a $\$500$ deductible and the insurance company pays the remaining $\$9500$.

If the company wishes to make an expected profit of $\$100$ on each policy, it should charge a premium of $\$895$: the expected payout of $\$795$ plus the $\$100$ profit. Here's why:

------------------------------------------------------------------------

In [None]:
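x <- c(0, 1000, 5000, 10000)      # damage incurred
loss <- c(0, 500, 4500, 9500)     # amount the company pays after the $500 deductible
pdf <- c(0.75, 0.12, 0.1, .03)    # probability of each damage amount
# Check that the probabilities sum to 1
#sum(pdf)
ex <- sum(loss * pdf)             # expected payout per policy
ex + 100                          # premium for a $100 expected profit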
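
To see how the premium responds to the deductible, the calculation can be wrapped in a small function. This is only a sketch: `premium()` and its arguments are hypothetical names, and it assumes the insurer pays the damage minus the deductible, never less than zero.

```r
# Hypothetical helper: premium = expected insurer payout + target profit
premium <- function(damage, prob, deductible, profit) {
  payout <- pmax(damage - deductible, 0)  # insurer pays nothing below the deductible
  sum(payout * prob) + profit
}

damage <- c(0, 1000, 5000, 10000)
prob   <- c(0.75, 0.12, 0.10, 0.03)

premium(damage, prob, deductible = 500, profit = 100)   # the policy described above
premium(damage, prob, deductible = 1000, profit = 100)  # a higher deductible lowers the premium
```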

# Modeling the California Stop

------------------------------------------------------------------------

Suppose that only 30% of all drivers come to a complete stop at an intersection having flashing red lights in all directions when no other cars are visible.



------------------------------------------------------------------------
Let's find the probability that, of 20 randomly chosen drivers coming to an intersection under these conditions:

<br>

i.  Exactly 7 will come to a complete stop: 0.164261985217237

<br>

ii. At most 7 will come to a complete stop: 0.772271797418161

<br>

iii. At least 7 will come to a complete stop: 0.391990187799076

<br>

### Here's Why

------------------------------------------------------------------------

In [None]:
dbinom(7, size = 20, prob = 0.30)

pbinom(7, size = 20, prob = 0.30)

1 - pbinom(6, size = 20, prob = 0.3)
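
The three calls correspond to the binomial pmf $P(X=k)=\binom{n}{k}p^k(1-p)^{n-k}$ and sums of it. As a cross-check, the sketch below recomputes them from the formula; `n`, `p`, and `pmf` are illustrative names.

```r
n <- 20
p <- 0.30

# Binomial pmf written out by hand
pmf <- function(k) choose(n, k) * p^k * (1 - p)^(n - k)

exactly_7  <- pmf(7)          # same as dbinom(7, n, p)
at_most_7  <- sum(pmf(0:7))   # same as pbinom(7, n, p)
at_least_7 <- sum(pmf(7:n))   # same as 1 - pbinom(6, n, p)

# pbinom can also return the upper tail P(X > 6) directly
pbinom(6, size = n, prob = p, lower.tail = FALSE)
```

Note the off-by-one in the last case: "at least 7" is the complement of "at most 6", so the cutoff passed to `pbinom` is 6, not 7.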

------------------------------------------------------------------------

Let's also find the expected value for the number of drivers (out of 20) that will come to a complete stop.

------------------------------------------------------------------------
The expected value is $np = 6$ drivers because:
<br>  



In [None]:
#n*p
20*.3

------------------------------------------------------------------------
The variance of the number of drivers (out of 20) that will come to a complete stop is $np(1-p) = 20(0.30)(0.70) = 4.2$.

------------------------------------------------------------------------

In [None]:
#np (1 − p)
20 * .3 * (1-.3)
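
The formulas $np$ and $np(1-p)$ can also be checked empirically. A rough simulation sketch follows; the seed and the number of replications are arbitrary choices.

```r
set.seed(1)                                      # arbitrary seed, for reproducibility
stops <- rbinom(100000, size = 20, prob = 0.30)  # simulated counts of complete stops

mean(stops)  # should be close to n * p
var(stops)   # should be close to n * p * (1 - p)
```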

# Airport Poisson Distribution

------------------------------------------------------------------------

Suppose small aircraft arrive at an airport according to a Poisson distribution with rate $\lambda = 7$ aircraft per hour, so that the number of arrivals during a time period of $t$ hours is a Poisson random variable with parameter $\mu(t) = 7t$.



The probability that:

<br>

i.  Exactly 5 small aircraft arrive during a **1 hour period**?

0.12771666829229

<br>  

ii. At most 5 small aircraft arrive during a **2 hour period**?

0.00553204969770016

<br>  

iii. At least 7 small aircraft arrive during a **1 hour period**?

0.550288944151301

<br>  

iv. At least 25 small aircraft arrive during a **3 hour period**?

0.217844981186114
<br>  

### Here's the work in R


In [None]:
# Probability that exactly 5 small aircraft arrive during a 1 hour period?
dpois(5,7)

# Probability that at most 5 small aircraft arrive during a 2 hour period?
ppois(5, 7*2)


# Probability that at least 7 small aircraft arrive during a 1 hour period?
1 - ppois(6, 7)


# Probability that at least 25 small aircraft arrive during a 3 hour period?
1 - ppois(24, 7*3)
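
Each call scales the hourly rate by the length of the period, $\mu(t) = 7t$. A hedged sketch that makes the scaling explicit and uses `lower.tail = FALSE` for the upper tails; `rate` and `mu` are illustrative names.

```r
rate <- 7                      # arrivals per hour
mu   <- function(t) rate * t   # Poisson parameter for a period of t hours

# Poisson pmf written out: exp(-mu) * mu^k / k!
by_hand <- exp(-mu(1)) * mu(1)^5 / factorial(5)   # exactly 5 in 1 hour
by_hand

ppois(5, mu(2))                                   # at most 5 in 2 hours

# Upper tails without the "1 - ..." subtraction
ppois(6,  mu(1), lower.tail = FALSE)              # at least 7 in 1 hour
ppois(24, mu(3), lower.tail = FALSE)              # at least 25 in 3 hours
```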

------------------------------------------------------------------------
The expected value and standard deviation of the number of small aircraft that arrive during a 90 min period:

------------------------------------------------------------------------
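For a Poisson random variable, $E(X) = \mbox{Var}(X) = \mu(t) = 7t$.
<br>

With $t = 1.5$ hours: $E(X) = \mbox{Var}(X) = 10.5$, so the standard deviation is $\sqrt{10.5} \approx 3.24$.
<br>


In [None]:
# 90 minutes = 1.5 hours, so mu = lambda * t = 7 * 1.5
7 * 1.5
# Standard deviation = sqrt(variance) = sqrt(mu)
sqrt(7 * 1.5)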

# Geometric Carnival Game

------------------------------------------------------------------------

A carnival game consists of spinning a wheel with $10$ slots, eight red and two blue. If the wheel lands on a blue slot, you win a prize. Suppose your younger brother really wants the prize, so you will play until you win.



The probability you'll win on the first spin? 0.2
<br>

------------------------------------------------------------------------

In [None]:
# dgeom(x, p) takes x = number of failures before the first success,
# so winning on the first spin means x = 0
dgeom(0, 2/10)

------------------------------------------------------------------------

The probability you'll require exactly 4 spins to win? 0.1024

In [None]:
# Exactly 4 spins = 3 losses followed by a win, so x = 3 failures
dgeom(3, 2/10)

------------------------------------------------------------------------

The probability you'll require at most 4 spins? 0.5904

In [None]:
# At most 4 spins = at most 3 failures before the first win
pgeom(3, 2/10)
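
R's geometric functions count the number of failures before the first success, not the number of trials, so a question about "k spins" corresponds to k - 1 failures. A hedged pair of wrappers makes the conversion explicit; `spins_pmf` and `spins_cdf` are hypothetical names introduced here.

```r
p <- 2/10  # probability of landing on blue

# P(first win on spin k): k - 1 losses followed by one win
spins_pmf <- function(k, p) dgeom(k - 1, p)
# P(first win within k spins)
spins_cdf <- function(k, p) pgeom(k - 1, p)

spins_pmf(1, p)   # win on the first spin
spins_pmf(4, p)   # exactly 4 spins
(1 - p)^3 * p     # same value from the formula (1 - p)^(k - 1) * p
spins_cdf(4, p)   # at most 4 spins
1 - (1 - p)^4     # same value: 1 minus P(four losses in a row)
```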

------------------------------------------------------------------------
But, you may ask, how long should you expect to play before winning (for your little brother, of course)? The expected number of spins required to win the prize, and the corresponding variance, are:
$E(X) = 1/p = 5, \quad \mbox{Var}(X) = (1-p)/p^2 = 20$
<br>

In [None]:
# Expected number of spins: E(X) = 1/p
1/(2/10)
# Variance: Var(X) = (1 - p)/p^2
(1-2/10)/(2/10)^2
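
The closed forms $1/p$ and $(1-p)/p^2$ can be checked numerically by summing over the number of spins. This is a sketch under one assumption: the sum is truncated at 1000 spins, where the remaining tail probability is negligible. The names `k`, `pk`, `ex`, and `vx` are illustrative.

```r
p  <- 2/10
k  <- 1:1000                   # number of spins; tail beyond 1000 is negligible here
pk <- (1 - p)^(k - 1) * p      # P(first win on spin k)

ex <- sum(k * pk)              # approximates 1/p
vx <- sum((k - ex)^2 * pk)     # approximates (1 - p)/p^2
c(ex, vx)
```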