In [17]:
library(ggplot2)

Suppose we have a bag with 10 balls of which 4 are black and 6 are white. Suppose we pick 2 balls randomly from the bag *with replacement*. What is the probability that both are black?

Let $X$ denote the number of black balls. Then $$X\sim\text{Bin}\left(\underbrace{n=2, p=4/(4+6)}_{\text{parameters of the distribution}}\right).$$
If we want the probability of getting $j$ successes, where $j=0,1,2,\ldots,n$, then $$P(X=j) = \binom{n}{j}p^j(1-p)^{n-j},$$ which can be calculated using the function call $$\texttt{dbinom(j, size=n, prob=p)}$$
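
As a quick check on the formula, the sketch below evaluates the whole PMF for the bag example ($n=2$, $p=4/10$) and confirms that the probabilities sum to 1. The name `pmf` is only illustrative.

```r
# Full PMF of X ~ Bin(n = 2, p = 0.4): one probability per possible count of black balls
n = 2
p = 4 / 10
j = 0:n
pmf = dbinom(j, size = n, prob = p)
names(pmf) = j
print(pmf)
# The probabilities over all possible outcomes add up to 1
print(sum(pmf))
```

The entry for $j=2$ is the answer to the question above; the other two entries cover zero and one black ball.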

In [None]:
# Number of trials (draws)
n = 2
# Success probability
p = 4 / 10
# Desired number of successes
j = 2
choose(n,j)*p^j*(1-p)^(n-j)
dbinom(j, n, p)

# Another way of looking at the parameters
# Number of successes 
nsuccess = 4
# Number of failures 
nfailure = 6
# Draw size (number of trials)
n = 2
# Desired number of successes
j = 2
dbinom(j, n, nsuccess/(nsuccess+nfailure))

Suppose we have a bag with 10 balls of which 4 are black and 6 are white. Suppose we pick 2 balls randomly from the bag *without replacement*. What is the probability that both are black?

Let $X$ denote the number of black balls. Then $$X\sim\text{Hyper}(\text{nsuccess} = 4, \text{nfailure} = 6, \text{draw size} = 2).$$
If we want the probability of getting $j$ successes, where $j=0,1,2,\ldots,n$, then $$P(X=j) = \frac{\binom{\text{nsuccess}}{j}\binom{\text{nfailure}}{n-j}}{\binom{\text{nsuccess+nfailure}}{n}},$$ which can be calculated using the function call $$\texttt{dhyper(j, nsuccess, nfailure, n)}$$
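
To see how much replacement matters for this bag, the following sketch puts the two PMFs side by side. The column names `with_repl` and `without_repl` are illustrative, not part of the original notebook.

```r
# Same bag (4 black, 6 white), same draw size, two sampling schemes
nsuccess = 4
nfailure = 6
n = 2
j = 0:n
# With replacement: binomial with p = nsuccess / (nsuccess + nfailure)
with_repl = dbinom(j, n, nsuccess / (nsuccess + nfailure))
# Without replacement: hypergeometric
without_repl = dhyper(j, nsuccess, nfailure, n)
comparison = data.frame(j, with_repl, without_repl)
print(comparison)
```

Drawing two black balls is less likely without replacement, because the first black ball removed leaves fewer black balls for the second draw.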

In [None]:
# Draw size
n = 2
# Number of successes 
nsuccess = 4
# Number of failures 
nfailure = 6
# Desired number of successes
j = 2
(choose(nsuccess, j) * choose(nfailure, n-j))/(choose(nsuccess+nfailure, n))
dhyper(j, nsuccess, nfailure, n)

Type O blood is among the most useful to donate because it can be given to many recipients. Approximately 42% of people have type O blood. On a given day, about 120 people come in to a blood bank to donate. The blood bank is interested in the probability that exactly 30 of these donors are type O.

In [None]:
# Number of trials
n = 120
# Success probability
p = 0.42
# Desired number of successes
j = 30
dbinom(j, n, p)

A certain stoplight, when approached from the North, is green approximately 31% of the time. Over the next few days, someone comes to this light 8 times from the North. We are interested in the probability that the person comes across a green light exactly 5 times.

$$X\sim\text{Bin}(n=8, p=0.31)$$

and we need $$P(X = 5)$$

In [None]:
# Number of trials
n = 8
# Success probability
p = 0.31
# Desired number of successes
j = 5
dbinom(j, n, p)

A certain stoplight, when approached from the North, is green approximately 31% of the time. Over the next few days, someone comes to this light 8 times from the North. We are interested in the probability that the person comes across a green light at most 5 times.

$$X\sim\text{Bin}(n=8, p=0.31)$$

and we need $$P(X \leq 5) = P(X=0)+P(X=1)+P(X=2)+P(X=3)+P(X=4)+P(X=5)$$

In [None]:
# Number of trials
n = 8
# Success probability
p = 0.31
# Success counts 0 through 5 (at most 5 successes)
j = 0:5
sum(dbinom(j, n, p))
pbinom(5, n, p)

Probability Mass Function (PMF): $P_X(j) = P(X = j)$. We calculate $P_X(5) = P(X=5)$ using the $\texttt{dbinom}$ function.

Cumulative Distribution Function (CDF): $F_X(j) = P(X \leq j)$. We calculate $F_X(5) = P(X\leq5)$ using the $\texttt{pbinom}$ function.
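
The link between the two functions can be checked numerically. The sketch below reuses the stoplight parameters ($n=8$, $p=0.31$); the names `pmf` and `cdf` are illustrative.

```r
n = 8
p = 0.31
j = 0:n
pmf = dbinom(j, n, p)   # P(X = j) for every possible j
cdf = pbinom(j, n, p)   # P(X <= j) for every possible j
# The CDF is the running sum of the PMF
print(all.equal(cumsum(pmf), cdf))
# Conversely, the PMF is the jump in the CDF from j - 1 to j
print(all.equal(diff(c(0, cdf)), pmf))
```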

An assembly line packs its products into boxes of 50. An inspector then randomly picks 3 items from a box and tests them for defects. For a box containing 4 defectives, we are interested in the probability that at least one of the three sampled items is defective.

We need $$ P(X \geq 1),$$ where $$X\sim\text{Hyper}(4, 46, 3).$$

$$P(X \geq 1) = 1 - \underbrace{P(X \leq 0)}_{=F_X(0)}$$
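
Since $X$ cannot be negative, $P(X \leq 0) = P(X = 0)$, so the complement can also be worked out by hand from the hypergeometric formula:

$$P(X \geq 1) = 1 - \frac{\binom{4}{0}\binom{46}{3}}{\binom{50}{3}} = 1 - \frac{15180}{19600} \approx 0.2255.$$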

In [None]:
# Number of draws
n = 3
# Number of successes
nsuccess = 4
# Number of failures 
nfailure = 46 
# Success counts 1 through 3 (at least 1 success)
j = 1:3
sum(dhyper(j, nsuccess, nfailure, n))
1 - phyper(0, nsuccess, nfailure, n) # 1 - P(X <= 0)

In [None]:
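# Number of successes to wait for (r = 1 is the geometric case)
r = 1
# Success probability on each trial
p = 0.02
# Number of failures before the r-th success
j = 5
# P(exactly j failures occur before the r-th success)
dnbinom(j, r, p)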

![Production problem](https://bl3302files.storage.live.com/y4mQmboNuB8pzzTSeQ43MRiv0qHzGJuNApOK4K2tSP3rzOrM9q-geOoOpVpHtISM3kIq6BjF_Nfw4SxOcrQGYnvciSlcXZxpvV_fiw8xW5jHpQr0i5_fmnbXg300oB4JhhNULb4PB27BVND1kJRpTLK8LRMXyf7KBaaULR9Y9q_Rtz6qvi0bYPFZwcEkvCEHM7d?width=640&height=480&cropmode=none)

In [None]:
n = 5
j = c(0:5)
p_ic =  # defect rate when machine is in control
p_oc =  # defect rate when machine is out of control
icp =  # prior in control probability
ocp =  # prior out of control probability
uicp =  # updated/posterior in control probability
uocp =  # updated/posterior out of control probability
print(j)
print(uicp)
print(uocp)
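
One way the template above could be completed is sketched below. Every numeric value is a hypothetical placeholder (hence the `_demo` suffix) and is not taken from the problem statement; the sketch also assumes that the number of defectives among the $n$ inspected items is binomial under each machine state. Substitute the values given in the problem.

```r
# All numeric values here are hypothetical placeholders, not from the problem statement
n = 5                    # items inspected
j = 0:n                  # possible numbers of defectives observed
p_ic_demo = 0.05         # hypothetical defect rate when in control
p_oc_demo = 0.30         # hypothetical defect rate when out of control
icp_demo = 0.90          # hypothetical prior probability of being in control
ocp_demo = 1 - icp_demo  # prior probability of being out of control

# Probability of seeing j defectives under each state (binomial model assumed)
lik_ic = dbinom(j, n, p_ic_demo)
lik_oc = dbinom(j, n, p_oc_demo)

# Bayes' rule: posterior = prior * likelihood / total probability of the observation
evidence = icp_demo * lik_ic + ocp_demo * lik_oc
uicp_demo = icp_demo * lik_ic / evidence
uocp_demo = ocp_demo * lik_oc / evidence
print(data.frame(j, uicp_demo, uocp_demo))
```

With any choice where the out-of-control defect rate exceeds the in-control rate, the posterior in-control probability falls as more defectives are observed.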
