BINOMIAL DISTRIBUTION

A binomial distribution gives the probability of observing a given number of successes in an experiment or survey that is repeated a fixed number of times, where each independent trial ends in either SUCCESS (with probability p) or FAILURE (with probability q = 1 - p). The prefix “bi” means two, or twice, and refers to these two possible outcomes. For example, a coin toss has only two possible outcomes, heads or tails, and taking a test could have two possible outcomes, pass or fail (https://www.statisticshowto.com/probability-and-statistics/binomial-theorem/binomial-distribution-formula/).

$$P(X) = \frac{n!}{X!\,(n-X)!}\,p^X q^{n-X}$$
where:

+ p: the probability of success on a single trial
+ q: the probability of failure on a single trial (q = 1 - p)
+ n: number of trials
+ X: number of successes (X = 0, 1, 2, ..., n)
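
The formula can also be evaluated directly with the standard library, which is a handy cross-check on library results. The sketch below is only an illustration: the helper name `binomial_pmf` is hypothetical and not part of the original notebook, and it assumes independent trials with q = 1 - p.

```python
from math import comb


def binomial_pmf(x: int, n: int, p: float) -> float:
    """P(X = x) for n independent trials with success probability p.

    Hypothetical helper for illustration; mirrors the formula term by term.
    """
    q = 1 - p  # probability of failure
    # comb(n, x) equals n! / (x! * (n - x)!)
    return comb(n, x) * p**x * q**(n - x)


# Example: probability of exactly 2 heads in 3 fair coin tosses
print(binomial_pmf(2, 3, 0.5))  # 3 * 0.25 * 0.5 = 0.375
```

Summing `binomial_pmf(x, n, p)` over x = 0, ..., n should give 1 up to floating-point error, which is a quick sanity check on any choice of n and p.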

Exercise 1: Hospital records show that of patients suffering from a certain disease, 75% die of it. What is the probability that of 6 randomly selected patients, 4 will recover?

In [25]:
from scipy.stats import binom

# n = 6 trials (patients)
# q = 0.75 (dies) => p = 1 - q = 0.25 (recovers)
# X = 4 patients who recover

# Calculate the binomial probability P(X = 4)
binom.pmf(k=4, n=6, p=0.25)

0.03295898437499997
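
As a hand check of this value, substituting n = 6, X = 4, p = 0.25 and q = 0.75 into the formula gives:

$$P(X = 4) = \frac{6!}{4!\,2!}\,(0.25)^4\,(0.75)^2 = 15 \times 0.00390625 \times 0.5625 \approx 0.03296$$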

Exercise 2: A (blindfolded) marksman finds that on average he hits the target 4 times out of 5. If he fires 4 shots, what is the probability of

(a) more than 2 hits?

(b) at least 3 misses?


In [26]:
from scipy.stats import binom

# n = 4 trials (shots)
# p = 4/5 = 0.8 (hit) => q = 0.2 (miss)

# (a) More than 2 hits
# P(X > 2) = 1 - P(X <= 2)

# Calculate the binomial probability
1-binom.cdf(k=2, n=4, p=0.8)

0.8192

In [27]:
from scipy.stats import binom

# n = 4 trials (shots)
# Y = number of misses, so the per-shot "success" probability is q = 0.2

# (b) At least 3 misses
# P(Y >= 3) = 1 - P(Y <= 2)

# Calculate the binomial probability
1-binom.cdf(k=2, n=4, p=0.2)

0.027200000000000002
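
Both answers can be cross-checked by summing the individual binomial terms instead of using the CDF. This is a minimal sketch using only the standard library; the helper `pmf` and the variable names are illustrative and not part of the original notebook.

```python
from math import comb

n = 4            # shots fired
p_hit = 0.8      # probability of a hit on each shot
p_miss = 1 - p_hit


def pmf(x, n, p):
    # Binomial probability of exactly x successes in n trials
    return comb(n, x) * p**x * (1 - p)**(n - x)


# (a) more than 2 hits: P(X = 3) + P(X = 4)
more_than_2_hits = sum(pmf(x, n, p_hit) for x in (3, 4))

# (b) at least 3 misses: P(Y = 3) + P(Y = 4), counting a miss as the "success"
at_least_3_misses = sum(pmf(y, n, p_miss) for y in (3, 4))

print(round(more_than_2_hits, 4), round(at_least_3_misses, 4))  # 0.8192 0.0272
```

Note that "at least 3 misses" is the same event as "at most 1 hit", so part (b) could equally be computed from the hit distribution with p = 0.8.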

Normal Approximation to the Binomial

Exercise 3: Sixty-two percent of 12th graders attend school in a particular urban school district. If a sample of 500 12th-grade children is selected, find the probability that at least 290 are actually enrolled in school.

p = 0.62
q = 0.38
n = 500

Since n*p = 310 > 5 and n*q = 190 > 5, we can use the normal approximation to the binomial.

µ = n*p = 310

σ = sqrt(n*p*q) = 10.854

The probability that at least 290 are actually enrolled in school, applying a continuity correction of 0.5 because the normal distribution is continuous:

P(X >= 290) ≈ 1 - P(X <= 289.5)
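
To see where the final number comes from, the corrected bound can be standardized by hand. This is a sketch using the rounded σ = 10.854, with Φ denoting the standard normal CDF (notation introduced here for illustration):

$$z = \frac{289.5 - 310}{10.854} \approx -1.889, \qquad P(X \ge 290) \approx 1 - \Phi(-1.889) = \Phi(1.889) \approx 0.9705$$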


In [28]:
from scipy.stats import norm
n = 500
p = 0.62
q = 0.38
sd = (n*p*q)**0.5
mean = n*p
X = 289.5
# P(X >= 290) ≈ 1 - P(X <= 289.5), with continuity correction
P_X_AtLeast290 = 1 - norm(loc=mean, scale=sd).cdf(X)

print(P_X_AtLeast290)

0.9705392966006449
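
One way to judge the approximation is to compare it with the exact binomial tail. The sketch below uses only the standard library (`math.comb` and `statistics.NormalDist`) rather than SciPy; the variable names are illustrative and not from the original notebook.

```python
from math import comb, sqrt
from statistics import NormalDist

n, p = 500, 0.62
q = 1 - p

# Exact binomial tail: P(X >= 290) = sum of P(X = k) for k = 290..500
exact = sum(comb(n, k) * p**k * q**(n - k) for k in range(290, n + 1))

# Normal approximation with continuity correction (bound at 289.5)
approx = 1 - NormalDist(mu=n * p, sigma=sqrt(n * p * q)).cdf(289.5)

print(f"exact  = {exact:.4f}")
print(f"approx = {approx:.4f}")
print(f"abs. difference = {abs(exact - approx):.4f}")
```

If SciPy is available, `binom.sf(289, n, p)` should give the same exact tail, since the survival function returns P(X > 289) = P(X >= 290).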
