In [1]:
import scipy
import numpy as np

In [2]:
from scipy.stats import binom

![](http://www.stat.yale.edu/Courses/1997-98/101/binpdf.gif)

A survey found that 65% of all financial consumers were very satisfied with
their primary financial institution. Suppose that 25 financial consumers are sampled.
If the survey result still holds true today, what is the probability that
exactly 19 are very satisfied with their primary financial institution?

In [3]:
print(binom.pmf(k=19, n=25, p=0.65))

0.09077799859322791
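
As a cross-check, the same value can be reproduced from the binomial formula P(X = k) = C(n, k) p^k (1 - p)^(n - k). The snippet below is a minimal sketch; the helper name `binom_pmf_manual` is made up for illustration and is not part of SciPy.

```python
from math import comb
from scipy.stats import binom

def binom_pmf_manual(k, n, p):
    """Binomial pmf from the closed-form formula (illustrative helper)."""
    return comb(n, k) * p**k * (1 - p)**(n - k)

manual = binom_pmf_manual(19, 25, 0.65)   # formula by hand
library = binom.pmf(k=19, n=25, p=0.65)   # SciPy's value
print(manual, library)
```

The two numbers should agree up to floating-point rounding.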


According to the U.S. Census Bureau, approximately 6% of all workers in Jackson, Mississippi, are unemployed. In conducting a random telephone survey in Jackson, what is the probability of getting two or fewer unemployed workers in a sample of 20?

In [4]:
binom.cdf(2, 20, 0.06)

0.8850275957378545
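
"Two or fewer" means P(X = 0) + P(X = 1) + P(X = 2), which is what `binom.cdf(2, ...)` accumulates. A small sketch that sums the individual pmf values and compares them with the cdf (the variable names are illustrative):

```python
import numpy as np
from scipy.stats import binom

n, p = 20, 0.06

# Sum the pmf over k = 0, 1, 2
summed = binom.pmf(np.arange(0, 3), n, p).sum()

# The cdf evaluated at 2 should give the same probability
direct = binom.cdf(2, n, p)
print(summed, direct)
```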

Find the binomial probability P(x = 10) for n = 20 and p = 0.40.

In [5]:
print(binom.pmf(k=10, n=20, p=0.4))

0.11714155053639011


Poisson Distribution

In [7]:
from scipy.stats import poisson

In [8]:
poisson.pmf(3, 2)   # P(X = 3) for a Poisson distribution with mean 2

0.18044704431548356

Suppose bank customers arrive randomly on weekday afternoons at an average of 3.2
customers every 4 minutes. What is the probability of exactly 5 customers arriving in a
4-minute interval on a weekday afternoon?

In [9]:
poisson.pmf(5,3.2)

0.11397938346351824

Bank customers arrive randomly on weekday afternoons at an average of 3.2 customers
every 4 minutes. What is the probability of having more than 7 customers in
a 4-minute interval on a weekday afternoon?

In [11]:
prob = poisson.cdf(7,3.2)

In [12]:
prob_more_than_7 = 1-prob
prob_more_than_7

0.01682984174895752
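
The upper tail can also be requested directly with the survival function, `sf(k) = 1 - cdf(k) = P(X > k)`, which avoids the explicit subtraction. A minimal sketch for comparison:

```python
from scipy.stats import poisson

# P(X > 7) via the survival function
tail_sf = poisson.sf(7, 3.2)

# Same probability via the complement of the cdf
tail_complement = 1 - poisson.cdf(7, 3.2)
print(tail_sf, tail_complement)
```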

A bank has an average random arrival rate of 3.2 customers every 4 minutes. What
is the probability of getting exactly 10 customers during an 8-minute interval?

In [13]:
poisson.pmf(10, 6.4)   # mean doubles from 3.2 to 6.4 for an 8-minute interval

0.052790043854115495
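
The mean of 6.4 comes from rescaling the arrival rate to the new interval length: 3.2 customers per 4 minutes is 6.4 customers per 8 minutes. A sketch of that step, with illustrative variable names:

```python
from scipy.stats import poisson

rate_per_4_min = 3.2      # average arrivals in a 4-minute interval
interval_minutes = 8      # interval asked about in the question

# Scale the Poisson mean in proportion to the interval length
mu = rate_per_4_min * interval_minutes / 4

prob_10 = poisson.pmf(10, mu)
print(mu, prob_10)
```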

Uniform Distribution

Suppose the amount of time it takes to assemble a plastic module ranges from 27 to 39 seconds and that assembly times are uniformly distributed. Describe the distribution. What is the probability that a given assembly will take between 30 and 35 seconds?

In [14]:
U = np.arange(27, 40, 1)
U

array([27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39])

In [15]:
from scipy.stats import uniform
uniform.mean(loc=27,scale=12)

33.0

In [16]:
uniform.cdf(np.arange(30, 36, 1), loc=27, scale=12)

array([0.25      , 0.33333333, 0.41666667, 0.5       , 0.58333333,
       0.66666667])

In [17]:
Prob = 0.66666667 - 0.25   # cdf(35) - cdf(30), read off the output above
Prob

0.41666667
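
Rather than copying rounded values from the printed array, the interval probability can be computed in one step as cdf(35) - cdf(30). For a uniform distribution on [a, b] this equals (35 - 30) / (b - a). A minimal sketch (variable names are illustrative):

```python
from scipy.stats import uniform

a, b = 27, 39
dist = uniform(loc=a, scale=b - a)   # SciPy's uniform covers [loc, loc + scale]

# P(30 <= X <= 35) from the cdf
prob_cdf = dist.cdf(35) - dist.cdf(30)

# Same probability from the closed-form expression
prob_formula = (35 - 30) / (b - a)
print(prob_cdf, prob_formula)
```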

According to the National Association of Insurance Commissioners, the average annual cost for automobile insurance in the United States in a recent year was $691. Suppose automobile insurance costs are uniformly distributed in the United States, ranging from $200 to $1,182. What is the standard deviation of this uniform distribution?


In [18]:
uniform.mean(loc=200,scale=982)

691.0

In [19]:
uniform.std(loc=200,scale=982)

283.4789821721062
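
For a uniform distribution on [a, b], the standard deviation has the closed form (b - a) / sqrt(12). The sketch below checks that formula against SciPy for this example:

```python
from math import sqrt
from scipy.stats import uniform

a, b = 200, 1182

# Closed-form standard deviation of a uniform distribution
std_formula = (b - a) / sqrt(12)

# SciPy's value, with loc = a and scale = b - a
std_scipy = uniform.std(loc=a, scale=b - a)
print(std_formula, std_scipy)
```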

Normal Distribution 

![](https://ds055uzetaobb.cloudfront.net/brioche/uploads/enBFdM8LyU-basic-normal-distribution.png?width=1200)

In [21]:
from scipy.stats import norm

In [22]:
val, m, s = 68, 65.5, 2.5   # value, mean, standard deviation

In [23]:
print(norm.cdf(val,m,s))

0.8413447460685429


P(x > val) = 1 - cdf(val)

In [24]:
print(1-norm.cdf(val,m,s))

0.15865525393145707


P(val1 < x < val2) = cdf(val2) - cdf(val1)

In [25]:
print(norm.cdf(val,m,s)-(norm.cdf(63,m,s)))

0.6826894921370859


What is the probability of obtaining a score greater than 700 on a GMAT test that has
a mean of 494 and a standard deviation of 100? Assume GMAT scores are normally
distributed.

P(x > 700 | m = 494 and s = 100) = ?

In [26]:
print(1-norm.cdf(700,494,100))

0.019699270409376912
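
The same answer can be reached the textbook way: convert 700 to a z score, z = (x - m) / s, and look up the upper tail of the standard normal distribution. A minimal sketch:

```python
from scipy.stats import norm

x, m, s = 700, 494, 100

# Standardize: number of standard deviations above the mean
z = (x - m) / s

# Upper-tail probability of the standard normal, P(Z > z)
p_standard = norm.sf(z)

# Same probability without standardizing
p_direct = 1 - norm.cdf(x, m, s)
print(z, p_standard, p_direct)
```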


For the same GMAT examination, what is the probability of randomly drawing a
score that is 550 or less?

In [27]:
print(norm.cdf(550,494,100))

0.712260281150973


What is the probability of randomly obtaining a score between 300 and 600 on the
GMAT exam?

In [28]:
print(norm.cdf(600,494,100)-norm.cdf(300,494,100))

0.8292378553956377


What is the probability of getting a score between 350 and 450 on the same GMAT
exam?

In [29]:
print(norm.cdf(450,494,100)-norm.cdf(350,494,100))

0.2550348541262666


In [30]:
norm.ppf(0.95)

1.6448536269514722

In [31]:
norm.ppf(1-.6772)

-0.45988328292440145
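
The two cells above use `norm.ppf`, the percent point function, which is the inverse of `norm.cdf`: given a probability q it returns the value z with P(Z <= z) = q. The sketch below shows the round trip; applying it to the GMAT parameters (mean 494, standard deviation 100) is only an illustration and is not a question posed in the notebook.

```python
from scipy.stats import norm

z_95 = norm.ppf(0.95)         # z with P(Z <= z) = 0.95
round_trip = norm.cdf(z_95)   # should recover 0.95 up to rounding

# ppf also accepts a mean (loc) and standard deviation (scale)
score_95 = norm.ppf(0.95, loc=494, scale=100)
print(z_95, round_trip, score_95)
```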

Hypergeometric Distribution

Suppose 18 major computer companies operate in the United States and that 12 are located in California’s Silicon Valley. If three computer companies are selected randomly from the entire list, what is the probability that one or more of the selected companies are located in Silicon Valley?


In [38]:
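from scipy.stats import hypergeom
# sf(k) = 1 - cdf(k) = P(X > k). Textbook notation: sf(x - 1, N, n, A) with
# population N = 18, sample size n = 3, successes in population A = 12.
pval = hypergeom.sf(0, 18, 3, 12)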

In [39]:
pval

0.9754901960784306
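
"One or more" is the complement of "none", so the result can be checked by counting: P(X >= 1) = 1 - C(6, 3) / C(18, 3). In the sketch below the arguments are passed in the order SciPy documents, `(k, M, n, N)` = (k, population size, successes in population, number of draws). The call in the cell above passes 3 and 12 the other way round; the hypergeometric pmf is symmetric in those two arguments, so both orders give the same value. Variable names are illustrative.

```python
from math import comb
from scipy.stats import hypergeom

population, successes, draws = 18, 12, 3

# P(no Silicon Valley company among the 3 drawn)
p_none = comb(population - successes, draws) / comb(population, draws)
p_at_least_one = 1 - p_none

# SciPy, using the documented argument order (k, M, n, N)
scipy_value = hypergeom.sf(0, population, successes, draws)
print(p_at_least_one, scipy_value)
```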

A western city has 18 police officers eligible for promotion. Eleven of the 18 are Hispanic. Suppose only five of the police officers are chosen for promotion. If the officers chosen for promotion had been selected by chance alone, what is the probability that one or fewer of the five promoted officers would have been Hispanic?


In [40]:
pval = hypergeom.cdf(1, 18, 5, 11)

In [36]:
pval

0.04738562091503275

Exponential Distribution

A manufacturing firm has been involved in statistical quality control for several years. As part of the production process, parts are randomly selected and tested. From the records of these tests, it has been established that defective parts occur in a pattern that is Poisson distributed, with an average of 1.38 defects every 20 minutes during production runs. Use this information to determine the probability that less than 15 minutes will elapse between any two defects.

In [41]:
mu1 = 1/1.38   # mean time between defects, in units of 20 minutes
mu1

0.7246376811594204

In [42]:
from scipy.stats import expon
expon.cdf(0.75, 0, 1/1.38)   # x = 15/20 = 0.75 intervals; loc = 0; scale = 1/1.38 (mean time between defects)

0.6447736190750485
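
With `loc = 0`, the exponential cdf reduces to P(T <= t) = 1 - exp(-lambda * t), where lambda = 1.38 defects per 20-minute interval and t = 0.75 intervals. A minimal sketch comparing the formula with SciPy (variable names are illustrative):

```python
from math import exp
from scipy.stats import expon

lam = 1.38       # defects per 20-minute interval
t = 15 / 20      # 15 minutes expressed in 20-minute intervals

# Closed-form exponential cdf
p_formula = 1 - exp(-lam * t)

# SciPy parameterizes the exponential by scale = 1 / lambda
p_scipy = expon.cdf(t, loc=0, scale=1 / lam)
print(p_formula, p_scipy)
```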