<a href="https://colab.research.google.com/github/rajdepot/probnstats/blob/main/29th_June_2024.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

Problem Overview
A cosmetic company, Beauty Boost, is launching a new line of skincare products targeted at teenagers with acne-prone skin. They want to ensure the effectiveness of these products before launching them. They have developed a new acne-clearing gel and are conducting various tests to assess its efficacy.

Specific Question
Quality assurance tests on previous batches of skincare products have shown that 75% of the time, the products yield satisfactory results in clearing acne. Given a sample of 20 units of the new acne-clearing gel, we need to calculate the theoretical probability that at most 5 units do not yield satisfactory results.

Key Points
Success Rate: 75% of the products yield satisfactory results (probability of success, $p = 0.75$).
Sample Size: The sample consists of 20 units (number of trials, $n = 20$).
Failures: We need to find the probability that at most 5 units do not yield satisfactory results.
Steps to Solve the Problem
Identify the Distribution: The problem involves a fixed number of trials (20 units), each with two possible outcomes: yielding satisfactory results (success) or not (failure). Assuming the units perform independently and each has the same probability of success, this scenario fits a binomial distribution.

Define Success and Failure:

Success: A unit yields satisfactory results (probability $p = 0.75$).
Failure: A unit does not yield satisfactory results (probability $q = 1 - p = 0.25$).

Calculate the Desired Probability: We want to find the probability that at most 5 units out of 20 do not yield satisfactory results, which is equivalent to finding $P(X \le 5)$, where $X$ is the number of failures in 20 trials.
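Counting at most 5 failures is the same event as counting at least 15 successes, so the two formulations should give the same probability. The following is a minimal sketch of that check using only the standard library; the helper name `binom_pmf` and the variable names are illustrative assumptions, not part of the original notebook.

```python
from math import comb


def binom_pmf(k: int, n: int, prob: float) -> float:
    """P(exactly k occurrences in n independent trials, each with probability prob)."""
    return comb(n, k) * prob**k * (1 - prob) ** (n - k)


n, p = 20, 0.75
q = 1 - p  # probability that a unit does not yield satisfactory results

# Count failures directly: at most 5 failures, per-trial probability q.
at_most_5_failures = sum(binom_pmf(k, n, q) for k in range(0, 6))

# Count successes instead: at least 15 successes, per-trial probability p.
at_least_15_successes = sum(binom_pmf(k, n, p) for k in range(15, n + 1))

print(at_most_5_failures, at_least_15_successes)
```

Both sums describe the same outcomes, so any difference between the two printed values should be at the level of floating-point rounding.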

Binomial Distribution
The probability mass function of a binomial distribution is given by:

$$P(X = k) = \binom{n}{k} p^k (1 - p)^{n - k}$$


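Where:

$n$ is the number of trials.
$k$ is the number of times the counted outcome occurs (called "successes" in the generic formula).
$p$ is the probability of the counted outcome in each trial.
In our case, $X$ counts failures, so the per-trial probability that enters the formula is $q = 0.25$ rather than $0.75$. We sum the probabilities for $k$ ranging from 0 to 5 (since we are looking for at most 5 failures):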
$$P(X \le 5) = \sum_{k=0}^{5} \binom{20}{k} (0.25)^k (0.75)^{20 - k}$$
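To make the sum concrete, the six terms can be written out individually. The values below are rounded to four decimal places, so they are an illustrative expansion rather than exact figures:

$$
\begin{aligned}
P(X = 0) &= \binom{20}{0} (0.25)^0 (0.75)^{20} \approx 0.0032 \\
P(X = 1) &= \binom{20}{1} (0.25)^1 (0.75)^{19} \approx 0.0211 \\
P(X = 2) &= \binom{20}{2} (0.25)^2 (0.75)^{18} \approx 0.0669 \\
P(X = 3) &= \binom{20}{3} (0.25)^3 (0.75)^{17} \approx 0.1339 \\
P(X = 4) &= \binom{20}{4} (0.25)^4 (0.75)^{16} \approx 0.1897 \\
P(X = 5) &= \binom{20}{5} (0.25)^5 (0.75)^{15} \approx 0.2023
\end{aligned}
$$

Adding the unrounded terms gives $P(X \le 5) \approx 0.6172$; summing the rounded values shown here differs slightly in the last digit.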


Calculation
This cumulative probability can be computed with statistical software. In Python, the cumulative distribution function (CDF) of the binomial distribution gives:

$P(X \le 5) \approx 0.617$

Conclusion
The theoretical probability that at most 5 units out of 20 do not yield satisfactory results is approximately 61.7%. In other words, there is a 61.7% chance that 5 or fewer of the 20 units tested fail to give satisfactory results (equivalently, that at least 15 units succeed).

In [1]:
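import scipy.stats as stats

# Parameters
n = 20        # number of units sampled (trials)
p = 0.75      # probability that a unit yields satisfactory results
q = 1 - p     # probability that a unit does not yield satisfactory results
k = 5         # maximum number of unsatisfactory units

# P(X <= 5), where X ~ Binomial(n, q) counts the unsatisfactory units
cumulative_probability = stats.binom.cdf(k, n, q)
cumulative_probability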


0.6171726543871046
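As a rough sanity check on the theoretical value, the experiment can also be simulated. The sketch below is not part of the original notebook: the function name `simulate_at_most_failures`, the number of simulated samples, and the seed are all assumptions chosen for illustration, and the estimate will vary with them.

```python
import random


def simulate_at_most_failures(n=20, q=0.25, max_failures=5, trials=50_000, seed=0):
    """Estimate P(number of failures <= max_failures) by repeated sampling."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    hits = 0
    for _ in range(trials):
        # Each unit fails independently with probability q.
        failures = sum(rng.random() < q for _ in range(n))
        if failures <= max_failures:
            hits += 1
    return hits / trials


estimate = simulate_at_most_failures()
print(f"simulated P(X <= 5) ~ {estimate:.3f}")
```

With enough simulated samples the estimate should land close to the CDF value computed above, with the gap shrinking as `trials` grows.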