# Introduction: Continuous Probability Distributions


- Welcome to the session on **‘Continuous Probability Distributions’**.
- In the last session, you learnt about:
    - the **binomial distribution**,
    - the **uniform distribution**, and
    - the concept of **cumulative probability**.

 

## In this session

In this session, you will learn about **cumulative probability** in a little more depth. You will see how the **probability of a continuous variable** is expressed and how it is different from the way the probability of a discrete variable is expressed. You will then learn about the **normal distribution**, which is a commonly used probability distribution among continuous random variables.

### Reference Ebook:

- Statistical Inference for Data Science by Brian Caffo.
    - https://leanpub.com/LittleInferenceBook

# Probability Density Functions - I

In the last section, you saw how to find the probability of certain events using multiplication and addition rules of probability. Also, for some specific cases, you saw that probability distributions like the binomial distribution and the uniform distribution can be used to find the probability.


However, so far we have only been talking about discrete random variables, e.g. number of balls, number of patients, cars, wickets, pasta packets, etc. What happens when we talk about the **probability of continuous random variables**, such as time, weight etc.? Is there any difference? Let’s see.

#### Q: Employee Commute Time
What would be the approximate probability that an employee’s daily commute time to work is exactly 35 minutes?

[You’re not expected to know the right answer at this point. This question is given just to get you thinking in the right direction.]

**Options**:
- Approximately 0%

- Approximately (35/30)%

- Insufficient information — I need the probability distribution

**Ans**:
- Approximately 0%
    - Since time is a continuous variable, it can take infinitely many values. An employee could have a commute time of 34.99, 35 or 35.01 minutes, or any value in between. So very few employees, if any, will have a commute time of exactly 35 minutes, and the probability is approximately zero.


#### Cumulative Probability

- Daily commute time of company X's Employees

| x(Commute Time) | Probability |
| --- | --- |
| 20-25 | 0.15 |
| 25-30 | 0.20 |
| 30-35 | 0.30 |
| 35-40 | 0.20 |
| 40-45 | 0.15 |

---

| x(Commute Time) | P(X<=x) |
| --- | --- |
| 20-25 | 0.15 |
| 25-30 | 0.35 |
| 30-35 | 0.65 |
| 35-40 | 0.85 |
| 40-45 | 1.00 |
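
The cumulative column is simply a running total of the interval probabilities. A minimal Python sketch of that step, using the values from the tables above (the variable names are illustrative, not from the course material):

```python
from itertools import accumulate

# Interval probabilities from the commute-time table above.
intervals = ["20-25", "25-30", "30-35", "35-40", "40-45"]
probabilities = [0.15, 0.20, 0.30, 0.20, 0.15]

# Running total; rounding hides floating-point noise in the sums.
cumulative = [round(total, 2) for total in accumulate(probabilities)]

for interval, cum in zip(intervals, cumulative):
    upper_limit = interval.split("-")[1]
    print(f"P(X <= {upper_limit}) = {cum:.2f}")
```

Each cumulative value applies to the upper limit of its interval, which is why the last entry reaches 1.00.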



## CDF - Cumulative Distribution Function

- A CDF, or cumulative distribution function, is a function that gives the cumulative probability P(X ≤ x) for each value x. Its graph plots the cumulative probability of X against X.

![image.png](attachment:image.png)

- Figure 1 - CDF (Cumulative Distribution Function)


---

## PDF - Probability Density Function

- A PDF, or probability density function, is a function for which the area under the curve gives you the cumulative probability.

![image.png](attachment:image.png)

- Figure 2 - PDF (Probability Density Functions)

For example, the area under the curve between 20 (the smallest possible value of X) and 28 gives the cumulative probability for X = 28.

----


- The main difference between the **cumulative probability distribution** of a continuous random variable and a **discrete one**, is the way you plot them. 
    - While the continuous variables’ cumulative distribution is a **curve**, 
    - the distribution for discrete variables looks more like a **bar chart**:
    
    

![image.png](attachment:image.png)

- Figure 3 - Cumulative Probability Distribution for Continuous Variables (Commute Time)


![image-2.png](attachment:image-2.png)

- Figure 4 - Cumulative Probability Distribution for Discrete Variables (Number of Red Balls)


- The reason for showing both of these so differently is that, for discrete variables, the cumulative probability does not change very frequently. In the discrete example, we only care about what the probability is for 0, 1, 2, 3 and 4. This is because the cumulative probability will not change between, say, 3 and 3.999999. For all values between these two, the cumulative probability is equal to 0.8704.

 

- However, for the continuous variable, i.e. the daily commute time, you have a different cumulative probability value for every value of X. For example, the value of cumulative probability at 21 will be different from its value at 21.1, which will again be different from the one at 21.2 and so on. Hence, you would show its cumulative probability as a continuous curve, not a bar chart.





- So, now you know what a CDF is and what a PDF is. 
    - Since these two functions talk about probabilities in terms of intervals rather than exact values, it is advisable to use them when talking about continuous random variables, and not the bar chart distribution that we used for discrete variables.
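
To see the step behaviour of a discrete cumulative distribution numerically, here is a small sketch. It assumes the red ball game behaves like a binomial experiment with n = 4 draws and p = 0.6 per draw. Those parameters are an assumption, chosen only because they reproduce the 0.8704 quoted above, and `binomial_cdf` is an illustrative helper name.

```python
from math import comb, floor

def binomial_cdf(x, n=4, p=0.6):
    """P(X <= x) for a binomial variable; n and p are assumed values."""
    k_max = min(floor(x), n)
    # range() is empty when k_max is negative, so the sum is 0 below zero.
    return sum(comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(k_max + 1))

# The CDF is flat between integers and only jumps at 0, 1, 2, 3 and 4.
for x in (3, 3.5, 3.999999, 4):
    print(x, round(binomial_cdf(x), 4))
```

For a continuous variable there are no such flat stretches, which is why its cumulative distribution is drawn as a curve.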

#### Q: Cumulative Probability of Continuous Distributions
Recall that, in the above video, we talked about a fictional company with 3,000 employees in a local office. Now, let’s say you went to that office, and asked those 3,000 employees to play our UpGrad red ball game (from the previous sessions 1 and 2). Based on the data on these 3,000 people, if you create the probability distribution for X, then the values of P(X=0), P(X=1), P(X=2), P(X=3) and P(X=4) would all be:

[X = number of red balls drawn by a player after playing the game once]

**Options**:
- zero

- non-zero

**Ans**:
- non-zero
    - The probabilities mentioned will not be zero. It doesn’t matter whether the number of people playing the game is 75 or 3,000. X is not a continuous variable, and there are only 5 possible values of X. So, the probability of somebody drawing, say 3 red balls is much higher than zero.

----

# Probability Density Functions - II

A commonly observed type of distribution among continuous variables is the **uniform distribution**. For a continuous random variable following a uniform distribution, the value of probability density is equal for all possible values. Let’s explore this distribution a little more.

#### Q: Uniform Distribution
In a uniform PDF, all the possible values have the same probability density. The figure below shows such a uniform PDF, where the possible values are 0 to 10.

![image.png](attachment:image.png)

For this graph, what is the value of the probability density from X = 0 to X = 10?



**Ans**:

0.1

- Since all possible values are between 0 and 10, the area under the curve between 0 and 10 is equal to 1.

![image.png](attachment:image.png)

- Figure 5 - Uniform PDF


Clearly, this area is the area of a rectangle with length 10 and unknown height h. Hence, you can say that 10*h = 1, which gives us h = 0.1. So, the value of the PDF for all values between 0 and 10 is 0.1.

#### Q: Uniform Distribution
For the uniform PDF from the previous question, find the cumulative probability for X = 0.5.

**Ans**: The correct answer is 0.05.


The cumulative probability for X = 0.5 is equal to the area under the curve between X = 0, the lowest possible value, and X = 0.5.


![image.png](attachment:image.png)

- Figure 6 - Uniform PDF

- This area = 0.1*0.5 = 0.05.
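
The two answers above follow from the rectangle argument: the density is 1 divided by the width of the range, and the cumulative probability is the width covered so far times that density. A minimal sketch, with illustrative function names and the 0 to 10 range as default arguments:

```python
def uniform_pdf(x, low=0.0, high=10.0):
    """Probability density of a uniform variable on [low, high]."""
    return 1.0 / (high - low) if low <= x <= high else 0.0

def uniform_cdf(x, low=0.0, high=10.0):
    """Cumulative probability P(X <= x): area of the rectangle from low to x."""
    if x <= low:
        return 0.0
    if x >= high:
        return 1.0
    return (x - low) / (high - low)

print(uniform_pdf(3.0))   # height of the rectangle
print(uniform_cdf(0.5))   # area between 0 and 0.5
```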

Now, I’m sure you are wondering, when to use PDFs and when to use CDFs? They are both good for continuous variables, but which one is used more in real life analysis?

 

Well, PDFs are more commonly used in real life. The reason is that it is much **easier to see patterns in PDFs** as compared to CDFs. For example, here are the PDF and the CDF of a uniformly distributed continuous random variable:

![image.png](attachment:image.png)

- Figure 7 - PDF and CDF for a Uniformly Distributed Variable
    - The **PDF clearly shows uniformity**, as the probability density’s value remains constant for all possible values. 

    - However, the **CDF does not show any trends** that help you identify quickly that the variable is uniformly distributed.



Now, let’s see the PDF and the CDF of a symmetrically distributed continuous random variable:

![image.png](attachment:image.png)

- Figure 8 - PDF vs CDF for a Symmetrically Distributed Variable

Again, it is clear that the symmetrical nature of the variable is much more apparent in the PDF than in the CDF.


Hence, generally, PDFs are used more commonly than CDFs.





#### Q: Cumulative Probability of Continuous Variables
Suppose you work at a sports analysis company and you want to analyse the effect a bowler’s height has on his/her performance. So, you create a list of all 5-wicket hauls in the last decade. Based on this data, you create a cumulative probability distribution for X, where X = height of the bowler who took the 5-wicket haul.

Now, based on the data, you conclude that the cumulative probability, F(175.3 cm) = 0.3. In this case, which of the following statements is correct?

Statement 1: P(X ≤ 175.3 cm) = 0.3

Statement 2: P(X < 175.3 cm) = 0.3

(Remember that height is a continuous variable.)

**Options**:
- Only statement 1 is correct

- Only statement 2 is correct

- Both statements 1 and 2 are correct

**Ans**:
- Both statements 1 and 2 are correct
    - You can say that P(X ≤ 175.3 cm) = P(X < 175.3 cm) + P(X = 175.3 cm). Now, since X is a continuous variable, you know that the probability of getting an exact value is zero. Hence, P(X = 175.3 cm) = 0, which means that P(X ≤ 175.3 cm) = P(X < 175.3 cm) + 0 = P(X < 175.3 cm).


# Normal Distribution
- Also called the bell curve or the Gaussian distribution

You’ve seen how the probability distributions of continuous random variables differ from those of discrete random variables.


But can you think of some examples of continuous distributions? Which is the most commonly used continuous probability distribution? Which distribution occurs most commonly in nature? Let’s hear from Prof. Tricha on this.


All data that is normally distributed follows the **1-2-3** rule. This rule states that there is a -

1. **68%** probability of the variable lying **within 1 standard deviation** of the mean

2. **95%** probability of the variable lying **within 2 standard deviations** of the mean

3. **99.7%** probability of the variable lying **within 3 standard deviations** of the mean


![image.png](attachment:image.png)

- Figure 9 - 1-2-3 Rule for a Normal Distribution
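
The 68%, 95% and 99.7% figures are rounded values. As a quick check, the sketch below computes them from the standard normal cumulative distribution. It assumes Python 3.8+ and the standard library's `statistics.NormalDist`; `prob_within` is an illustrative helper name.

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0, sigma=1)

def prob_within(k):
    """P(mu - k*sigma < X < mu + k*sigma) for any normal variable."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)

for k in (1, 2, 3):
    print(f"within {k} standard deviation(s): {prob_within(k):.4f}")
```

The same three numbers come out for every choice of mean and standard deviation, because only the distance from the mean in units of σ matters.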


#### Q: Finding the probability for a normal variable X

- Mean(µ) = 35
- Standard deviation(σ) = 5

- P(25 < X < 45) = ?
    - = **P(µ-2σ < X < µ+2σ)** ~= **95%**
    - = P(35 - 2 * 5 < X < 35 + 2 * 5)
    - = P(25 < X < 45)
    - == 95 %

- Mean(µ) = 35
- Standard deviation(σ) = 5

- P(25 < X < 50) = ?
    - = **P(µ-2σ < X < µ+3σ)**
    - = P(µ-2σ < X < µ) + P(µ < X < µ+3σ), i.e. the part to the left of the mean plus the part to the right
    - = 47.5% + 49.85%
    - ≈ 97.35%
    
 ![pob6.jpg](attachment:pob6.jpg)
 

- Mean(µ) = 35
- Standard deviation(σ) = 5

- P(X < 40) = ?
    - = **P( X < µ + σ)** 
    - = 50% + 34%    
    - == 84 %
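
The three answers above rely on the rounded 1-2-3 percentages. As a cross-check, here is a sketch that computes the same probabilities from the normal cumulative distribution directly. It assumes Python 3.8+ and `statistics.NormalDist`; the variable names are illustrative.

```python
from statistics import NormalDist

# Normal variable with mean 35 and standard deviation 5, as in the examples.
x_dist = NormalDist(mu=35, sigma=5)

p_25_to_45 = x_dist.cdf(45) - x_dist.cdf(25)   # P(25 < X < 45)
p_25_to_50 = x_dist.cdf(50) - x_dist.cdf(25)   # P(25 < X < 50)
p_below_40 = x_dist.cdf(40)                    # P(X < 40)

for label, p in [("P(25 < X < 45)", p_25_to_45),
                 ("P(25 < X < 50)", p_25_to_50),
                 ("P(X < 40)", p_below_40)]:
    print(f"{label} = {p:.4f}")
```

The computed values (about 0.9545, 0.9759 and 0.8413) differ slightly from 95%, 97.35% and 84%, because the rule of thumb rounds the probability within 2 standard deviations down to 95%.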

This is like saying that if you buy a loaf of bread every day and weigh it (mean weight = 100 g, standard deviation = 1 g), then approximately:

1. For 5 days every week, the weight of the loaf you bought that day will be within 99 g (100-1) and 101 g (100+1).

2. For 20 days every 3 weeks, the weight of the loaf you bought that day will be within 98 g (100-2) and 102 g (100+2).

3. For 364 days every year, the weight of the loaf you bought that day will be within 97 g (100-3) and 103 g (100+3).

A lot of naturally occurring variables are normally distributed. For example, the heights of a group of adult men would be normally distributed. To try this out, we asked 50 male employees at the UpGrad office for their height and then plotted the probability density function using that data.


![image.png](attachment:image.png)

- Figure 10 - PDF Generated in R, for X (Height in inches)

As you can see, the data is roughly normal.

 

#### Q: Probability of Normal Random Variables
Let’s say that you need to find the cumulative probability for a random variable X which is normally distributed. You do not know what the value of X is or, for that matter, what the value of µ and σ is. You only know that X = µ + σ. Can you find the cumulative probability, i.e. the probability of the variable being less than µ + σ?

**Options**:
- Yes

- No

**Ans**:
- Yes
    - You can still find the cumulative probability. If the variable is normally distributed, then whatever the values of µ and σ are, there is a 34% probability that X lies between µ and µ + σ, i.e. P(µ < X < µ + σ) = 34%. Similarly, there is a 50% probability that X is less than µ, i.e. P(X < µ) = 50%. Hence, P(X < µ + σ) = 50% + 34% = 84% for every normal variable, no matter what the values of µ and σ are.




# Standard Normal Distribution

As you learnt in the previous question, it doesn’t matter what the value of µ and σ is. All you need to know, if you want to find the probability, is how far the value of X is from µ — specifically, what **multiple of σ** is the **difference between X and µ**.

 

Let’s see how you can find this.

- Mean(µ) = 35
- Standard deviation(σ) = 5

- X = 43.25
    - X - µ = 43.25 - 35 = 8.25
    - (X - µ)/σ = 8.25/5 = 1.65
    
    
- Z = (X - µ) / σ
- Z is called 'Standardised normal variable'
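
The standardisation step is a one-line calculation. A minimal sketch, where `z_score` is an illustrative function name:

```python
def z_score(x, mean, std_dev):
    """Number of standard deviations by which x lies above the mean."""
    if std_dev <= 0:
        raise ValueError("standard deviation must be positive")
    return (x - mean) / std_dev

# Example from above: X = 43.25 with mean 35 and standard deviation 5.
z = z_score(43.25, 35, 5)
print(round(z, 2))
```

A negative result means the value lies below the mean.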

#### Q: Standardised Random Variable
A normally distributed random variable X is converted to Z. Find P(-2<Z<3). (Report the answer as a number rounded to two digits after the decimal point.)


**Ans**: The correct answer is 0.97 (using the 1-2-3 rule: 47.5% + 49.85% = 97.35%).

![pob7-2.jpg](attachment:pob7-2.jpg)

![pob8.jpg](attachment:pob8.jpg)

# Reading the Z - Table

### Q: Cumulative probability for Z = 0.68

- Look up the Z-table
    - 0.6 -> Row
    - 0.08 -> Column
    - Value at the intersection: 0.7517

![pob9.jpg](attachment:pob9.jpg)
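
A printed Z-table is just the standard normal cumulative probability, rounded to four decimals and indexed by row (first decimal of Z) and column (second decimal). The sketch below mimics that lookup. It assumes Python 3.8+ and `statistics.NormalDist`; `z_table_value` is an illustrative helper, not part of any library.

```python
from statistics import NormalDist

def z_table_value(row, column):
    """Cumulative probability P(Z < row + column), rounded like a printed Z-table."""
    return round(NormalDist().cdf(row + column), 4)

# Row 0.6 and column 0.08 correspond to Z = 0.68.
print(z_table_value(0.6, 0.08))
```

The printed value should agree with the 0.7517 read from the table.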

### Q: 
- Mean(µ) = 35
- Standard deviation(σ) = 5

- P(X < 43.25) ?

- Z = (X - µ) / σ
    - = (43.25 - 35) / 5
    - = 1.65
- P(X < 43.25) = P(Z < 1.65) ≈ 0.95 = 95%

### Q: 
- Mean(µ) = 35
- Standard deviation(σ) = 5

- P(25.2 < X < 44.8) ?
    - Z for X = 25.2: (25.2 - 35)/5 = -1.96
    - Z for X = 44.8: (44.8 - 35)/5 = +1.96
    - P(25.2 < X < 44.8) = P(Z < 1.96) - P(Z < -1.96)
    - = 0.975 - 0.025
    - = 0.95 = 95%
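
Both questions follow the same two steps: convert X to Z, then read the cumulative probability for Z. A sketch of that procedure, assuming Python 3.8+ and `statistics.NormalDist` in place of the printed table (`cumulative_via_z` is an illustrative name):

```python
from statistics import NormalDist

mean, std_dev = 35, 5
standard_normal = NormalDist()  # mean 0, standard deviation 1

def cumulative_via_z(x):
    """P(X < x), found by standardising x and using the standard normal CDF."""
    z = (x - mean) / std_dev
    return standard_normal.cdf(z)

p_below_43_25 = cumulative_via_z(43.25)                    # P(Z < 1.65)
p_between = cumulative_via_z(44.8) - cumulative_via_z(25.2)  # P(-1.96 < Z < 1.96)

print(f"P(X < 43.25) = {p_below_43_25:.4f}")
print(f"P(25.2 < X < 44.8) = {p_between:.4f}")
```

An interval probability is always the difference of two cumulative probabilities, never a probability "at" a single Z value.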

As you just learnt, the standardised random variable is an important parameter. It is given by:

 

Z = (X − μ) / σ

 

Basically, it tells you how many standard deviations away from the mean your random variable is. As you just saw, you can find the cumulative probability corresponding to a given value of Z, using the Z table:

![image.png](attachment:image.png)

Figure 11 - Z Table

Alternatively, you can use the following equation to find the cumulative probability:

 

$$F(Z) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{Z} e^{-t^2/2} \, dt$$

 

However, I’ve a feeling that you will prefer the table! :-)

 

Not only that, you can also use Excel or Python to find the cumulative probability for Z. For example, let’s say you want to find the cumulative probability for Z = 1.5. In Excel, you would type:

 

= NORM.S.DIST(1.5, TRUE)

Basically, the syntax is:

= NORM.S.DIST(z, TRUE)

 

Here, z is the value of the Z score for which you want to find the cumulative probability. TRUE = find cumulative probability, FALSE = find probability density.

 

Also, you can find the probability without standardising. Let’s say that X is normally distributed, with mean (μ) = 35 and standard deviation (σ) = 5. Now, if you want to find the cumulative probability for X = 30, you would type:

 

= NORM.DIST(30, 35, 5, TRUE)

Basically, the syntax is:

= NORM.DIST(x, mean, standard_dev, TRUE)
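
The text mentions Python but does not name a library, so the following is only one possible counterpart to the Excel formulas. It assumes Python 3.8+ and the standard library's `statistics.NormalDist`, and it also evaluates the integral for F(Z) through the error function `math.erf`.

```python
from math import erf, sqrt
from statistics import NormalDist

# Counterpart of = NORM.S.DIST(1.5, TRUE): cumulative probability for Z = 1.5
p_z = NormalDist().cdf(1.5)

# Counterpart of = NORM.S.DIST(1.5, FALSE): probability density at Z = 1.5
density_z = NormalDist().pdf(1.5)

# Counterpart of = NORM.DIST(30, 35, 5, TRUE): no standardising needed
p_x = NormalDist(mu=35, sigma=5).cdf(30)

# The integral F(Z) written in terms of the error function
p_z_erf = 0.5 * (1 + erf(1.5 / sqrt(2)))

print(f"{p_z:.4f} {density_z:.4f} {p_x:.4f} {p_z_erf:.4f}")
```

Here X = 30 is exactly one standard deviation below the mean, so `p_x` equals the cumulative probability for Z = -1.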

 

# Z-table:

The Z-table mentioned below might be broken. Please use this link instead, for solving the problem.
- https://www.math.arizona.edu/~rsims/ma464/standardnormaltable.pdf




#### Q: Normal Variables
What is the probability of a normally distributed random variable lying within 1.65 standard deviations of the mean?


**Options**:
- 95%


- 90%


- 85%


- 80%

**Ans**:

- 90%
    - You have to find the probability of the variable lying between μ-1.65σ and μ+1.65σ, i.e. P(μ-1.65σ < X < μ+1.65σ). In terms of Z, this becomes P(-1.65 < Z < +1.65). This is equal to P(Z < 1.65) - P(Z < -1.65) = 0.95 - 0.05 = 0.90.
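
The same calculation can be run in both directions: from a Z value to a probability, and from a probability back to a Z value. A short sketch, assuming Python 3.8+ and `statistics.NormalDist` (its `inv_cdf` method is the inverse of `cdf`):

```python
from statistics import NormalDist

standard_normal = NormalDist()

# Probability of lying within 1.65 standard deviations of the mean.
p_within_1_65 = standard_normal.cdf(1.65) - standard_normal.cdf(-1.65)

# Reverse lookup: the Z value below which 95% of the distribution lies.
z_for_95 = standard_normal.inv_cdf(0.95)

print(f"P(-1.65 < Z < 1.65) = {p_within_1_65:.4f}")
print(f"Z with cumulative probability 0.95 = {z_for_95:.3f}")
```

The reverse lookup lands close to 1.65, which is why that Z value pairs with the 90% interval.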


Again, there are some more probability distributions that are commonly seen among continuous random variables. They are not covered in this course, but if you want to go through some of them, you can use the links below -

1. Exponential Distribution
    - https://online.stat.psu.edu/stat414/lesson/15/15.1
2. Gamma Distribution
    - https://online.stat.psu.edu/stat414/lesson/15/15.4
3. Chi-Squared Distribution
    - https://online.stat.psu.edu/stat414/lesson/15/15.8

# Summary: Continuous Probability Distributions

You started this session by learning that, for a **continuous random variable**, the **probability of getting an exact value** is very low, almost **zero**. Hence, when talking about the probability of continuous random variables, you can only talk **in terms of intervals**. For example, for a particular company, the probability of an employee’s commute time being exactly equal to 35 minutes was zero, but the probability of an employee having a commute time between 35 and 40 minutes was 0.2.

 

Hence, for continuous random variables, **probability density functions (PDFs)** and **cumulative distribution functions (CDFs)** are used, instead of the bar chart type of distribution used for the probability of discrete random variables. These functions are preferred because they talk about probability in terms of intervals.

 

Then, you understood that the major difference between a PDF and a CDF is that in a CDF, you can find the cumulative probability directly by checking the value at x. However, for a PDF, you need to find the area under the curve between the lowest value and x to find the cumulative probability.

![image.png](attachment:image.png)

- Figure 12 - PDFs vs CDFs


However, you also learnt that **PDFs are still more commonly used**, mainly because it is very **easy to see patterns** in them. For example, for a uniformly distributed variable, the PDF and CDF look like this:


![image.png](attachment:image.png)

- Figure 13 - PDF and CDF for a Uniformly Distributed Variable


While the fact that the variable is uniformly distributed is clear from the PDF, the CDF does not offer any such quick insights.

 



- Next, you learnt about a very famous probability density function: the **normal distribution**.
    - You saw that it is **symmetric** and its **mean, median and mode** lie at the **centre**.

![image.png](attachment:image.png)

- Figure 14 - Normal Distribution


### You also learnt the 1-2-3 rule, which states that there is a -

1. 68% probability of the variable lying within 1 standard deviation of the mean

2. 95% probability of the variable lying within 2 standard deviations of the mean

3. 99.7% probability of the variable lying within 3 standard deviations of the mean

![image.png](attachment:image.png)

- Figure 15 - 1-2-3 Rule for the Normal Distribution


However, you saw that, to find the probability, you do not need to know the value of the mean or the standard deviation — it is enough to know the number of standard deviations away from the mean your random variable is. That is given by:

Z=(X−μ)/σ


 
This is called the Z score, or the standard normal variable.

 

Finally, you learnt how to find the cumulative probability for various values of Z, using the Z table. For example, you found the cumulative probability for Z = 0.68 by using the Z table.

![image.png](attachment:image.png)

- Figure 16 - Z Table

The intersection of row “0.6” and column “0.08”, i.e. 0.7517, is your answer.


The normal distribution finds use in many statistical analyses. In our module, it finds use in the next session, central limit theorem, which is then useful for understanding the next module on hypothesis testing.





# Practice Questions
These questions are NOT graded.
 

Let’s say you **work as an analyst** at a **pharma company** which manufactures an antipyretic drug (tablet form) with **paracetamol** as the active ingredient. The amount of paracetamol specified by the drug regulatory authorities is **500 mg** with a **permissible error of 10%**. Anything below 450 mg would be a quality issue for your company since the drug will be ineffective, while above 550 mg would be a serious regulatory issue.

#### Q1: Cumulative Probability Distributions
The regulatory authority selects a random tablet from Batch Z2. Based on previous knowledge, you know that Batch Z2 has a mean paracetamol level of 510 mg, and its standard deviation is 20 mg.

What is the probability that the tablet that has been selected by the authority has a paracetamol level below 550 mg?


**Options**:
- 48%

- 95%

- 98%

- 93%

**Ans**:
- 98%
    - Let’s define X as the amount of paracetamol in the selected tablet. Now, X is a normally distributed random variable, with mean μ = 510 mg and standard deviation σ = 20 mg. Now, you have to find the probability of X being less than 550, i.e. P(X<550). Converting this to Z, you get P(X<550) = P(Z<{550-510}/20) = P(Z<2) = 0.977, or 97.7%.

- P(X<550) ? 
- Mean(µ) = 510
- Standard deviation(σ) = 20
- Z = (X - µ) / σ
    - = (550 - 510)/20 == 2
    - P(X < 550) = P(Z < 2) == .97725 ~= 98%

--- 

#### Q2: Continuous Probability Distributions
Now, the company’s QC (Quality Control) department comes and selects a tablet at random from Batch Z2. It is interested in finding if the paracetamol level is above 450 mg or not.

What is the probability that the tablet selected by QC has a paracetamol level above 450 mg?

**Options**:
- 99.87%

- 99.74%

- 49.87%

- 99.61%

**Ans**:

- 99.87%
    - Let’s define X as the amount of paracetamol in the selected tablet. Now, X is a normally distributed random variable, with mean μ = 510 mg and standard deviation σ = 20 mg. Now, you have to find the probability of X being more than 450, i.e. P(X>450). Converting this to Z, you get P(X>450) = P(Z>{450-510}/20) = P(Z>-3) = 1 - P(Z<-3) = 0.9987, or 99.87%.


- P(X > 450)?
- Mean(µ) = 510
- Standard deviation(σ) = 20
- Z = (X - µ) / σ
    - = (450 - 510)/20 == -3
    - P(Z < -3) == .00135
    - P(X > 450) = P(Z > -3) = 1 - .00135 = 0.99865 ~= 99.87%


#### Q3: Continuous Probability Distributions
Now, let’s say that QC decides to sample one more tablet. This time, it selects a tablet from Batch Y4. Based on previous knowledge, you know that Batch Y4 has a mean paracetamol level of 505 mg, and its standard deviation is 25 mg. This time, QC wants to check both the upper limit and the lower limit for the paracetamol level.

What is the probability that the tablet selected by QC has a paracetamol level between 450 mg and 550 mg?

**Options**:
- 91%

- 93%

- 95%

- 97%

**Ans**:

- 95%
    - Let’s define X as the amount of paracetamol in the selected tablet. Now, X is a normally distributed random variable, with mean μ = 505 mg and standard deviation σ = 25 mg. Now, you have to find the probability of X being more than 450 and less than 550, i.e. P(450 < X < 550). Converting this to Z, you get P(450 < X < 550) = P({450-505}/25 < Z < {550-505}/25) = P(-2.2 < Z < 1.8) = P(Z < 1.8) - P(Z < -2.2) = 0.9641 - 0.0139 = 0.9502, or 95%.
    

- P(450 < X < 550)?
- Mean(µ) = 505
- Standard deviation(σ) = 25
- Z = (X - µ) / σ
    - = (450 - 505) / 25 == -2.2
    - = (550 - 505) / 25 == 1.8
    - P(450 < X < 550) = P(Z < 1.8) - P(Z < -2.2)
    - = .96407 - .0139 == .95017 ~= 95%
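
The three practice answers can be cross-checked in a few lines. This is a sketch under the assumptions stated in the questions (normally distributed paracetamol levels with the given batch means and standard deviations); it uses `statistics.NormalDist` from Python 3.8+, and the variable names are illustrative.

```python
from statistics import NormalDist

batch_z2 = NormalDist(mu=510, sigma=20)  # Batch Z2, in mg
batch_y4 = NormalDist(mu=505, sigma=25)  # Batch Y4, in mg

q1 = batch_z2.cdf(550)                       # P(X < 550)
q2 = 1 - batch_z2.cdf(450)                   # P(X > 450)
q3 = batch_y4.cdf(550) - batch_y4.cdf(450)   # P(450 < X < 550)

for name, p in [("Q1", q1), ("Q2", q2), ("Q3", q3)]:
    print(f"{name}: {p:.4f}")
```

For "greater than" questions, the cumulative probability is subtracted from 1, because the cumulative distribution only gives the area to the left.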


# Graded Questions

![image.png](attachment:image.png)

- Figure 17 - Uniformly Distributed Variable (p = 0.1)

The graph you see above represents the PDF of a uniformly distributed random variable X. As you can see, the probability density is equal for all the possible values of X (-5 to +5).





#### Q: Uniform Distribution
What is the probability of the random variable X lying between -1.5 and +2.5, i.e. P(-1.5<X<2.5)?



**Options**:
- 0.1

- 0.4

- 4.0

- 0.6

**Ans**:
- 0.4
    - The probability of the variable lying between -1.5 and 2.5 would be equal to the area under the PDF, between X = -1.5 and X = 2.5. This would be equal to the area of a rectangle, with breadth 0.1 and length 2.5 - (-1.5) = 4. Multiplying the two, you get the area of the rectangle, which is equal to 0.1*4 = 0.4.
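
The rectangle calculation generalises to any interval of a uniform variable. A minimal sketch, with the -5 to +5 range as default arguments and `uniform_interval_probability` as an illustrative name:

```python
def uniform_interval_probability(a, b, low=-5.0, high=5.0):
    """P(a < X < b) for a uniform variable on [low, high]."""
    density = 1.0 / (high - low)           # height of the rectangle
    left, right = max(a, low), min(b, high)  # clip the interval to the range
    return max(right - left, 0.0) * density

print(uniform_interval_probability(-1.5, 2.5))
```

Clipping to the range keeps the result between 0 and 1, which is why 4.0 cannot be a valid answer.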


The **normal distribution**, aka the **Gaussian distribution**, is named after **Carl Friedrich Gauss**, who used it in 1809 to model **astronomical errors**. Astronomical errors are the errors made by astronomers while observing phenomena such as distances in space.


For example, Gauss found that an astronomer trying to estimate the distance between Earth and Uranus always makes an error. This **error** is **normally distributed**, with **µ = 0 km** and **σ = 1,000 km**.

#### Q: Astronomical Error
Based on the information above, what is the probability of the astronomer overestimating the distance by 2,330 km or more?

**Options**:
- 1%


- 2%


- 1.5%


- 0.5%

**Ans**:
- 1%
    - Let’s define X as the astronomical error, which is normally distributed with mean 0 km and standard deviation 1,000 km. Now, you have to find the probability that X > 2330, i.e. P(X>2330). Converting this to Z, it becomes P(Z>2.33). Since P(Z<=2.33) + P(Z>2.33) = 1, P(Z>2.33) = 1 - P(Z<2.33) = 1 - 0.9901 = 0.0099 or 0.99%, which is approximately 1%.


- P(X > 2330)?
- Mean(µ) = 0
- Standard deviation(σ) = 1000
- Z = (X - µ) / σ
    - = (2330 - 0)/1000 == 2.33
    - = P(Z > 2.33) 
    - = 1 - P(Z < 2.33) = 1 - .99010 ~= .0099 == .99% ~= 1%


#### Q: Astronomical Error
Hence, what is the probability that the astronomer under- or over-estimates the distance by less than 500 km?



**Options**:
- 30.85%


- 69.15%


- 38.30%


- 48.25%

**Ans**:
- 38.30%
    - Let’s define X as the astronomical error, which is normally distributed with mean 0 km and standard deviation 1,000 km. Now, you have to find the probability that -500 < X < 500, i.e. P(-500 < X < 500). Converting this to Z, it becomes P(-0.5 < Z < 0.5) = P(Z < 0.5) - P(Z < -0.5) = 0.6915 - 0.3085 = 0.3830, or 38.30%.
    

- P(-500 < X < 500)?
- Mean(µ) = 0
- Standard deviation(σ) = 1000
- Z = (X - µ) / σ
    - = (500 - 0)/1000 == 0.5
    - = P(Z < 0.5) == .69146 
    - = (-500 - 0)/1000 == -0.5
    - = P(Z < -0.5) == .30854
    - = P(Z < 0.5) - P(Z < -0.5)
    - = .69146 -  .30854
    - = .38292 ~= 38.30% (the four-decimal table values 0.6915 - 0.3085 give 0.3830)
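
Both astronomical-error answers can be verified without a Z-table. The sketch below takes the mean of 0 km and standard deviation of 1,000 km from the example above and assumes Python 3.8+ with `statistics.NormalDist`; the variable names are illustrative.

```python
from statistics import NormalDist

error_km = NormalDist(mu=0, sigma=1000)

# Overestimating by 2,330 km or more: the right tail beyond 2,330.
p_over_2330 = 1 - error_km.cdf(2330)

# Being off by less than 500 km in either direction.
p_within_500 = error_km.cdf(500) - error_km.cdf(-500)

print(f"P(X > 2330) = {p_over_2330:.4f}")
print(f"P(-500 < X < 500) = {p_within_500:.4f}")
```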


Thank you 🙏🏻