# Introduction: Central Limit Theorem

Welcome to the session on 'Central Limit Theorem'. In the last session, you learnt about probability density functions, specifically the normal and standard normal distributions.

#### In this session
In this session, you will learn what a sample is and why it is so error-prone. You will then learn how to quantify this error made in sampling, using a very popular theorem in statistics, called the central limit theorem.

# Samples
So far, you have conducted analysis for data on 75 people, 3,000 people, and so on. But what if you need to analyse a very large amount of data, e.g. data on 300,000 people? Or what if you need to do this for, say, the entire Indian population?


Let’s say that, for a business application, you want to find out the average number of times people in urban India visited malls last year. That’s 400 million (40 crore) people! You can't possibly go and ask every single person how many times they visited the mall. That’s a costly and time-consuming process. How can you reduce the time and money spent on finding this number?

Upcoming Topics:
1. Sampling terminology
2. Sampling distributions
3. Central limit theorem (CLT)
4. Error estimation using CLT

![clt1.jpg](attachment:clt1.jpg)

#### Q:  Population and Sample Mean
For our sampling activity, what is the sample mean (X̄) equal to? What is the population mean (μ) equal to?


**Options**:
- X̄ = 36.6, μ = 36.6

- X̄ = 36.6, μ = unknown

- X̄ = unknown, μ = 36.6

**Ans**:
- X̄ = 36.6, μ = unknown
    - Recall that the mean commute time for our sample (100 randomly selected employees) = 36.6 minutes. Also, recall that we did not know what the mean of the population (30,000 employees) was. So, the population mean is unknown.


# 1: Sampling terminology

![clt1.jpg](attachment:clt1.jpg)

![clt2.jpg](attachment:clt2.jpg)

- Sample Mean (X̄) = 36.6
- Population Mean (µ) = ?
- Sample Size (n) = 100
- Population size (N) = 30,000
- Sample Standard Deviation (S) = ?



To reiterate, these are the notations and formulae related to populations and their samples:


![image.png](attachment:image.png)

- Figure 1 - Notations and Formulae Related to Populations and Their Samples


The reason for dividing by n-1 and not n is not covered in this course. If you are still curious as to why that is the case, please refer to this link.
    - https://www.ma.utexas.edu/users/mks/M358KInstr/SampleSDPf.pdf
 

For an upcoming government project, you want to find the average height of the students in Class VIII of a given school. Instead of asking each student, suppose you took a few students as your sample and wrote the data down:


| Roll Number|	Height|
| --- | --- |
|8012|	121.92 cm|
|8045|	133.21 cm|
|8053|	141.34 cm|
|8099|	126.23 cm|
|8125|	175.74 cm|


#### Q1: School Kids Sample
What is the sample size for this sample?

**Options**:
- 5

- 40334

- 698.44

- 10


**Ans**:
- 5
    - In this sample, you have data on 5 students. Hence, the sample size is equal to 5.



#### Q2: School Kids Sample
What is the sample mean (X̄)?

**Options**:
- 133.21

- 139.69

- 146.76

- 141.34

**Ans**:
- 139.69
    - The sample mean (X̄) = (121.92 + 133.21 + 141.34 + 126.23 + 175.74)/5 = 139.69.



#### Q3: School Kids Sample
What will be the sample's standard deviation (S)?

**Options**:
- 22.84

- 23.67

- 19.19

- 21.45

**Ans**:
- 21.45
    - We found in the previous question that the sample mean (X̄) = 139.69. Now, we can find $\sum (X_i - \bar{X})^2$, which turns out to be 1841.26. Dividing this by n-1, i.e. 4, we get the value of $S^2$ as 460.315. The value of S, thus, will be equal to $\sqrt{460.315}$ = 21.45
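
The three answers above can be cross-checked with a few lines of NumPy. This is only a sketch: the variable names are illustrative, and `ddof=1` is what makes NumPy divide by n-1 rather than n.

```python
import numpy as np

# Heights (in cm) of the five sampled students from the table above
heights_cm = np.array([121.92, 133.21, 141.34, 126.23, 175.74])

n = heights_cm.size                  # sample size
sample_mean = heights_cm.mean()      # X-bar
sample_sd = heights_cm.std(ddof=1)   # S, dividing by n-1

print(n, round(sample_mean, 2), round(sample_sd, 2))
```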



# 2: Sampling Distributions

Now, let's move on to sampling distributions, whose properties, as we said earlier, will help you estimate the population mean from the sample mean.

![clt3.jpg](attachment:clt3.jpg)

So, the sampling distribution, specifically the sampling distribution of the sample means, is a probability density function for the sample means of a population.

 

This distribution has some very interesting properties, which will later help you estimate the sampling error. Let's take a look at these properties.


The sampling distribution’s mean is denoted by $\mu_{\bar{X}}$, as it is the mean of the sample means. Let’s see what it is for this sampling distribution.





# Properties of Sampling Distributions

We’ve been saying that the sampling distribution has some interesting properties that will later help you estimate the error in your samples. Let’s finally see what these properties are.

![clt4.jpg](attachment:clt4.jpg)

Again, to recap, let’s see what the notations and formulae are for populations, samples and sampling distributions.

![image.png](attachment:image.png)

- Figure 3 - Notations and Formulae Related to Sampling Distributions


So, there are two important properties for a sampling distribution of the mean:

1. **Sampling distribution’s mean ($\mu_{\bar{X}}$) = Population mean ($\mu$)**

2. Sampling distribution’s standard deviation **(Standard error)** = **σ/√n**, where σ is the population’s standard deviation and n is the sample size
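
A small simulation can make these two properties concrete. The sketch below is not the course's own experiment: it assumes a hypothetical, skewed population drawn from an exponential distribution, and the sample size and number of samples are arbitrary choices.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical population (assumption): 100,000 skewed values
population = rng.exponential(scale=10.0, size=100_000)
mu = population.mean()      # population mean
sigma = population.std()    # population standard deviation (divides by N)

n = 50                      # size of each sample
num_samples = 5_000         # number of samples drawn

# Draw many samples (with replacement) and record each sample's mean
samples = rng.choice(population, size=(num_samples, n))
sample_means = samples.mean(axis=1)

print("population mean            :", mu)
print("mean of sample means       :", sample_means.mean())
print("sigma / sqrt(n)            :", sigma / np.sqrt(n))
print("std. dev. of sample means  :", sample_means.std(ddof=1))
```

The two pairs of printed numbers should be close to each other, though not identical, because only a finite number of samples is drawn.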

 



# 3: Central Limit Theorem

Now, let's look at the third property of sampling distributions, which concerns their shape. As a rule of thumb, for a sample size n > 30, the sampling distribution of the mean is approximately normal. Let's recall all three properties you have learnt so far for sampling distributions.

So, the central limit theorem says that, for any kind of data, provided the sample size is large enough, the following properties hold true:

1. Sampling distribution’s mean ($\mu_{\bar{X}}$) = Population mean ($\mu$)

2. Sampling distribution’s standard deviation (Standard error) = σ/√n

3. For n > 30, the sampling distribution becomes approximately a normal distribution

- This is called **"Central Limit Theorem"**.
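
One way to see the third property is to measure how lopsided the distribution of sample means is for a small and a large sample size. The sketch below assumes a hypothetical exponential population (which is strongly right-skewed) and uses skewness as a rough indicator, since a normal distribution has skewness 0. The helper name `skew_of_sample_means` is made up for this illustration.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)

def skew_of_sample_means(n, num_samples=10_000):
    """Draw num_samples samples of size n and return the skewness of their means."""
    draws = rng.exponential(scale=1.0, size=(num_samples, n))
    return stats.skew(draws.mean(axis=1))

skew_small = skew_of_sample_means(2)    # small samples: still clearly skewed
skew_large = skew_of_sample_means(50)   # n > 30: much closer to symmetric

print("skewness for n = 2 :", skew_small)
print("skewness for n = 50:", skew_large)
```

The skewness shrinks as n grows, which is consistent with the sampling distribution moving towards a normal shape. It does not reach exactly 0 at n = 30; the threshold is a rule of thumb.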

# Summary: Central Limit Theorem - Part I

This is a really intense session! Let’s summarise everything that's been taught so far, and then you can move on to the rest of the session.

 

First, you saw how, instead of finding the mean and standard deviation for the entire population, it is sometimes beneficial to find the mean and standard deviation for only a small representative sample. You may have to do this because of time and/or money constraints.

 

For example, for an office of 30,000 employees, we wanted to find the average commute time. So, instead of asking all employees, we asked only 100 of them and collected the data. The mean = 36.6 minutes and the standard deviation = 10 minutes.

 

However, it would not be fair to infer that the population mean is exactly equal to the sample mean. This is because the flaws of the sampling process must have led to some error. Hence, the sample mean’s value has to be reported with some margin of error.

 

For example, the mean commute time for the office of 30,000 employees would be equal to 36.6 ± 3 minutes, 36.6 ± 1 minute, or 36.6 ± 10 minutes, i.e. 36.6 minutes ± some margin of error.

 

However, at this point in time, you do not exactly know how to find what this margin of error is.

 

Then, we moved on to sampling distributions, which have some properties that would help you find this margin of error.

![image.png](attachment:image.png)

- Figure 4 - Creation of Sampling Distribution


We created a sampling distribution, which was a probability density function for 100 sample means with sample size 5.

 

The sampling distribution, which is basically the distribution of the sample means, has some interesting properties, which are collectively called the central limit theorem. It states that no matter how the original population is distributed, the sampling distribution will follow these three properties:

1. Sampling distribution’s mean ($\mu_{\bar{X}}$) = Population mean ($\mu$)

2. Sampling distribution’s standard deviation (Standard error) = $\frac{\sigma}{\sqrt{n}}$, where $\sigma$ is the population’s standard deviation and n is the sample size

3. For n > 30, the sampling distribution becomes a normal distribution



To verify these properties, we performed sampling using the data collected for our UpGrad game from the first session on inferential statistics. The values for the sampling distribution thus created ($\mu_{\bar{X}}$ = 2.348, S.E. = 0.4248) were pretty close to the values predicted by theory ($\mu_{\bar{X}}$ = 2.385, S.E. = 0.44).


To summarise, the notations and formulae for populations, samples and sampling distributions are as follows:

![image.png](attachment:image.png)

- Figure 5 - Commonly Used Notations and Formulae



# Practice Questions - Part I

Let’s say that you work for a news agency, which is conducting an exit poll for the MCD (Municipal Corporation of Delhi) elections. You have been tasked with predicting the winner for ward 75N (Ashok Vihar). You asked 100 randomly selected voters from this ward to name the party they had voted for.

 

The data thus collected is as given in the following table.

 

| Contesting Party |	Number of Voters |
| --- | --- |
| BJP	| 58 |
| INC	| 42 |
 

From this sample, you have to estimate the percentage of voters that might have voted for BJP.


So, you **define X so that it equals 1 for a voter who voted for BJP and 0 otherwise**; the mean of X is then the proportion of people that voted for BJP. The frequency distribution for X would be:

 

| X | Frequency |
|--- | --- |
|1|	58|
|0|	42|
 

Now, you have to find the mean for X, which is equal to (0+0+0+... 42 times) + (1+1+1+... 58 times), divided by the total frequency, i.e. 100. So, the **mean** = $\frac{(0 \times 42)+(1 \times 58)}{100} = 0.58$ or 58%.


Also, you would have to find the standard deviation for this sample of 100 voters. Since the mean is 0.58, the sample’s variance = $\frac{((0-0.58)^{2} \times 42)+((1-0.58)^{2} \times 58)}{100-1} = 0.2461$. So, the **standard deviation** is equal to its square root, i.e. 0.496 or **49.6%**.
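
The same mean and standard deviation can be reproduced by expanding the frequency table into 100 individual 0/1 values. This is a minimal sketch; the array name `votes` is only illustrative.

```python
import numpy as np

# 58 voters with X = 1 (voted BJP) and 42 voters with X = 0
votes = np.array([1] * 58 + [0] * 42)

mean_x = votes.mean()        # proportion of BJP voters in the sample
sd_x = votes.std(ddof=1)     # sample standard deviation (divides by n-1)

print(round(mean_x, 2), round(sd_x, 3))
```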
 

#### Q:  Voter Sample
Now, let’s say you define Y as the proportion of people that voted for the INC party. Clearly, the mean of Y will be 0.42. What would be the standard deviation of Y?

**Options**:
- 24.61%

- 49.6%

- 50.4%

- 75.39%

**Ans**:
- 49.6%
    - There would be 42 people for whom Y would be equal to 1, and 58 people for whom Y would be equal to 0. Hence, the sample’s variance = $\frac{((1-0.42)^{2} \times 42)+((0-0.42)^{2} \times 58)}{100-1} = 0.2461$. So, the standard deviation is equal to its square root, i.e. 0.496 or 49.6%


#### Q: Voter Sample
Let’s say you actually took some more samples, each of size 100, and made a sampling distribution for X, the proportion of people that voted for BJP.

The mean of this sampling distribution is $\mu_{\bar{X}}$ = 0.50 and the standard error = 0.052.

So, for the whole survey population, i.e. all the voters in ward 75N, the proportion of people that voted for BJP is approximately equal to:

**Options**:
- 50%

- 55.8%

- 44.2%

- 10.2%

**Ans**:
- 50%
    - You have to find the mean for all the people or, in other words, the population mean. Using CLT, you can say that the mean of the sampling distribution = the population mean. As the question says that the sampling distribution’s mean = 0.5, the population mean also = 0.5 or 50%



#### Q: Voter Sample
Similarly, let’s say you made a sampling distribution for Y, the proportion of people that voted for INC.

The mean of this sampling distribution is $\mu_{\bar{X}}$ = 0.50 and the standard error is equal to 0.048.

Now, for the whole survey population, i.e. all the voters in ward 75N, the standard deviation of Y is equal to:

**Options**:
- 50%

- 4.8%

- 48%

- 10.4%

**Ans**:
- 48%
    - You have to find the standard deviation for all people or, in other words, the population standard deviation σ. Using CLT, you can say that (σ/√n) = SE. Hence, (σ/√100) = 0.048, which gives σ = 0.48 or 48%

# Estimating Mean Using CLT

Now that you have gone through the mid-session summary, let’s get back to the rest of the session. Earlier, we tried to estimate the mean commute time of 30,000 employees of an office by taking a small sample of 100 employees and finding their mean commute time. This sample’s mean was $\bar{X}$ = 36.6 minutes and its standard deviation was S = 10 minutes.

 

Recall that we also said that the **population mean**, i.e. daily commute time of all 30,000 employees **μ** = 36.6 (**sample mean**) ± some **margin of error**.

 

If you remember, you did not learn exactly how to find this margin of error. But now, you can find it by using CLT (central limit theorem). Now that you know CLT, let’s see exactly how you can do this.

 

#### Finding from the sampling distribution

- Sampling distribution's mean ($\mu_{\bar{X}}$) = Population mean (µ) {unknown}
- Sampling distribution's standard deviation (S.E.)
    - = σ/√n
    - ≈ S/√n
    - = 10/√100 = 1
- The sampling distribution is a normal distribution
- P(µ − 2 < 36.6 < µ + 2) = 95.4%, because 2 minutes corresponds to 2 standard errors
    - = P(36.6 − 2 < µ < 36.6 + 2)


---
- The probability associated with the claim is called the **confidence level**. Here, it is **95.4%**.
- The maximum error made in the sample mean is called the **margin of error**. Here, it is **2 minutes**.
- The final interval of values is called the **confidence interval**. Here, it is the range **(34.6, 38.6)**.
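
The 95.4% figure comes from the area of a normal distribution within 2 standard deviations of its mean. The sketch below recomputes it with `scipy.stats.norm`, under the same approximation used above (the population's σ is replaced by the sample's S); the variable names are illustrative.

```python
from scipy import stats

x_bar, s, n = 36.6, 10.0, 100
se = s / n ** 0.5            # standard error, using S in place of sigma
margin = 2.0                 # margin of error in minutes

# Probability that a normal variable lies within margin/se standard errors of its mean
z = margin / se
confidence = stats.norm.cdf(z) - stats.norm.cdf(-z)

interval = (x_bar - margin, x_bar + margin)
print(round(confidence, 3), interval)
```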

#### Q: Mean Estimation using CLT
What is the probability that the sample mean that we have got (X̄ = 36.6 mins) is within the range (μ - 2) mins to (μ + 2) mins? (Report the answer as a number rounded to two digits after the decimal point)


**Ans**: 0.95


### Generalised approach for estimating the population mean from a sample:

- Sample mean (X̄)
- Sample standard deviation (S)
- Sample size (n)

- Confidence interval (y% confidence level) = $\left(\bar{X} - \frac{Z^{*} S}{\sqrt{n}},\ \bar{X} + \frac{Z^{*} S}{\sqrt{n}}\right)$

    - where Z* is the Z score associated with y% confidence level.

![image.png](attachment:image.png)


So, to summarise, let’s say you have a sample with sample size n, mean $\bar{X}$ and standard deviation S. Now, the y% confidence interval (i.e. the confidence interval corresponding to y% confidence level) for $\mu$ would be given by the range:

| | |
| --- | --- |
|Confidence interval = | ![image.png](attachment:image.png) |


where, Z* is the Z-score associated with a y% confidence level. In other words, the population mean and sample mean differ by a margin of error given by ![image-2.png](attachment:image-2.png).
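
The general formula translates directly into a small helper. This is a sketch only: the function name `confidence_interval` and its arguments are hypothetical, and Z* is passed in by the caller rather than looked up.

```python
from math import sqrt

def confidence_interval(x_bar, s, n, z_star):
    """Return (lower, upper) = x_bar -/+ z_star * s / sqrt(n)."""
    margin_of_error = z_star * s / sqrt(n)
    return x_bar - margin_of_error, x_bar + margin_of_error

# Commute-time sample from earlier, with Z* = 1.96 (95% confidence level)
low, high = confidence_interval(x_bar=36.6, s=10, n=100, z_star=1.96)
print(round(low, 2), round(high, 2))
```

Keeping Z* as an argument makes it easy to reuse the same helper for any confidence level.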



# Some commonly used Z* values are given below:

![image.png](attachment:image.png)

- Figure 6 - Z* Values for Commonly Used Confidence Levels
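
If a table of Z* values is not at hand, they can be computed from the standard normal distribution. The sketch below assumes a two-sided interval, so a y% confidence level leaves (100 − y)/2 percent in each tail; `z_values` is an illustrative name.

```python
from scipy import stats

z_values = {}
for level in (0.90, 0.95, 0.99):
    # Two-sided interval: find the point with (1 - level)/2 of the area above it
    z_values[level] = stats.norm.ppf(1 - (1 - level) / 2)

for level, z_star in z_values.items():
    print(f"{level:.0%} confidence level: Z* = {z_star:.3f}")
```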

At this point, it is important to address a very common misconception. Sampling distributions are just a theoretical exercise and you’re not actually expected to make one in real life. If you want to estimate the population mean, you will just take a sample. You will not create an entire sampling distribution.

 



You must be wondering — if this is the case, then why did you study sampling distributions? To understand the reason for this, let's go through the actual process of sampling. Recall that you are doing **sampling because you want to find the population mean**, albeit in the form of an interval. The three steps to follow are as follows:

1. First, take a sample of size n

2. Then, find the mean and standard deviation S of this sample

3. Now, you can say that for y% confidence level, the confidence interval for the population mean µ, is given by ![image.png](attachment:image.png)



However, as you may have seen in the video above, you cannot finish step 3 without CLT. CLT lets you assume that the sample mean would be normally distributed, with mean $\mu$ and standard deviation ![image.png](attachment:image.png) (approximately ![image-2.png](attachment:image-2.png)).

Using this assumption, it becomes possible to find many things such as margin of error, confidence interval, etc.

Hence, you learnt sampling distributions so that you could learn more about CLT and hence be able to make all the assumptions as stated above.

# Confidence Interval - Example


### Estimating mean lead content:

- n = 100
- Sample mean (X̄) = 2.3 ppm
- Sample standard deviation (S) = 0.3 ppm

- Population mean (µ) would be = 2.3 ± ??

- Confidence level = 99%
- Confidence interval = $\bar{X} \pm \frac{Z^{*} S}{\sqrt{n}}$
    - = 2.3 ± (2.576 × 0.3)/10, since Z* = 2.576 for a 99% confidence level
    - = (2.223, 2.377)

##### So, the population mean (µ) lies in (2.223, 2.377) with 99% confidence
- Since these values fall within the allowed range, we can say the product is safe to consume.
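
The arithmetic of this example can be checked in a few lines. The numbers are the ones given above; the variable names are only illustrative.

```python
from math import sqrt

n, x_bar, s = 100, 2.3, 0.3   # sample size, sample mean (ppm), sample std. dev. (ppm)
z_star = 2.576                # Z* for a 99% confidence level

margin_of_error = z_star * s / sqrt(n)
interval = (round(x_bar - margin_of_error, 3), round(x_bar + margin_of_error, 3))

print(round(margin_of_error, 4), interval)
```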


# Summary: Central Limit Theorem - Part II

First, you learnt how, using your knowledge of CLT, you can infer the population mean from the sample mean.

 

We estimated the mean commute time of 30,000 employees of an office, by taking a sample of 100 employees, finding their mean commute time, and estimating based on that value. Specifically, you were given a sample with sample mean X̄ = 36.6 minutes, and sample standard deviation S = 10 minutes.

 

Using CLT, you concluded that the sampling distribution for mean commute time would have:

1. Mean = μ {unknown}
2. Standard error = σ/√n ≈ S/√n =10/√100 = 1
3. Since n(100) > 30, the sampling distribution is a normal distribution
 

Using these properties, you were able to claim that the probability that the population mean μ lies between 34.6 (36.6-2) and 38.6 (36.6+2), is 95.4%.

 

Then, you learnt some terminology related to the claim:

1. The probability associated with the claim is called confidence level (here, it is 95.4%)
2. The maximum error made in sample mean is called margin of error (here, it is 2 minutes)
3. Final interval of values is called confidence interval [here, it is the range (34.6, 38.6)]


Then, you generalised the whole process. Let’s say you have a sample with sample size n, mean ![image-2.png](attachment:image-2.png) and standard deviation S. You learnt that the y% confidence interval (i.e. confidence interval corresponding to y% confidence level) for $\mu$ will be given by the range:

Confidence interval = ![image.png](attachment:image.png)

 

Where, Z* is the Z-score associated with a y% confidence level.

# Practice Questions - Part II

Let’s say you **work as a data analyst** at a **pharma company** which manufactures an antipyretic drug (tablet form) with **paracetamol** as the active ingredient. The amount of paracetamol specified by the drug regulatory authorities is **500 mg** with an **allowed error of 10%**. Anything **below 450 mg** would be a quality issue for your company since the drug would become ineffective, while anything **above 550 mg** would be a serious regulatory issue.

 

There are 10 identical manufacturing lines in the plant, each of which produces approximately 10,000 tablets per hour.

 

You want to take some samples, measure the amount of paracetamol, and test if the manufacturing process is running successfully. You have the resources and time to take a **sample of 100 tablets** and measure the paracetamol content in each.

 

For the 100 tablets sampled by you, you find that the **mean paracetamol content is 530 mg** and the **standard deviation is 100 mg**.


Now, you want to know what the average content is for all the tablets in the plant. You are thinking of reporting the average as a **confidence interval**, for which you are **95% confident**.

 

With this information, answer the questions given below.

- Sample size (n) = 100
- Sample mean (X̄) = 530 mg
- Sample standard deviation (S) = 100 mg
- Z* for 95% confidence level ≈ 1.96
- Z* for 90% confidence level ≈ 1.65


#### Q: Paracetamol Content
What is the MOE (margin of error) for 95% confidence level?

Margin of error = $\frac{Z^{*} S}{\sqrt{n}}$ = (1.96 × 100)/10

**Ans**: 19.6 mg

If X is defined as the paracetamol content, then for this sample of X, the sample mean X̄ = 530 mg, the sample standard deviation S = 100 mg and the sample size n = 100. Also, for a 95% confidence level, Z* is 1.96. Now, you know that the margin of error = $\frac{Z^{*} S}{\sqrt{n}} = \frac{1.96 \times 100}{\sqrt{100}} = 19.6$ mg.


#### Q:  Paracetamol Content
What is the confidence interval for 95% confidence level?

$\left(\bar{X} - \frac{Z^{*} S}{\sqrt{n}},\ \bar{X} + \frac{Z^{*} S}{\sqrt{n}}\right)$

(530 − (1.96 × 100)/10, 530 + (1.96 × 100)/10)

**Ans**:
510.4 to 549.6

As you know, the sample mean X̄ = 530 and the margin of error = 19.6. The confidence interval is (X̄ − margin of error, X̄ + margin of error). Putting in the values, you can calculate the confidence interval as (510.4, 549.6).



#### Q: Paracetamol Content
What is the confidence interval for 90% confidence level?

$\left(\bar{X} - \frac{Z^{*} S}{\sqrt{n}},\ \bar{X} + \frac{Z^{*} S}{\sqrt{n}}\right)$

(530 − (1.65 × 100)/10, 530 + (1.65 × 100)/10)

**Ans**:
513.5 to 546.5

As you know, the sample mean X̄ = 530, S = 100 and n = 100. Also, for a 90% confidence level, Z* is 1.65. The confidence interval is (X̄ − margin of error, X̄ + margin of error), where the margin of error = (1.65 × 100)/10 = 16.5. Putting in the values, you can calculate the confidence interval as (513.5, 546.5).
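
The three paracetamol answers can be reproduced together. The sketch below uses the rounded Z* values quoted in this section (1.96 and 1.65); the dictionary `results` is just an illustrative way to hold the output.

```python
from math import sqrt

x_bar, s, n = 530.0, 100.0, 100    # sample mean (mg), sample std. dev. (mg), sample size
z_star_by_level = {"95%": 1.96, "90%": 1.65}

results = {}
for level, z_star in z_star_by_level.items():
    moe = z_star * s / sqrt(n)                       # margin of error in mg
    results[level] = (moe, x_bar - moe, x_bar + moe)

for level, (moe, low, high) in results.items():
    print(f"{level}: MOE = {moe:.1f} mg, interval = ({low:.1f}, {high:.1f})")
```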



# Graded Questions
Question 1

An exit poll was conducted by a news agency for the MCD (Municipal Corporation of Delhi) elections. You had been tasked with predicting the winner for **ward 75N** (Ashok Vihar). For that purpose, you asked **100 randomly selected voters** from your ward to name the party they had voted for.

 

The data that was thus collected is given in the following table:

 

|Contesting Party|Mean (Percentage of Voters)|Standard Deviation (Percentage of Voters)|
| --- | --- | --- |
|BJP|	58%|	49.6%|
|INC|	42%|	49.6%|
 

Now, instead of taking more samples and forming a sampling distribution, which would be a tedious and expensive task, let's say you decide to use your knowledge of CLT to find the winner.


First, you would have to form a **95% confidence interval** for the **percentage of voters that voted for BJP**. Then, you would form another such **confidence interval for INC**, after which you would be able to compare the two to predict the winner.

- BJP
    - Confidence level = 95% (Z* ≈ 1.96)
    - Sample size (n) = 100
    - Sample mean (X̄) = 0.58 = 58%
    - Sample standard deviation (S) = 0.496
- INC
    - Confidence level = 95% (Z* ≈ 1.96)
    - Sample size (n) = 100
    - Sample mean (X̄) = 0.42 = 42%
    - Sample standard deviation (S) = 0.496
    

#### Q1: Exit Poll Confidence Interval
So, you define X as the proportion of people that voted for BJP. Now, for a 95% confidence interval, what would be the value of the MOE (margin of error)?

- MOE (margin of error)
    - = $\frac{Z^{*} S}{\sqrt{n}}$ = (1.96 × 0.496)/10
    - = 0.0972
    - = 9.72%

**Ans**:
9.72%

The margin of error = $\frac{Z^{*} S}{\sqrt{n}}$. From the data given above, you know that S = 49.6% or 0.496 and n = 100. The value of Z* corresponding to 95% confidence is 1.96. So, the margin of error = (1.96 × 0.496)/√100 = 0.0972, or 9.72%


#### Q2: Exit Poll Confidence Interval
What would the 95% confidence interval be for X, the proportion of people that voted for BJP?



**Ans**:
(48.28%, 67.72%)

Recall that the margin of error = 9.72% and the sample mean X̄ = 58%. So, the confidence interval is (58% − 9.72%, 58% + 9.72%)


#### Q3: Exit Poll Confidence Interval
What would the 95% confidence interval be for Y, the proportion of people that voted for INC?

- MOE (margin of error)
    - = $\frac{Z^{*} S}{\sqrt{n}}$ = (1.96 × 0.496)/10
    - = 0.0972
    - = 9.72%

**Ans**:
(32.28%, 51.72%)

For the variable Y, the sample mean X̄ = 42%, the sample’s standard deviation S = 49.6% and the sample size n = 100. So, the 95% confidence interval = $\left(\bar{X} - \frac{Z^{*} S}{\sqrt{n}},\ \bar{X} + \frac{Z^{*} S}{\sqrt{n}}\right)$. Putting in the values, you get that the confidence interval is (32.28%, 51.72%).


#### Q4: Exit Poll Confidence Interval
Thus, with 95% confidence, you can:

**Options**:
- Claim that BJP will win the election for ward 75N

- Claim that INC will win the election for ward 75N

- Not make any claims regarding the election results for ward 75N


**Ans**:
- Not make any claims regarding the election results for ward 75N
    - The percentage of voters that voted for BJP, falls within the interval (48.28%, 67.72%). The interval for the percentage of voters that voted for INC is (32.28%, 51.72%). Hence, anything is possible. It is possible that BJP gets 67.72% of the votes and wins the election, but it is also possible that INC could get 51.72% of the votes and win the election for ward 75N. Hence, you cannot decide the winner based on this information.
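
The reasoning in this answer amounts to checking whether the two confidence intervals overlap. The sketch below recomputes both intervals from the table and performs that check; the helper `ci` and the overlap test are illustrative, not a formal hypothesis test.

```python
from math import sqrt

def ci(x_bar, s, n, z_star=1.96):
    """Confidence interval x_bar -/+ z_star * s / sqrt(n); default Z* is for 95%."""
    moe = z_star * s / sqrt(n)
    return x_bar - moe, x_bar + moe

bjp = ci(0.58, 0.496, 100)   # proportion that voted for BJP
inc = ci(0.42, 0.496, 100)   # proportion that voted for INC

# The intervals overlap if the lower end of BJP's is below the upper end of INC's
overlap = bjp[0] <= inc[1]

print([round(v * 100, 2) for v in bjp], [round(v * 100, 2) for v in inc], overlap)
```

Because the intervals overlap, this sample alone does not separate the two parties at the 95% confidence level.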


# Coding Questions(Graded)

Following are some Python-based graded questions for your practice. All the questions below are graded.

Note: Please use the following command in your code after pandas import statement to make sure that your dataframe is printed properly:

pd.set_option('display.max_columns', 500)

Missing this might lead to not all columns being printed and the correct code being rejected.

# Generating Binomial Distribution

Description
Generate a binomial distribution, tested 10 times, given the number of trials(n) and probability(p) of each trial.
The input will contain seed, n and p in the same order.
The output should contain a numpy array with 10 numbers representing the required binomial distribution.
Hint: You can use numpy's random number generator here too. Remember to set the seed before generating numbers to ensure correct output.
Sample Input:
0
10
0.5
Sample Output:

[5 6 5 5 5 6 5 7 8 5]


In [3]:
import numpy as np 
seed=int(input())
n=int(input())
p=float(input())
np.random.seed(seed)
s = np.random.binomial(n, p, 10)
print(s)

0
10
0.5
[5 6 5 5 5 6 5 7 8 5]


# Coloured cards
Description
A card stack contains white and black cards. Two cards are drawn randomly without replacement. The probability of selecting a white and then a black card is x. The probability of selecting a white card in the first draw is y. You have to find the probability of drawing a black card, given that the first card drawn was white.
The input will contain two lines with x and y respectively.  
The output should be displayed as a float(no need to round it off).
- Sample Input:
    - 0.2
    - 0.5
- Sample Output:
    - 0.4


In [11]:
x=float(input())
y=float(input())
#write your code here
result = x/y
print(result)

0.2
0.5
0.4


# Entrance Test

Description
Consider that the scores of an entrance test are normally distributed. 
If the test has a mean of m and a standard deviation of s.
You have set the cut off as 90 marks for the test. What is the probability of a student passing the test?
The input will contain m and s in two lines in the same order. The output has to be rounded off to two decimal places.

Please note that if the answer is 3.6, it is to be reported as 3.6 and not 3.60

Sample Input:
100
10
Sample Output:
0.84

In [8]:
import scipy.stats
m=float(input())#mean
sd=float(input())#standard deviation
print(round(1-scipy.stats.norm(m,sd).cdf(90),2))

100
10
0.84


# Seating Arrangement

Description
N people sit around a circular table. You have to find the probability that two particular people won't be sitting together.

The input will have the number N and the output should have the probability printed as a float type number rounded off to four decimal places.

Hint: This problem follows the logic of circular permutation which you can read about here.

Please note that if the answer is 3.6, it is to be reported as 3.6 and not 3.6000

Sample Input:
15

Sample Output:
0.8571

In [9]:
n=int(input())
import math
print(round(1-math.factorial(n-2)*math.factorial(2)/math.factorial(n-1),4))

#or 

# n=int(input())
# result = round((n - 3)/(n - 1), 4)
# print(result)


15
0.8571


Thank you 🙏🏻