# Examples: M248 Book A

-----

A collection of examples based on the topics covered by **Book A** of **M248: Analysing data**.

## Contents

### 1. General probability distributions

1. [Probability density function](#1.1_Probability_density_functions)
2. Probability mass function

### 2. Standard discrete distributions

1. Bernoulli distribution
2. Binomial distribution
3. Discrete uniform distribution
4. Geometric distribution
5. Poisson distribution

### 3. Standard continuous distributions

1. Continuous uniform distribution
2. Exponential distribution

### 4. Bernoulli and Poisson processes

1. Bernoulli process
2. Poisson process

### 5. Population quantiles

1. Population quantile of a continuous distribution
2. Population quantile of a discrete distribution

-----

In [1]:
import scipy.stats as stats
from scipy.integrate import quad

## 1. General probability distributions

### 1.1 Probability density functions

**Example 1.1.1.** Suppose that a random variable $X$ has range $(0,1)$ and that its p.d.f. is given by

$$f(x) = \frac{3}{4} (x^{2} + 1), \hspace{3mm} x \in (0,1).$$

What is the value of $P(1/4 \leq X \leq 1/2)$?

The probability is given by the integral

$$
P(1/4 \leq X \leq 1/2)
  = \int_{1/4}^{1/2} f(x) \> dx
  = \int_{1/4}^{1/2} \frac{3}{4} (x^{2} + 1) \> dx
  = \cdots
$$

In [5]:
# define a new function
def pdf(x: float):
    return 0.75 * ((x ** 2) + 1)

# integrate the function using quad
quad(func=pdf, a=0.25, b=0.5)[0]  # quad returns (value, error estimate); keep the value

0.21484375000000003
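For comparison with the numerical result, the elided steps can be completed by hand. The working below is a sketch added for reference:

$$
\begin{aligned}
\int_{1/4}^{1/2} \frac{3}{4} (x^{2} + 1) \> dx
  &= \frac{3}{4} \bigg[ \frac{1}{3} x^{3} + x \bigg]_{1/4}^{1/2} \\
  &= \frac{3}{4} \bigg( \frac{13}{24} - \frac{49}{192} \bigg) \\
  &= \frac{55}{256} \simeq 0.2148.
\end{aligned}
$$

The trailing digit in the `quad` output is floating-point rounding; the exact value is $55/256 = 0.21484375$.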

**Example 1.1.2.** Suppose that a random variable $X$ has range $(0,4)$ and that its p.d.f. is given by

$$f(x) = \frac{3}{40} (\sqrt{x} + 2), \hspace{3mm} x \in (0,4).$$

What is the value of $P(X \geq 1)$?

The probability is given by the integral

$$
P(X \geq 1)
  = \int_{1}^{4} f(x) \> dx
  = \int_{1}^{4} \frac{3}{40} (\sqrt{x} + 2) \> dx
  = \cdots
$$

In [6]:
# define a new function
def pdf(x: float):
    return (3/40) * ((x ** (0.5)) + 2)

# integrate the function using quad
quad(func=pdf, a=1, b=4)[0]  # quad returns (value, error estimate); keep the value

0.8
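As a sanity check on the `quad` result, the same probability can be computed from an antiderivative of $f$, namely $\frac{1}{20}(x^{3/2} + 3x)$. The sketch below also confirms that $f$ integrates to $1$ over $(0,4)$. The helper name `antiderivative` is an illustrative choice, not something defined elsewhere in these notes.

```python
def antiderivative(x: float) -> float:
    """An antiderivative of f(x) = (3/40)(sqrt(x) + 2): (x^(3/2) + 3x)/20."""
    return (x ** 1.5 + 3 * x) / 20

# total probability over the range (0, 4); should be 1 for a valid p.d.f.
total = antiderivative(4) - antiderivative(0)

# P(X >= 1) is the area under f between 1 and 4
prob = antiderivative(4) - antiderivative(1)

print(total, prob)
```

Both values should agree with the numerical integration to within floating-point rounding.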

**Example 1.1.3.** Suppose that a continuous random variable $X$ can only take values in the range $-1$ to $1$. The following is a function $f(x)$ of $x$:

$$f(x) = 1 - x^{2}.$$

Is $f(x)$ a valid p.d.f. for $X$?

Check the two properties of a valid p.d.f. over the range of $X$:

(1) $\int_{-1}^{1} f(x) \> dx = 1$

(2) $f(x) \geq 0$ for all $x \in (-1,1)$


In [8]:
# define a new function
def pdf(x: float):
    return 1 - (x ** 2)

# integrate the function using quad
quad(func=pdf, a=-1, b=1)[0]  # quad returns (value, error estimate); keep the value

1.3333333333333335

The integral is $4/3$, not $1$, so property (1) fails and $f(x)$ is not a valid p.d.f.

Does a normalising constant $k$ exist such that $k \> f(x)$ is a valid p.d.f.?

Solve the following for $k$:

$$
\begin{aligned}
    1 &= k \int_{-1}^{1} (1 - x^{2}) \> dx \\
      &= k \bigg( \frac{4}{3} \bigg) \\
    k &= \frac{3}{4}.
\end{aligned}
$$
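Since $1 - x^{2} \geq 0$ on $(-1,1)$, the rescaled function $\frac{3}{4}(1 - x^{2})$ satisfies both properties of a p.d.f. A minimal numerical sketch of that check follows. It uses a hand-written composite Simpson's rule instead of `quad`, so it runs with the standard library only; the helpers `simpson` and `scaled_pdf` are illustrative names, not SciPy functions.

```python
def simpson(func, a: float, b: float, n: int = 1000) -> float:
    """Composite Simpson's rule with n (even) subintervals."""
    if n % 2:
        raise ValueError("n must be even")
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        # interior points alternate between weights 4 and 2
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3

def scaled_pdf(x: float) -> float:
    """k * f(x) with k = 3/4 and f(x) = 1 - x^2."""
    return 0.75 * (1 - x ** 2)

# property (1): total area under the curve over (-1, 1)
area = simpson(scaled_pdf, -1, 1)

# property (2): non-negativity, spot-checked on a grid over [-1, 1]
grid = [-1 + i / 500 for i in range(1001)]
non_negative = all(scaled_pdf(x) >= 0 for x in grid)

print(round(area, 10), non_negative)
```

The grid check is only a spot check; the algebraic argument above is what establishes non-negativity.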

**Example 1.1.4.** Suppose that a continuous random variable $X$ can only take values in the range $1$ to $4$. The following is a function $f(x)$ of $x$:

$$f(x) = (x - 7)^{2}.$$

Is $f(x)$ a valid p.d.f. for $X$?

Check the two properties of a valid p.d.f. over the range of $X$:

(1) $\int_{1}^{4} f(x) \> dx = 1$

(2) $f(x) \geq 0$ for all $x \in (1,4)$

In [9]:
# define a new function
def pdf(x: float):
    return (x - 7) ** 2

# integrate the function using quad
quad(func=pdf, a=1, b=4)[0]  # quad returns (value, error estimate); keep the value

63.0

The integral is $63$, not $1$, so property (1) fails and $f(x)$ is not a valid p.d.f.

Does a normalising constant $k$ exist such that $k \> f(x)$ is a valid p.d.f.?

Solve the following for $k$:

$$
\begin{aligned}
    1 &= k \int_{1}^{4} (x - 7)^{2} \> dx \\
      &= 63k \\
    k &= \frac{1}{63}.
\end{aligned}
$$
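The value $63$ returned by `quad` can be confirmed by hand, integrating over the range $(1,4)$ of $X$. The working below is a sketch added for reference:

$$
\begin{aligned}
\int_{1}^{4} (x - 7)^{2} \> dx
  &= \bigg[ \frac{1}{3} (x - 7)^{3} \bigg]_{1}^{4} \\
  &= \frac{1}{3} \big( (-3)^{3} - (-6)^{3} \big) \\
  &= \frac{1}{3} (-27 + 216) = 63.
\end{aligned}
$$

Because $(x - 7)^{2} \geq 0$ everywhere, the rescaled function $\frac{1}{63} (x - 7)^{2}$ is non-negative on $(1,4)$ and integrates to $1$, so it is a valid p.d.f.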

**Example 1.1.5.** Suppose that a continuous random variable $X$ has range $(0,1)$ and that its p.d.f. is given by

$$f(x) = \frac{3}{4} (x^{2} + 1), \hspace{3mm} x \in (0,1).$$

What is the c.d.f. associated with $X$?

The c.d.f. of $X$, $F(x)$, is

$$
\begin{aligned}
F(x) = \int_{0}^{x} f(y) \> dy = \int_{0}^{x} \frac{3}{4} (y^{2} + 1) \> dy &= \frac{3}{4} \int_{0}^{x} (y^{2} + 1) \> dy \\
  &= \frac{3}{4} \bigg[ \frac{1}{3} y^{3} + y \bigg]_{0}^{x} \\
  &= \frac{3}{4} \bigg\{ \frac{1}{3} x^{3} + x - \bigg( 0 + 0 \bigg) \bigg\} \\
  &= \frac{1}{4} ( x^{3} + 3x ).
\end{aligned}
$$
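A quick way to test a derived c.d.f. is to confirm that it runs from $0$ to $1$ across the range and that differences of $F$ reproduce a known interval probability. The sketch below does this with exact rational arithmetic from Python's `fractions` module, comparing against the interval $(1/4, 1/2)$ used in Example 1.1.1.

```python
from fractions import Fraction

def cdf(x: Fraction) -> Fraction:
    """F(x) = (x^3 + 3x)/4 for x in (0, 1)."""
    return (x ** 3 + 3 * x) / 4

# F should rise from 0 at the lower end of the range to 1 at the upper end
lower, upper = cdf(Fraction(0)), cdf(Fraction(1))

# P(1/4 <= X <= 1/2) = F(1/2) - F(1/4)
interval = cdf(Fraction(1, 2)) - cdf(Fraction(1, 4))

print(lower, upper, interval, float(interval))
```

The interval probability comes out as $55/256 = 0.21484375$, matching the numerical integration in Example 1.1.1 up to floating-point rounding.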

**Example 1.1.6.** Suppose that a continuous variable $X$ with range $(0,3)$ has c.d.f. given by

$$F(x) = \frac{1}{27} x^{3}, \hspace{3mm} x \in (0,3).$$

What is $P(1 < X < 2)$?

Since $X$ is continuous, the probability $P(1 < X < 2)$ is

$$
\begin{aligned}
P(1 < X < 2) = P(X < 2) - P(X < 1) &= F(2) - F(1) \\
  &= \frac{1}{27} (2^{3}) - \frac{1}{27} (1^{3}) \\
  &= \frac{1}{27} (8 - 1) \\
  &= \frac{7}{27}.
\end{aligned}
$$

**Example 1.1.7.** Suppose that a continuous variable $X$ with range $(0,1)$ has c.d.f. given by

$$F(x) = \frac{1}{5} x^{2} (2x+3), \hspace{3mm} x \in (0,1).$$

What are $P(X < 1/2)$ and $P(X \geq 1/4)$?

Because $X$ is continuous, $P(X < 1/2) = P(X \leq 1/2) = F(1/2)$, so

$$
F(1/2) = \frac{1}{5} \bigg(\frac{1}{2}\bigg)^{2} \bigg( 2 \bigg(\frac{1}{2}\bigg) + 3 \bigg) = \cdots
$$

In [12]:
def cdf(x: float):
    return (0.2) * (x ** 2) * ((2 * x) + 3)

In [13]:
cdf(x=0.5)

0.2

Similarly, $P(X \geq 1/4) = 1 - F(1/4)$, so

$$
1 - F(1/4) = 1 - \bigg\{ \frac{1}{5} \bigg(\frac{1}{4}\bigg)^{2} \bigg( 2 \bigg(\frac{1}{4}\bigg) + 3 \bigg) \bigg\} = \cdots
$$

In [14]:
1 - cdf(x=0.25)

0.95625
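The two elided calculations can also be finished by hand, which gives exact fractions to compare with the outputs above. This working is a sketch added for reference:

$$
\begin{aligned}
F(1/2) &= \frac{1}{5} \cdot \frac{1}{4} \cdot 4 = \frac{1}{5} = 0.2, \\
1 - F(1/4) &= 1 - \frac{1}{5} \cdot \frac{1}{16} \cdot \frac{7}{2} = 1 - \frac{7}{160} = \frac{153}{160} = 0.95625.
\end{aligned}
$$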

### 1.2 Probability mass functions

## 2. Standard discrete distributions

### 1. Bernoulli distribution

### 2. Binomial distribution

### 3. Discrete uniform distribution

### 4. Geometric distribution

### 5. Poisson distribution

## 3. Standard continuous distributions

### 1. Continuous uniform distribution

### 2. Exponential distribution

## 4. Bernoulli and Poisson processes

### 1. Bernoulli process

### 2. Poisson process

## 5. Population quantiles

### 1. Population quantile of a continuous distribution

### 2. Population quantile of a discrete distribution