'''
 * Copyright (c) 2010 Radhamadhab Dalai
 *
 * Permission is hereby granted, free of charge, to any person obtaining a copy
 * of this software and associated documentation files (the "Software"), to deal
 * in the Software without restriction, including without limitation the rights
 * to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
 * copies of the Software, and to permit persons to whom the Software is
 * furnished to do so, subject to the following conditions:
 *
 * The above copyright notice and this permission notice shall be included in
 * all copies or substantial portions of the Software.
 *
 * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
 * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
 * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
 * AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
 * LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
 * OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
 * THE SOFTWARE.
'''

# 1.14 Credibility Estimates

In actuarial studies, a credibility estimate is one which can be expressed as a weighted average of the form

$$
C = (1 - k) A + kB,
$$

where:

- $A$ is the subjective estimate (or the collateral data estimate),
- $B$ is the objective estimate (or the direct data estimate),
- $k$ is the credibility factor, a number between 0 and 1 (inclusive) that represents the weight assigned to the objective estimate.

A high value of $k$ implies $C \approx B$, representing a situation where the objective estimate is assigned ‘high credibility’. A primary aim of credibility theory is to determine an appropriate value or formula for $k$, as is done, for example, in the theory of the Bühlmann model (Bühlmann, 1967). Many Bayesian models lead to a point estimate which can be expressed as an intuitively appealing credibility estimate.


# Exercise 1.16: Credibility Estimation in the Binomial-Beta Model

Consider the binomial-beta model given by:

$$
(y | \theta) \sim \text{Binomial}(n, \theta)
$$

$$
\theta \sim \text{Beta}(\alpha, \beta)
$$

## Posterior Distribution

We previously established that the posterior distribution of $\theta$ given $y$ is:

$$
(\theta | y) \sim \text{Beta}(\alpha + y, \beta + n - y)
$$

Thus, the posterior mean of $\theta$ is:

$$
\hat{\theta} = E(\theta | y) = \frac{\alpha + y}{(\alpha + y) + (\beta + n - y)} = \frac{\alpha + y}{\alpha + \beta + n}
$$
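The conjugate update can be sketched in a few lines of Python. This is a minimal illustration, assuming integer-valued $\alpha$, $\beta$, $n$ and $y$ so that exact fractions can be used; the function name `beta_binomial_posterior` and the example values are hypothetical and chosen only for demonstration.

```python
from fractions import Fraction


def beta_binomial_posterior(alpha, beta, n, y):
    """Return the posterior Beta parameters and the posterior mean of theta.

    Assumes integer alpha, beta, n, y (an assumption made here so that
    Fraction arithmetic stays exact).
    """
    if not 0 <= y <= n:
        raise ValueError("y must lie between 0 and n")
    a_post = alpha + y        # first posterior shape parameter
    b_post = beta + n - y     # second posterior shape parameter
    mean = Fraction(a_post, a_post + b_post)
    return a_post, b_post, mean


# Illustrative values only: a uniform Beta(1, 1) prior and 3 successes in 10 trials.
a_post, b_post, mean = beta_binomial_posterior(alpha=1, beta=1, n=10, y=3)
print(f"Posterior: Beta({a_post}, {b_post}), mean = {mean}")
```

With these inputs the posterior is $\text{Beta}(4, 8)$, whose mean reduces to $1/3$.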

## Prior Mean and Maximum Likelihood Estimate (MLE)

The prior mean of $\theta$ is:

$$
E[\theta] = \frac{\alpha}{\alpha + \beta}
$$

The maximum likelihood estimate (MLE) of $\theta$ is:

$$
\hat{\theta}_{MLE} = \frac{y}{n}
$$

## Expressing the Credibility Estimate

We can rewrite the posterior mean as:

$$
\hat{\theta} = \frac{\alpha}{\alpha + \beta + n} + \frac{n}{\alpha + \beta + n} \cdot \frac{y}{n}
$$

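Because $\frac{\alpha}{\alpha + \beta + n} = \frac{\alpha + \beta}{\alpha + \beta + n} \cdot \frac{\alpha}{\alpha + \beta}$, the posterior mean has the form: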

$$
\hat{\theta} = (1 - k) A + k B
$$

where:

- $A = E[\theta] = \frac{\alpha}{\alpha + \beta}$ (prior mean)
- $B = \frac{y}{n}$ (MLE)
- $k = \frac{n}{n + \alpha + \beta}$ (credibility factor)
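The identity between the weighted average and the posterior mean can be checked exactly with rational arithmetic. The sketch below assumes integer inputs; the helper name `credibility_decomposition` and the example values are hypothetical and serve only as an illustration.

```python
from fractions import Fraction


def credibility_decomposition(alpha, beta, n, y):
    """Return (A, B, k): prior mean, MLE and credibility factor."""
    A = Fraction(alpha, alpha + beta)   # prior mean
    B = Fraction(y, n)                  # maximum likelihood estimate
    k = Fraction(n, n + alpha + beta)   # credibility factor
    return A, B, k


# Illustrative values only (not taken from the exercise).
alpha, beta, n, y = 3, 2, 10, 7
A, B, k = credibility_decomposition(alpha, beta, n, y)

weighted = (1 - k) * A + k * B
posterior_mean = Fraction(alpha + y, alpha + beta + n)

# The two expressions agree exactly, not just approximately.
assert weighted == posterior_mean
print(f"A = {A}, B = {B}, k = {k}, estimate = {weighted}")
```

Using `Fraction` rather than floats avoids rounding, so the equality test is an exact algebraic check for the chosen inputs.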

## Discussion

The posterior mean $\hat{\theta}$ serves as a credibility estimate, expressed as a weighted average of the prior mean $A$ and the MLE $B$. The weight assigned to the MLE is the credibility factor $k$, which depends on the sample size $n$ relative to the prior weight $\alpha + \beta$.

As $n$ increases, the credibility factor $k$ approaches 1, indicating that the influence of the prior diminishes with increasing data. This aligns with our intuition: with sufficient data, the estimate should rely more on the observed outcomes than on the prior beliefs.
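The limiting behaviour of $k$ can be seen by tabulating it over a range of sample sizes. This is a small sketch; the function name `credibility_factor`, the prior values $\alpha = 2$, $\beta = 6$ and the grid of $n$ values are illustrative assumptions.

```python
def credibility_factor(n, alpha, beta):
    """Weight given to the MLE: k = n / (n + alpha + beta)."""
    return n / (n + alpha + beta)


alpha, beta = 2, 6  # illustrative prior parameters
for n in (1, 5, 20, 100, 1000):
    k = credibility_factor(n, alpha, beta)
    print(f"n = {n:>4}: k = {k:.3f}")
```

For a fixed prior, $k$ increases monotonically in $n$ and exceeds 0.99 once $n$ is large compared with $\alpha + \beta$.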

## Visual Illustration
## Case (a): $n = 5$, $y = 4$, $\alpha = 2$, $\beta = 6$

In this case, we compute the following:

- Prior Mean $A$:

$$
A = \frac{\alpha}{\alpha + \beta} = \frac{2}{2 + 6} = 0.25
$$

- MLE $B$:

$$
B = \frac{y}{n} = \frac{4}{5} = 0.8
$$

- Posterior Mean:

The posterior distribution is $\text{Beta}(6, 7)$, so the posterior mean is:

$$
\hat{\theta} = \frac{\alpha + y}{\alpha + \beta + n} = \frac{2 + 4}{2 + 6 + 5} = \frac{6}{13} \approx 0.462
$$

- Credibility Factor $k$:

$$
k = \frac{n}{n + \alpha + \beta} = \frac{5}{5 + 2 + 6} = \frac{5}{13} \approx 0.385
$$

## Case (b): $n = 20$, $y = 16$, $\alpha = 2$, $\beta = 6$

For this case, we compute:

- Prior Mean $A$:

$$
A = \frac{2}{2 + 6} = 0.25
$$

- MLE $B$:

$$
B = \frac{y}{n} = \frac{16}{20} = 0.8
$$

- Posterior Mean:

The posterior distribution is $\text{Beta}(18, 10)$, so the posterior mean is:

$$
\hat{\theta} = \frac{\alpha + y}{\alpha + \beta + n} = \frac{2 + 16}{2 + 6 + 20} = \frac{18}{28} \approx 0.643
$$

- Credibility Factor $k$:

$$
k = \frac{n}{n + \alpha + \beta} = \frac{20}{20 + 2 + 6} = \frac{20}{28} \approx 0.714
$$

## Comparison of Cases

Both cases have the same prior mean and MLE:

- **Prior Mean:** $A = 0.25$
- **MLE:** $B = 0.8$

However, due to the larger sample size $n$ in case (b), the credibility factor is larger:

- **Credibility Factor:** $k \approx 0.714$ (case b) vs $k \approx 0.385$ (case a)

This leads to a posterior mean closer to the MLE in case (b):

- **Posterior Mean:** $\hat{\theta} \approx 0.643$ (case b) vs $\hat{\theta} \approx 0.462$ (case a)
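
As a check on the arithmetic, substituting each case's values of $A$, $B$ and $k$ into $(1 - k)A + kB$ recovers the posterior means computed directly from the posterior distribution:

$$
\text{(a)}\quad \frac{8}{13} \cdot \frac{1}{4} + \frac{5}{13} \cdot \frac{4}{5} = \frac{2}{13} + \frac{4}{13} = \frac{6}{13} \approx 0.462
$$

$$
\text{(b)}\quad \frac{8}{28} \cdot \frac{1}{4} + \frac{20}{28} \cdot \frac{4}{5} = \frac{2}{28} + \frac{16}{28} = \frac{18}{28} \approx 0.643
$$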

## Note on Likelihood Functions

Each likelihood function in Figure 1.9 has been normalized so that the area underneath it is exactly 1. This means that in each of cases (a) and (b), the likelihood function $L(\theta)$ as shown is identical to the posterior density implied by the standard uniform prior, i.e. under:

$$
f_{U(0, 1)}(\theta) = f_{\text{Beta}(1, 1)}(\theta)
$$

Thus, we have:

$$
L(\theta) = f_{\text{Beta}(1 + y, 1 + n - y)}(\theta)
$$
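
This equivalence can be checked numerically. The sketch below uses only the Python standard library: it normalizes the binomial likelihood with a midpoint-rule integral and compares the result with the $\text{Beta}(1 + y, 1 + n - y)$ density at a few points. The function names, the number of integration cells and the evaluation points are all assumptions made for illustration.

```python
import math


def beta_pdf(theta, a, b):
    """Beta(a, b) density at theta, for 0 < theta < 1."""
    log_norm = math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b)
    return math.exp(
        log_norm + (a - 1) * math.log(theta) + (b - 1) * math.log(1 - theta)
    )


def normalized_likelihood(theta, n, y, cells=10_000):
    """Binomial likelihood in theta, rescaled to integrate to 1 over (0, 1)."""
    def lik(t):
        return math.comb(n, y) * t**y * (1 - t) ** (n - y)

    h = 1.0 / cells
    # Midpoint rule approximation of the area under the likelihood.
    area = h * sum(lik((i + 0.5) * h) for i in range(cells))
    return lik(theta) / area


# Cases (a) and (b) from the exercise, evaluated at arbitrary points.
for n, y in ((5, 4), (20, 16)):
    for theta in (0.2, 0.5, 0.8):
        lhs = normalized_likelihood(theta, n, y)
        rhs = beta_pdf(theta, 1 + y, 1 + n - y)
        assert math.isclose(lhs, rhs, rel_tol=1e-6)
print("Normalized likelihood matches the Beta(1 + y, 1 + n - y) density.")
```

The agreement holds only up to the accuracy of the numerical integration, which is why the comparison uses a relative tolerance rather than exact equality.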

Figure 1.9 illustrates credibility estimation by showing the relevant densities, likelihoods, and estimates for cases (a) and (b).
![image.png](attachment:image.png)
