## The GLM Model

- The GLM uses incremental losses, $q(w,d)$, and a log-link function:

$$E[q(w,d)] = m_{w,d}$$
$$Var(q(w,d)) = \phi m_{w,d}^z$$

where $m_{w,d} = e^{c + \alpha_w + \beta_2 + \cdots + \beta_d}$ in the specification with a constant, or $m_{w,d} = e^{\alpha_w + \beta_2 + \cdots + \beta_d}$ in the specification without one

- The $\alpha_w$ parameters adjust for the level of each accident year and the $\beta_d$ parameters adjust for development trends.

- In the specification with a constant ($c$), $\alpha_1 = \beta_1 = 0$
- In the specification without a constant, $\alpha_1 > 0$ and $\beta_1 = 0$ $\leftarrow$ used in paper

- The power $z$ specifies the error distribution:
    - Normal: z = 0
    - ODP: z = 1
    - Gamma: z = 2
    - Inverse Gaussian: z = 3
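
As a rough illustration of the mean and variance structure above, the sketch below evaluates $m_{w,d}$ under the specification without a constant and applies $\phi m_{w,d}^z$. The function names and the parameter values are hypothetical and are not taken from the paper.

```python
import numpy as np

def glm_mean(alpha, beta, w, d):
    """m_{w,d} = exp(alpha_w + beta_2 + ... + beta_d), with w and d 1-indexed.

    Index 0 of each array holds the first parameter; beta[0] is beta_1 = 0.
    """
    return float(np.exp(alpha[w - 1] + np.sum(beta[1:d])))

def glm_variance(m, phi, z):
    """Var(q) = phi * m**z (z = 0, 1, 2, 3: Normal, ODP, Gamma, Inverse Gaussian)."""
    return phi * m ** z

# Hypothetical parameters for a 3 x 3 triangle.
alpha = np.array([6.0, 6.1, 6.2])
beta = np.array([0.0, -0.5, -1.0])
m_23 = glm_mean(alpha, beta, w=2, d=3)      # exp(6.1 - 0.5 - 1.0)
var_23 = glm_variance(m_23, phi=2.0, z=1)   # ODP variance
```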

## The ODP Model

- An <b>important property</b> of the ODP model is that the fitted incremental losses equal the incremental claims from the standard CL model. (Start with the latest diagonal and divide backwards by the LDFs to fill out the upper triangle.)
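
The link ratio shortcut can be sketched as follows. This is a minimal illustration that assumes volume-weighted age-to-age factors and a square cumulative triangle with `NaN` below the latest diagonal; the function name and the sample triangle are hypothetical.

```python
import numpy as np

def odp_fitted_incrementals(cum):
    """Return (LDFs, fitted incremental triangle) from a cumulative triangle."""
    n = cum.shape[0]
    # Volume-weighted age-to-age factors from the observed cells.
    ldf = np.ones(n - 1)
    for d in range(n - 1):
        rows = n - 1 - d  # rows where both age d and age d+1 are observed
        ldf[d] = cum[:rows, d + 1].sum() / cum[:rows, d].sum()
    # Start from the latest diagonal and divide backwards by the LDFs.
    fitted_cum = np.full_like(cum, np.nan, dtype=float)
    for w in range(n):
        last = n - 1 - w  # column of the latest diagonal for row w
        fitted_cum[w, last] = cum[w, last]
        for d in range(last - 1, -1, -1):
            fitted_cum[w, d] = fitted_cum[w, d + 1] / ldf[d]
    # Difference the fitted cumulative values to get fitted incrementals.
    fitted_inc = fitted_cum.copy()
    fitted_inc[:, 1:] = fitted_cum[:, 1:] - fitted_cum[:, :-1]
    return ldf, fitted_inc

# Hypothetical 3 x 3 cumulative triangle.
cum = np.array([[100.0, 150.0, 165.0],
                [110.0, 170.0, np.nan],
                [120.0, np.nan, np.nan]])
ldf, fitted = odp_fitted_incrementals(cum)
```

In this small example the fitted first column sums to the same total as the actual first column, which is what we would expect from an ODP fit.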

#### Three consequences of this property

1. Simple link ratio algorithm can be used in place of the more complicated GLM algorithm, while still maintaining the GLM framework.
2. Use of age-to-age factors acts as a bridge to the deterministic framework. This allows the model to be more easily explained.
3. In general, the log-link function does not work for negative incremental claims. Using link ratios remedies this problem.

#### Advantage of ODP Bootstrap

- We assume that the residuals are <b>independent and identically distributed</b>, but do NOT require them to be normally distributed.

#### Degrees of Freedom Adjustment Factor

<center>$f^{DoF} = \sqrt{\frac{N}{N-P}}$</center>

where $N$ = # of data cells in the triangle and $P$ = # of parameters = 2(# of AYs) - 1

- The <b>scaled Pearson residual</b> can be calculated as: $r_{w,d}^S = r_{w,d} \times f^{DoF}$

- When using standardized residuals ($r_{w,d}^H$), the scale parameter should still be calculated using the unscaled Pearson residuals. However, the scale parameter can be <b>approximated</b> as:

$$\phi^H = \frac{\sum(r_{w,d}^H)^2}{N}$$

For comparison, the regular scale parameter is: $\phi = \frac{\sum r_{w,d}^2}{N-P}$
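
A minimal sketch of these quantities, assuming the usual unscaled Pearson residual $r_{w,d} = (q(w,d) - m_{w,d}) / \sqrt{m_{w,d}^z}$. The function names are hypothetical, and missing cells are assumed to be stored as `NaN`.

```python
import numpy as np

def pearson_residuals(actual, fitted, z=1):
    """Unscaled Pearson residuals; z = 1 corresponds to the ODP model."""
    return (actual - fitted) / np.sqrt(fitted ** z)

def dof_factor(n_cells, n_params):
    """Degrees of freedom adjustment factor sqrt(N / (N - P))."""
    return float(np.sqrt(n_cells / (n_cells - n_params)))

def scale_parameter(resid, n_params):
    """phi = sum(r^2) / (N - P), using the unscaled residuals."""
    r = resid[~np.isnan(resid)]
    return float(np.sum(r ** 2) / (r.size - n_params))

# A 10 x 10 triangle has N = 55 cells and P = 2 * 10 - 1 = 19 parameters.
f_dof = dof_factor(55, 19)
```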

#### Process Variance

- To include process variance in the future incremental losses, we assume that each future incremental loss is drawn from a gamma distribution with mean $m_{w,d}$ and variance $\phi m_{w,d}$.
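
One way to sketch this step, assuming the gamma is parameterized to have mean $m_{w,d}$ and variance $\phi m_{w,d}$ (so shape $= m_{w,d}/\phi$ and scale $= \phi$). The function name is hypothetical.

```python
import numpy as np

def add_process_variance(m, phi, rng):
    """Sample future incrementals from a gamma with mean m and variance phi * m."""
    m = np.asarray(m, dtype=float)
    # shape * scale = m and shape * scale**2 = phi * m
    return rng.gamma(shape=m / phi, scale=phi)

rng = np.random.default_rng(0)
draws = add_process_variance(np.full(200_000, 100.0), 5.0, rng)
```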

## ODP Bootstrap Algorithm Mechanics

<table><tr><td><img src='images/Bootstrap_1.JPG'></td><td><img src='images/Bootstrap_2.JPG'></td></tr></table>
<center><img src='images/Bootstrap_3.JPG'></center>

<center><img src='images/Bootstrap_4.JPG'></center>

<center><img src='images/Bootstrap_5.JPG'></center>

<center><img src='images/Bootstrap_6.JPG'></center>
<center><img src='images/Bootstrap_7.JPG'></center>

<center><img src='images/Bootstrap_8.JPG'></center>
<center><img src='images/Bootstrap_9.JPG'></center>

<center><img src='images/Bootstrap_10.JPG'></center>

## Bootstrapping the Incurred Loss Triangle

#### Approach 1

- Run both paid and incurred data models. Then use the payment pattern from each iteration of the paid data model to convert the ultimate values from each corresponding incurred model iteration to develop paid losses by AY.

<b>Advantage:</b> It allows us to use the case reserves to help predict the ultimate losses, while still focusing on the payment stream for measuring risk.

<b>Possible improvement:</b> Inclusion of correlation between paid and incurred models (possibly in the residual sampling process). For example, if we want to compare iterations showing long payment streams with iterations showing high incurred results, we must consider correlation.

#### Approach 2

- Apply the ODP bootstrap to the Munich Chain-Ladder (MCL) model. The MCL uses the inherent relationship/correlation between paid and incurred losses to predict ultimate losses. When paid losses are low relative to incurred losses, future paid loss development tends to be higher than average, and vice versa.

##### Advantages

1. It does not require us to model paid losses twice.
2. It explicitly measures the correlation between paid and incurred losses.

## Bootstrapping the BF and CC Models

- To reduce the variability in the last few AYs (due to the application of MORE age-to-age factors), we can extrapolate using the BF or CC methods.
- These methods can be made stochastic by assuming a distribution for the a priori loss ratios.

## GLM vs ODP Bootstrap Model

- The ODP bootstrap requires a parameter for each AY and each development factor, while the GLM bootstrap can use fewer parameters.
- The GLM bootstrap does not use age-to-age factors; instead, it relies on the fitted parameter values to project future loss development.

#### Drawbacks of GLM Bootstrap

- The GLM must be solved for each iteration of the bootstrap model, which slows down the simulation.
- The model is no longer easily explainable to others using age-to-age factors.


#### Benefits of GLM Bootstrap

- Fewer parameters help avoid over-parameterization of the model.
- Allows us to include CY trend parameter(s) in the model.
- Gives us the ability to model data shapes other than triangles.
    - Can model trapezoid-shaped data
- Allows us to match the model parameters to the statistical features found in the data, and extrapolate those features.
    - Can get a tail factor by assuming last beta parameter continues past the end of the triangle.

## Practical Issues Mentioned by the Author

1. Negative incremental values
2. Negative values during simulation
3. Negative values and extreme outcomes
4. Non-Zero sum of residuals
5. Using L-year weighted average
6. Missing values
7. Outliers
8. Heteroscedasticity
9. Heteroecthesious data
10. Exposure adjustment
11. Tail factors
12. Fitting a distribution to ODP bootstrap residuals

## Practical Issues - Negative incremental losses

- Log-link function can't handle negative incremental values.
- If the column-sum of incremental values is positive, the GLM bootstrap can use the following modified log-link:
<br><br>
<center> ln[q(w,d)] for q(w,d) > 0,</center>
<center> 0 for q(w,d) = 0,</center>
<center> -ln[abs(q(w,d))] for q(w,d) < 0 </center>
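
The piecewise definition above translates directly into code. This is a small sketch with a hypothetical function name.

```python
import numpy as np

def modified_log(q):
    """ln(q) for q > 0, 0 for q = 0, and -ln(|q|) for q < 0."""
    q = np.asarray(q, dtype=float)
    out = np.zeros_like(q)
    pos, neg = q > 0, q < 0
    out[pos] = np.log(q[pos])
    out[neg] = -np.log(np.abs(q[neg]))
    return out

values = modified_log([np.e, 0.0, -np.e])
```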

- If the column-sum of incremental values is negative, then we must do the following:

<center><img src='images/Neg_Incr_1.JPG'></center>
<center><img src='images/Neg_Incr_2.JPG'></center>
<center><img src='images/Neg_Incr_3.JPG'></center>

<center><img src='images/Neg_Incr_4.JPG'></center>

## Practical Issues - Negative Values During Simulation

- When each future incremental value is sampled from a gamma distribution to add process variance, the parameters must be positive.
- To simulate negative values when using a gamma distribution, we use one of the following two options:

<center>$m_{w,d} = -Gamma[abs(m_{w,d}),\phi \cdot abs(m_{w,d})]$</center>
<center>$m_{w,d} = Gamma[abs(m_{w,d}),\phi \cdot abs(m_{w,d})] + 2m_{w,d}$</center>

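- The second option is preferred: it shifts the gamma distribution so that its mean is $m_{w,d}$ while keeping it skewed to the right (as it should be for a gamma distribution), whereas the first option flips the distribution and skews it to the left.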
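
The two options can be compared with a short simulation. The sketch assumes that `Gamma[a, b]` in the formulas above denotes a gamma distribution with mean `a` and variance `b`; the function names are hypothetical.

```python
import numpy as np

def gamma_mean_var(mean, var, size, rng):
    """Gamma draws parameterized by mean and variance."""
    return rng.gamma(shape=mean ** 2 / var, scale=var / mean, size=size)

def simulate_negative_option1(m, phi, size, rng):
    """Option 1: flip the sign of a gamma fitted to abs(m)."""
    a = abs(m)
    return -gamma_mean_var(a, phi * a, size, rng)

def simulate_negative_option2(m, phi, size, rng):
    """Option 2: shift a gamma fitted to abs(m) by 2m so its mean is m."""
    a = abs(m)
    return gamma_mean_var(a, phi * a, size, rng) + 2 * m

def skewness(x):
    c = x - x.mean()
    return float((c ** 3).mean() / (c ** 2).mean() ** 1.5)

rng = np.random.default_rng(0)
opt1 = simulate_negative_option1(-50.0, 2.0, 200_000, rng)
opt2 = simulate_negative_option2(-50.0, 2.0, 200_000, rng)
```

Both samples are centered near -50, but only the second keeps a positive skew.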

<table><tr><td><img src='images/Gamma_1.JPG'></td><td><img src='images/Gamma_2.JPG'></td></tr></table>

## Practical Issues - Negative Values and Extreme Outcomes

- Negative incremental values in the first few development columns can cause extreme outcomes.
    - Can lead to large LDFs if the starting column has column-sum close to 0.
    
#### Ways of dealing with the issue
1. <b>Identify extreme iterations and remove them</b> - Need to be careful and remove unreasonable iterations only.
2. <b>Recalibrate the model</b> - Identify the source of the negative incremental losses and remove it if necessary.
    - If caused by sparse data in the first row, remove the first row and reparameterize.
    - If caused by salvage and subrogation (sal/sub), model gross losses and sal/sub separately, then combine them assuming 100% correlation.

3. <b>Limit incremental losses to zero</b> - Can be done to the original triangle, sampled triangle or the projected future incremental losses.

## Practical Issues - Non-Zero Sum of Residuals

- Only adjust if we think a non-zero sum of residuals is unreasonable.
    - A non-zero sum could simply be a characteristic of the data set.
- To adjust, add a constant to every residual so that the shifted residuals sum to zero.
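
A minimal sketch of the adjustment: subtracting the mean residual is the constant shift that makes the residuals sum to zero. The function name is hypothetical, and empty cells are assumed to be stored as `NaN`.

```python
import numpy as np

def recenter_residuals(resid):
    """Shift residuals by a constant so they sum to zero (NaN cells ignored)."""
    return resid - np.nanmean(resid)

shifted = recenter_residuals(np.array([0.5, -0.2, 1.0, np.nan]))
```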


## Practical Issues - Using L-year Weighted Average

#### GLM Bootstrap
- Just use the relevant data from the triangle to parameterize the GLM model.

#### ODP Bootstrap
- Use L-year development factors for calculating incremental losses.
- Exclude first few diagonals when calculating residuals.
    - However, we still need to sample residuals for these diagonals (they are needed to calculate cumulative losses).
- Use L-year average factors to project losses to ultimate.

## Practical Issues - Missing Values

#### ODP Bootstrap

- Estimate the missing values using surrounding values.
- Exclude the missing value when calculating LDFs
- However, still need to sample residuals for the missing values (need them for calculating cumulative values)
- Must exclude missing value to calculate LDFs for projecting losses to ult.
    - We want to maintain the uncertainty related to the missing value.
- If there are missing values on the latest diagonal, then we have 2 options:
    1. Estimate the missing value.
    2. Use the value in the second latest diagonal to construct the fitted triangle.

#### GLM Bootstrap

- If desired, we can estimate the missing value using surrounding values.
- Otherwise, parameterize the GLM model using the remaining values.

## Practical Issues - Outliers

- If the extreme value(s) are not representative of the variability of the dataset, then their impact could be removed from the model. (see solutions below)
    
#### ODP Bootstrap

- Treat the outlier as missing value.
- Exclude the outlier when calculating LDFs and residuals.
    - Can be done in 3 ways:
        1. Exclude in the numerator
        2. Exclude in the denominator
        3. Exclude in the numerator and denominator
        


- However, still need to sample residual for the outlier (need them for calculating cumulative values)
- Include the outlier cell to calculate LDFs for projecting losses to ult. (different from missing value)
    - This way we include some non-extreme variability in the sample triangle projection.
    
#### GLM Bootstrap
- Treat the outliers as missing data and parameterize the model with the remaining data.

### What to do if there are a significant number of Outliers?

1. Might indicate that the model is a poor fit to the data.
2. <b>GLM bootstrap</b> - New parameters could be chosen or the error distribution could be changed (i.e., the z parameter).
3. <b>ODP bootstrap</b> - Since ODP does not require a distribution for the residuals, it could be the case that the residuals are highly skewed. If skewness is real, then the outliers should be included. L-year wtd. average could also be used to provide a better fit.

## Practical Issues - Heteroscedasticity

- The regular ODP bootstrap method works because we assume all residuals are independent and identically distributed. If this is not the case, we need to adjust the procedure.

- If the standardized residuals have different variances, then we have <b>heteroscedasticity</b>.
    - Need to be mindful of credibility when making this assessment (especially in the tail).
    
#### Stratified Sampling
- Group development periods with homogeneous variances.
- Sample with replacement from the residuals within each group separately.
- <b>Advantage</b> - It is simple to understand and implement.
- <b>Disadvantage</b> - Some groups may have few residuals in them, which limits the amount of variability in the possible outcomes.

#### Using Variance Parameter
- Calculate hetero-adjustment factor for each group, i:

$$h_i = \frac{stdev(\cup^{Total} r_{w,d}^H)}{stdev(\cup_i r_{w,d}^H)}$$

<center><img src='images/Hetero_1.JPG'></center>

- Basically adjust the residuals so the whole triangle has similar variance/std.
- After sampling, we adjust the residual to the variance/std. of the new cohort. This means the $h_i$ below may be different from the $h_i$ used initially.
<center><img src='images/Hetero_2.JPG'></center>
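
The hetero-adjustment can be sketched as below: residuals are multiplied by $h_i$ before pooling, then each sampled residual is divided by the $h_i$ of the group it lands in. This is an illustration only; the function names are hypothetical and the use of the sample standard deviation (`ddof=1`) is an assumption.

```python
import numpy as np

def hetero_factors(resid, groups):
    """h_i = stdev(all residuals) / stdev(residuals in group i)."""
    total_sd = np.std(resid, ddof=1)
    return {g: total_sd / np.std(resid[groups == g], ddof=1)
            for g in np.unique(groups)}

def sample_hetero_adjusted(resid, groups, target_groups, rng):
    """Sample pooled residuals, then rescale to each target cell's group."""
    h = hetero_factors(resid, groups)
    # Put every residual on a common variance scale before pooling.
    pooled = resid * np.array([h[g] for g in groups])
    draws = rng.choice(pooled, size=len(target_groups), replace=True)
    # Return each draw to the variance of the group it is assigned to.
    return draws / np.array([h[g] for g in target_groups])

# Hypothetical residuals: group 0 is less variable than group 1.
resid = np.array([-1.0, 1.0, -1.0, 1.0, -3.0, 3.0, -3.0, 3.0])
groups = np.array([0, 0, 0, 0, 1, 1, 1, 1])
h = hetero_factors(resid, groups)
sampled = sample_hetero_adjusted(resid, groups, np.array([0, 0, 1, 1, 1]),
                                 np.random.default_rng(0))
```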

#### Using Scale Parameter

- Calculate the scale parameter for each group. 

<center><img src='images/Hetero_3.JPG'></center>


- After this, similar to the variance parameter case, adjust variance based on the new cohort.
- Note that the residuals used in this case are the <b>unscaled Pearson residuals</b>.
- Also note that the hetero-adjustment factors count as additional parameters (less one):
    - p = p + #(hetero groups) - 1

## Practical Issues - Heteroecthesious Data

- This occurs when there are incomplete or uneven exposures at interim evaluation dates.
    - Partial first development period - evaluations are at 6, 18, 30, etc. months.
    - Partial last calendar period - the latest evaluation has 6, 18, 30, etc. months of development.
    
#### Partial First Development Period Data

- Not a problem for parameterizing an ODP bootstrap model since Pearson residuals use the square root of the fitted value to make them all "exposure independent"
    - Units cancel out when we calculate residuals
- In <b>deterministic analysis</b>, for most recent AY, we simply reduce the projected future payments by half to remove the exposures from 06/30 to 12/31.

- During <b>ODP bootstrap simulation process</b>, for most recent AY, again reduce the projected future values by half, then simulate the process variance as usual.
    - Alternatively, we can reduce the future values by half AFTER simulating the process variance.
    
#### Partial Last Calendar Period Data

- <b>Deterministic analysis</b> - exclude the last diagonal when calculating age-to-age factors, interpolate those factors for the exposures in the last diagonal and use them for projecting future losses. Then reduce the future values for the latest AY by half.

- <b>Parameterizing ODP bootstrap model</b> - Annualize the exposures in the last diagonal to make it consistent with the rest of the triangle. Then calculate fitted triangle and residuals.

- <b> ODP bootstrap simulation process</b> - After getting sample triangle, calculate LDFs. Then adjust latest diagonal back to 6 mos. Then multiply cumulative values by interpolated age-to-age factors to project future losses. Then reduce the future values for the latest AY by half.

## Practical Issues - Exposure Adjustment

- Has to be done when exposures change dramatically over the years.

#### ODP Bootstrap Model
- Divide losses by earned exposures to get pure premiums.
- At the end multiply the result with corresponding earned exposures.

#### GLM Bootstrap Model
- Fit on pure premiums as well.
- However, we assume AYs with higher exposures have lower variance and vice versa.
- This could allow fewer AY parameters for the GLM bootstrap model.

## Practical Issues - Tail Factors

#### ODP Bootstrap Model
- Can add a tail factor to each sample triangle.
- Could extrapolate the age-to-ult factor over 3 years using a decay model with a decay of 50%, e.g. 60-ult: 1.05, 72-ult: 1.025 (= 1 + 0.05 × 0.5), 84-ult: 1.0125 (= 1 + 0.025 × 0.5).
- To make it a stochastic process, could assume a distribution for tail factors.
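
The decay arithmetic in the example above can be reproduced with a few lines. The function name is hypothetical.

```python
def decayed_tail_factors(first_factor, decay, n_periods):
    """Each factor's excess over 1 is the previous excess times the decay."""
    factors = [first_factor]
    for _ in range(n_periods - 1):
        factors.append(1.0 + (factors[-1] - 1.0) * decay)
    return factors

tail = decayed_tail_factors(1.05, 0.5, 3)  # approximately 1.05, 1.025, 1.0125
```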

#### GLM Bootstrap Model
- The last beta parameter essentially gives us the tail factor - we assume the last beta parameter applies incrementally until its effect on the future incremental losses is negligible.
- Same logic if we also have CY parameter.

## Practical Issues - Fitting Distribution to ODP Bootstrap Residuals

- If we believe that extreme observations are NOT captured well in the loss triangle, then we can parameterize a distribution for the residuals (such as normal) and resample using the distribution.

## Diagnostics Mentioned By the Author

1. Residual graphs
2. Normality test
3. Outliers
4. Parameter adjustment
5. Estimate unpaid model results
6. Mean and std. of incremental values

## Diagnostics - Residual Graphs

- Test the assumption that residuals are independent and identically distributed.
- Types of residual graphs
    - Development Period
    - Accident Period
    - Calendar Period
    - Incremental Losses
- The residuals should have a random pattern.

## Diagnostics - Normality Test

- Compare ODP model residuals against normal dist to compare parameter sets and assess skewness.

<center><img src='images/Normality_Plot.JPG'></center>

#### Calculate Test Values

- <b>P-value</b> - If the residuals are normally distributed, the p-value should be greater than 5%.
- $R^2$ - The value for normally distributed residuals should be close to 1.
- $R^2$ and <b>p-value</b> are criticized for not penalizing the model for the number of parameters.

- <b>AIC</b> and <b>BIC</b> penalize for parameters used. We want the values to be as small as possible.

$$AIC = 2p + n \left[ln\left(\frac{2 \pi RSS}{n}\right) + 1 \right]$$<br>
$$BIC = n*ln\left(\frac{RSS}{n}\right) + p*ln(n)$$
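
The two formulas above are simple to evaluate once $RSS$, $n$ and $p$ are known. In this sketch, $n$ is taken to be the number of residuals and $p$ the number of model parameters; the function names are hypothetical.

```python
import math

def aic(rss, n, p):
    """AIC = 2p + n * [ln(2 * pi * RSS / n) + 1]."""
    return 2 * p + n * (math.log(2 * math.pi * rss / n) + 1)

def bic(rss, n, p):
    """BIC = n * ln(RSS / n) + p * ln(n)."""
    return n * math.log(rss / n) + p * math.log(n)
```

With the same RSS and n, adding a parameter raises the AIC by 2 and the BIC by ln(n), so the BIC penalizes extra parameters more heavily once n exceeds about 7.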

## Diagnostics - Outliers

- Can be identified using a box-whisker plot
- Only adjust/remove outliers that are unreasonable

## Diagnostics - Parameter Adjustment

- For the <b>GLM Model</b> - we keep adding parameters to the "basic" model as long as residuals are not randomly distributed around 0 (For various cuts of data).
    - The implied pattern for GLM model should be a smoothed version of the CL/ODP model.
<br><br>    
<center><img src='images/ODPvsOptimal_GLM.JPG'></center>

## Diagnostics - Estimated Unpaid Model Results

- The standard error should increase when moving from the oldest years to the most recent years.
    - This is because the standard error follows the magnitude of the results.
- The total standard error should be larger than that of any individual year.
- The coefficient of variation (C.V.) should generally decrease when moving from the oldest to the most recent years.
    - The C.V. for all years combined should be less than that of any individual AY.
    - The oldest AYs involve the fewest parameters but have the greatest relative uncertainty, since few payments remain.
    - For recent AYs, random variations in the remaining payments tend to offset one another, thereby reducing variability.


- However, the C.V. may rise in the most recent AYs because:
    - As the # of parameters increases, parameter uncertainty may overpower process uncertainty, causing an increase in variability.
    - The model may simply be overestimating the variability in the most recent years.
        - We might have to use the BF or CC method for the latest AYs in this case.

## Diagnostics - Mean and Std. of Incremental Values

- The mean and std. of future values should help narrow down issues with the CV for the AY.

<center><img src='images/CV.JPG'></center>

## Estimated CY Cash Flow Results

- For future CYs, standard errors decrease and coefficients of variation increase as we move further into the future.
    - Each successive CY is one diagonal further to the right.
    - As we move further out, the CY unpaid claim estimates decrease, which also shrinks the standard errors.
    - This is the opposite of the AY results.

## Using Multiple Models

#### Run Models With the Same Random Variables
- Random residuals are picked in the same order.
- The incremental values for each model are weighted together.
- This method of combining does cause correlation in the model results since each model is run using the same set of random residuals.



#### Run Model with Independent Random Variables
- Random residuals are picked independently for each model.
- Weights are used to select a model by randomly sampling iterations from each model.
    - For 1000 iterations and weights of 25% and 75%, we sample 250 iterations from the first model and 750 iterations from the second model.  
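
The 25%/75% example can be sketched as follows. This illustration takes a fixed number of iterations from each model in proportion to its weight; the function name is hypothetical, and rounding means the counts are only guaranteed to add up to the total when the weights divide it evenly.

```python
import numpy as np

def blend_by_sampling(model_results, weights, n_iter, rng):
    """Combine models by sampling iterations from each in proportion to its weight."""
    counts = np.round(np.asarray(weights) * n_iter).astype(int)
    picks = [rng.choice(res, size=k, replace=False)
             for res, k in zip(model_results, counts)]
    return np.concatenate(picks)

rng = np.random.default_rng(0)
# Hypothetical simulated unpaid amounts from two independently run models.
model_a = np.full(1000, 1.0)
model_b = np.full(1000, 2.0)
blended = blend_by_sampling([model_a, model_b], [0.25, 0.75], 1000, rng)
```
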
<br><br>
- <b>Note:</b> weighting may produce unreasonable results by AY, such as negative IBNR. If negative IBNR is not reasonable, the distributions can be adjusted in the following manner:
    - If the shape and width of the distribution are appropriate, a fixed amount can be added to produce positive IBNR.
    - If the shape or width is not appropriate, the distribution can be multiplied by a factor to produce positive IBNR.

## Smooth Unpaid Losses

- We can also fit distributions to unpaid losses to generate smooth results.
- The smoothed results can be used to:
    - Assess the quality of the fit
    - Parameterize a dynamic financial analysis (DFA) model
    - Estimate extreme values
    - Estimate TVaR

## Correlation

- We can aggregate the results for each LOB up to the portfolio level.
- Since business segments tend to be correlated, we need to consider this correlation when calculating portfolio results.

#### Location Mapping
- Pick the residuals in the same order for all the LOBs.
- <b>Advantage</b> - Can be easily implemented in a spreadsheet and does not require us to estimate a correlation matrix.
- <b>Disadvantages</b> 
    - Requires all LOBs to have residual triangles of same size with no missing values or outliers. 
    - Correlation of original residuals is used in the model, and can't use another correlation assumption for stress testing the aggregate results. 
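
Location mapping can be sketched by drawing one set of cell positions and reusing it for every LOB. The sketch assumes each LOB's residual triangle has been flattened in the same cell order; the function name and LOB labels are hypothetical.

```python
import numpy as np

def location_mapped_sample(resid_by_lob, rng):
    """Sample the same residual locations from every LOB's residual array."""
    n = len(next(iter(resid_by_lob.values())))
    idx = rng.integers(0, n, size=n)  # one set of sampled positions for all LOBs
    return {lob: r[idx] for lob, r in resid_by_lob.items()}, idx

rng = np.random.default_rng(1)
residuals = {"auto": np.arange(6.0), "home": np.arange(6.0) * 10}
sample, idx = location_mapped_sample(residuals, rng)
```

Because the positions are shared, whatever correlation exists between the original residual triangles carries through to the samples, which is why no correlation matrix needs to be estimated.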

#### Re-sorting

- The residuals are re-sorted until the rank correlation between LOBs matches the desired correlation.
    - P-value, correlation coefficient for LOBs and judgement can be used to select correlation matrix assumption.    
- <b>Benefits</b>
    - Different correlation assumptions may be employed.
        - Might have beneficial impacts on aggregate distribution
    - Can be used with triangles of different shapes and sizes.
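
A simplified sketch of re-sorting for two LOBs is shown below. It is not necessarily the algorithm used in the paper: it draws correlated normal scores and re-orders each LOB's simulated values to follow the ranks of those scores, so the rank correlation of the output is only approximately equal to the `rho` supplied. All names are hypothetical.

```python
import numpy as np

def resort_to_rank_correlation(x, y, rho, rng):
    """Re-order x and y so their rank correlation is approximately rho."""
    n = len(x)
    cov = np.array([[1.0, rho], [rho, 1.0]])
    scores = rng.multivariate_normal(np.zeros(2), cov, size=n)
    # Rank of each score within its column (0 = smallest).
    rank_x = scores[:, 0].argsort().argsort()
    rank_y = scores[:, 1].argsort().argsort()
    # Place the k-th smallest simulated value where the k-th smallest score sits.
    return np.sort(x)[rank_x], np.sort(y)[rank_y]

def spearman(a, b):
    """Spearman rank correlation (assumes no ties)."""
    return float(np.corrcoef(a.argsort().argsort(), b.argsort().argsort())[0, 1])

rng = np.random.default_rng(0)
# Hypothetical simulated unpaid losses for two LOBs, initially independent.
lob_a = rng.gamma(shape=4.0, scale=250.0, size=20_000)
lob_b = rng.gamma(shape=2.0, scale=900.0, size=20_000)
a_sorted, b_sorted = resort_to_rank_correlation(lob_a, lob_b, 0.6, rng)
```

Each LOB keeps exactly the same set of simulated values, so the marginal distributions are unchanged; only the pairing across LOBs changes.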