## Mack Assumptions (Venter)

1. $E[q(w,d+1)|\text{data to }w+d] = f(d)c(w,d)$
    - States that the expected value of the incremental losses to emerge in the next period is proportional to the total losses emerged to date.
    
2. Accident years are independent.
    - The CL method is not appropriate if CY effects exist, since a CY effect hits several accident years at once.
    
3. $Var[q(w,d+1)|\text{data to }w+d] = a[d,c(w,d)]$
    - Variance of the next increment is a function of the age and the cumulative losses to date.

## Testable Implications of Assumptions

1. Significance of factor f(d)
2. Superiority of factor assumption to alternative emergence patterns, such as:
    - Linear with constant: $E[q(w,d+1)|\text{data to }w+d] = f(d)c(w,d)+g(d)$
    - Factor times parameter: $E[q(w,d+1)|\text{data to }w+d] = f(d)h(w)$ (BF method)
    - Including CY effect: $E[q(w,d+1)|\text{data to }w+d] = f(d)h(w)g(w+d)$
3. Linearity of model: look at residuals as function of c(w,d)
4. Stability of factor: look at residuals as a function of time
5. No correlation among columns
6. No high or low diagonals (checking for CY effects)

### Testing Implication 1

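- To be considered significant, a factor should satisfy $\lvert f(d)\rvert > 2*std(f(d))$.
    - For incremental factors we are testing if f(d) $\ne$ 0.
    - For cumulative factors we are testing if f(d) $\ne$ 1, so the comparison applies to $\lvert f(d) - 1\rvert$.
- Often the distribution of f(d) has positive skew $\rightarrow$ a lognormal distribution may describe it better than a normal one.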
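
The two-standard-deviation check above can be sketched in a few lines. This is a minimal illustration, not Venter's own calculation: it assumes the factor is fitted by ordinary least squares through the origin on incremental losses, and the function name `factor_significance` and the sample data are hypothetical.

```python
import numpy as np

def factor_significance(c, q):
    """Fit q(w,d+1) = f(d) * c(w,d) by least squares through the origin
    and apply the |f| > 2 * std(f) rule (assumed OLS standard error)."""
    c = np.asarray(c, dtype=float)
    q = np.asarray(q, dtype=float)
    f = np.sum(c * q) / np.sum(c * c)          # least-squares factor
    resid = q - f * c
    dof = len(c) - 1                           # one fitted parameter
    se = np.sqrt(np.sum(resid ** 2) / dof / np.sum(c * c))
    return f, se, bool(abs(f) > 2 * se)

# Hypothetical column of a triangle: cumulative losses at age d and
# the incremental losses that emerged in the following period.
c = [100.0, 120.0, 90.0, 110.0]
q = [52.0, 58.0, 47.0, 54.0]
f, se, significant = factor_significance(c, q)
```

For a cumulative factor the same function can be applied after subtracting 1 from the fitted factor before comparing it with twice its standard error.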

### Testing Implication 2 - Superiority of Emergence Pattern 

- To compare different emergence patterns, we use the following statistics (lower is better):
    - Adjusted SSE = $\frac{SSE}{(n-p)^2}$ where n is the # of predicted points and p is the # of parameters.
        - Note: We assume the first column is given, hence it is excluded from the n count.
    - AIC $\approx$ SSE$* e^{\frac{2p}{n}}$
    - BIC $\approx$ SSE$* n^{\frac{p}{n}}$
<br><br>    
- <b>Linear with constant</b>
    - If constant is significant and factor is not, the additive CL method is more appropriate.

- <b>Factor times parameter / BF method</b>
    - Need (#AYs - 1) parameters for AY and (#development periods - 1) parameters for development period (2m - 2 total parameters for m AYs).
        - First AY is assumed to be at ultimate and the first development period is assumed to be given.
        - CL in comparison only needs m-1 parameters.
    - Cape Cod is a special case of BF since it assumes h(w) = h.
        - Similar to CL, it also has m-1 parameters.
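
The adjusted SSE and the AIC and BIC approximations listed in this section are easy to compute once SSE, n and p are known. The helper below is a small sketch; the name `fit_statistics` and the numbers in the example are hypothetical, chosen only to show how a model with more parameters is penalized.

```python
import math

def fit_statistics(sse, n, p):
    """Return the three comparison statistics for a fitted emergence pattern.

    sse: sum of squared errors over the predicted points
    n:   number of predicted points (first column excluded)
    p:   number of fitted parameters
    """
    if n <= p:
        raise ValueError("need more predicted points than parameters")
    return {
        "adjusted_sse": sse / (n - p) ** 2,
        "aic_approx": sse * math.exp(2 * p / n),
        "bic_approx": sse * n ** (p / n),
    }

# Hypothetical comparison for a triangle with 45 predicted points:
# a 9-parameter model versus an 18-parameter model with a lower raw SSE.
small_model = fit_statistics(100.0, n=45, p=9)
large_model = fit_statistics(70.0, n=45, p=18)
```

In this made-up example the larger model has the lower raw SSE but the higher adjusted SSE, which is the kind of trade-off these statistics are meant to expose.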

### Test 1
- Graph q(w,d+1) or c(w,d+1) against c(w,d)
    - Refer to the graphs in Brosius paper.


### Test 2
- To test different emergence patterns, we first need to re-calculate values in the triangle for each emergence pattern.
    - For BF method, we use an iterative procedure to minimize the sum of squared residuals
        - $h(w) = \frac{\sum_d \frac{q(w,d)}{f(d)}f(d)^2}{\sum_d f(d)^2} = \frac{\sum_d q(w,d)f(d)}{\sum_d f(d)^2}$<br><br>
        - $f(d) = \frac{\sum_w \frac{q(w,d)}{h(w)}h(w)^2}{\sum_w h(w)^2} = \frac{\sum_w q(w,d)h(w)}{\sum_w h(w)^2}$<br><br>
        
        

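- For the BF method, we can also use weighted least squares if the variance of the residuals is not constant over the triangle.
    - Assume the variance is proportional to $f(d)^p h(w)^q$, so the regression weights are $\frac{1}{f(d)^p h(w)^q}$ (here p and q are exponents, not parameter counts). If p = q = 1 then,
        - $h(w)^2 = \frac{\sum_d \frac{q(w,d)^2}{f(d)}}{\sum_d f(d)}$<br><br>
        - $f(d)^2 = \frac{\sum_w \frac{q(w,d)^2}{h(w)}}{\sum_w h(w)}$<br><br>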
        

- For the CC model, the procedure is basically the same as for the BF model, except we have a single h value estimated as:<br><br>
$$h = \frac{\sum_{w,d} q(w,d)f(d)}{\sum_{w,d} f(d)^2}$$
<br><br>
- The additive CL and CC methods will always produce the same adjusted SSE, as they are the same method.
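
The unweighted iterative procedure for the BF model can be sketched as alternating least squares: hold f(d) fixed and solve for h(w), then hold h(w) fixed and solve for f(d), and repeat until the parameters stop moving. The function name `fit_bf`, the starting values, the stopping rule and the sample triangle are assumptions made for illustration. The input is expected to contain only the incremental columns being predicted, with NaN in unobserved cells.

```python
import numpy as np

def fit_bf(q, n_iter=1000, tol=1e-10):
    """Alternating least squares for E[q(w,d)] = f(d) * h(w).

    q: 2-D array of incremental losses (rows = AY w, columns = age d),
       NaN where the cell has not been observed.
    Returns f, h and the SSE over the observed cells.
    """
    q = np.asarray(q, dtype=float)
    mask = (~np.isnan(q)).astype(float)        # 1.0 where observed
    q0 = np.nan_to_num(q, nan=0.0)             # zeros in unobserved cells
    f = np.ones(q.shape[1])                    # assumed starting values
    h = np.ones(q.shape[0])
    for _ in range(n_iter):
        # h(w) = sum_d q(w,d) f(d) / sum_d f(d)^2 over observed cells
        h_new = (q0 @ f) / (mask @ f ** 2)
        # f(d) = sum_w q(w,d) h(w) / sum_w h(w)^2 over observed cells
        f_new = (q0.T @ h_new) / (mask.T @ h_new ** 2)
        change = max(np.abs(h_new - h).max(), np.abs(f_new - f).max())
        h, f = h_new, f_new
        if change < tol:
            break
    resid = (q0 - np.outer(h, f)) * mask
    return f, h, float(np.sum(resid ** 2))

# Hypothetical 4x4 triangle built from known parameters, so the fit
# should reproduce the observed cells almost exactly.
h_true = np.array([100.0, 110.0, 120.0, 130.0])
f_true = np.array([0.5, 0.3, 0.15, 0.05])
q = np.outer(h_true, f_true)
rows, cols = np.indices(q.shape)
q[rows + cols > 3] = np.nan
f, h, sse = fit_bf(q)
```

Only the products f(d)h(w) are identified; f and h individually are determined up to a common scale, which is consistent with needing one parameter fewer than the raw count.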

### Testing Implication 3 - Test of Linearity

- We create a scatter plot of raw residuals against $c(w,d)$.
- Can also use Mack's test by plotting $c(w,d+1)$ against $c(w,d)$.

### Testing Implication 4 - Test of Stability

#### Test 1
- Plot the incremental residuals against time (i.e. AY), which basically means going down the column.

#### Test 2
- Look at a moving average of the factors to see if the <b>fixed levels</b> are changing over time.
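
A moving average down a column of age-to-age factors is one simple way to look for a shifting level. The sketch below assumes a plain unweighted window; the function name `moving_average`, the window length and the factor values are hypothetical.

```python
import numpy as np

def moving_average(factors, window=3):
    """Unweighted moving average of one column of factors, ordered by AY."""
    factors = np.asarray(factors, dtype=float)
    kernel = np.ones(window) / window
    # mode="valid" keeps only positions where the full window fits
    return np.convolve(factors, kernel, mode="valid")

# Hypothetical factors for one development age, oldest AY first.
factors = [1.50, 1.52, 1.49, 1.58, 1.61, 1.63]
smoothed = moving_average(factors, window=3)
```

A steady drift in the smoothed values suggests the level is changing, in which case averaging over all years may be less appropriate than favoring recent ones.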

#### Test 3
- Use a state-space model, which compares the degree of instability of the observations around the current mean to the degree of instability in the mean itself over time.
    - Helps to determine whether we should use all data or a weighted average that favors more recent years.

### Testing Implication 5 - Correlation of Development Factors

- Venter assumes that his correlation test is a test for AY independence.
- Calculate sample corr. for all pairs of columns and then count how many of them are significant.

<center><img src="images/Venter-corr-1.jpg"></center>
<center><img src="images/Venter-corr-2.jpg"></center>

- No. of testable pairs of columns: $m = \binom{n-3}{2}$
    - We have n-3 here because T cannot be calculated for column pairs with fewer than 3 development factors.
<br><br>
- No. of pairs that display significant correlation at the 10% level is: X $\sim$ binomial(m, 0.1)
    - $E[X] = mp = 0.1m$
    - $\sigma_X = \sqrt{mp(1-p)} = \sqrt{m*0.1*0.9} = 0.3\sqrt{m}$
    - We can use this to figure out the max number of column pairs that can have significant correlation.
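
The binomial arithmetic above can be worked through for a given triangle size. This is a small sketch: the function name `correlation_pair_summary` is hypothetical, and the cutoff of the mean plus `k` standard deviations is an assumed convention for turning the mean and standard deviation into a maximum count.

```python
import math

def correlation_pair_summary(n, p=0.1, k=1.0):
    """Expected count of significant column pairs under independence.

    n: size of the triangle as used in the m = C(n-3, 2) formula
    p: significance level of each pairwise test
    k: assumed number of standard deviations used for the cutoff
    """
    m = math.comb(n - 3, 2)                    # testable pairs of columns
    mean = m * p
    sd = math.sqrt(m * p * (1 - p))
    return m, mean, sd, mean + k * sd

# Hypothetical triangle with n = 10.
m, mean, sd, cutoff = correlation_pair_summary(10)
```

With n = 10 this gives 21 testable pairs, so about 2.1 significant pairs are expected by chance alone; a count well above the cutoff would point to correlated columns.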
    

### Testing Implication 6 - CY Effect

- <b>Additive diagonal effect</b>: Uses regression to test for diagonal effects.

<center><img src="images/Venter-CY.jpg"></center>

<center><img src="images/Venter-CY-2.jpg"></center>
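
One way to set up a regression test for additive diagonal effects is to regress the incremental losses on a dummy variable for each development age plus a dummy variable for each diagonal. The sketch below is an illustration under stated assumptions and not necessarily the design shown in the figures: it uses every observed cell, drops the first diagonal as the base level to avoid collinearity, and uses ordinary least squares standard errors. The function name `diagonal_effect_regression` and the sample data are hypothetical.

```python
import numpy as np

def diagonal_effect_regression(q):
    """Fit q(w,d) = a(d) + b(w+d) by least squares, with b(0) fixed at 0.

    q: 2-D array of incremental losses (rows = AY w, columns = age d),
       NaN where unobserved.
    Returns column effects, diagonal effects (diagonals 1, 2, ...) and
    the standard errors of the diagonal effects.
    """
    q = np.asarray(q, dtype=float)
    w_idx, d_idx = np.nonzero(~np.isnan(q))
    y = q[w_idx, d_idx]
    diag = w_idx + d_idx                       # calendar-year index
    n_cols = q.shape[1]
    n_diags = int(diag.max()) + 1

    X = np.zeros((len(y), n_cols + n_diags - 1))
    X[np.arange(len(y)), d_idx] = 1.0          # development-age dummies
    later = np.nonzero(diag > 0)[0]            # first diagonal is the base
    X[later, n_cols + diag[later] - 1] = 1.0   # diagonal dummies

    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    dof = len(y) - X.shape[1]                  # assumes more cells than parameters
    sigma2 = resid @ resid / dof
    se = np.sqrt(sigma2 * np.diag(np.linalg.pinv(X.T @ X)))
    return beta[:n_cols], beta[n_cols:], se[n_cols:]

# Hypothetical triangle with a +10 shock on the third diagonal (w + d = 2).
col_level = np.array([50.0, 30.0, 15.0, 5.0])
q = np.tile(col_level, (4, 1))
rows, cols = np.indices(q.shape)
q[rows + cols == 2] += 10.0
q[rows + cols > 3] = np.nan
col_eff, diag_eff, diag_se = diagonal_effect_regression(q)
```

A diagonal coefficient that is large relative to its standard error would indicate a high or low diagonal. In this noise-free example the shock is recovered exactly, so the standard errors are essentially zero.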