# Notebook for understanding and exploring Design of Experiments, DoE 

DoE is the design of any task that aims to describe and explain the variation of information under conditions that are hypothesized to reflect the variation. The design refers to how we conduct the experiments so that we gain the maximum amount of information from them.

In cases where an experiment is expensive or time consuming to perform, designing the experiments properly can save a lot of time and money.

## Different Designs 

We can apply different designs to our experiments to gain information. From a design we can estimate how the different variables affect the response. We can also calculate the *power* of the design, which is the probability that the design reveals an effect that is actually present. As a rule of thumb, power should be at least 80%.
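To make the idea of power concrete, here is a minimal sketch that approximates the power of comparing two settings with a two-sided z-test. The function name `two_sample_power`, the normal approximation, and the example numbers are assumptions made for illustration; they are not taken from a particular DoE tool.

```python
from math import sqrt
from statistics import NormalDist


def two_sample_power(effect_size, n_per_group, alpha=0.05):
    """Approximate power of a two-sided, two-sample z-test.

    effect_size: true difference in means divided by the standard deviation
                 (assumed known and equal in both groups).
    n_per_group: number of runs at each of the two settings.
    """
    z = NormalDist()
    z_crit = z.inv_cdf(1 - alpha / 2)
    # Expected value of the test statistic when the effect is real.
    shift = effect_size * sqrt(n_per_group / 2)
    # Probability of landing in either rejection region.
    return z.cdf(shift - z_crit) + z.cdf(-shift - z_crit)


power = two_sample_power(effect_size=0.5, n_per_group=64)
print(f"power = {power:.3f}")
```

With these made-up numbers the power comes out just above 0.80. Reducing `n_per_group` lowers it, which is the trade-off a design has to balance.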


<div>
<img src="./images/doe_cycle.png" width="500">
</div>

### OVAT, One variable at a time 

OVAT is a method where we vary one variable at a time while holding the others fixed, which in practice resembles a sequential grid search over the problem. It is simple to carry out, but covering the search space this way is time consuming and requires a lot of experiments, and because the variables are never changed together it can miss the true optimum when they interact.

Experimental design aims to do better than OVAT, finding the optimum in much less time.

Some advantages of using experimental design:
* We find the main effects of the variables on the response
* We find the interactions between variables in their effect on the response
* The number of experiments is unknown in advance with OVAT, but known per step in experimental design
* With a new objective we need to redo the OVAT method, but we can re-use the experimental design results






### Factorial Design 

A factorial experiment is an experiment whose design consists of two or more factors, each with discrete possible values or "levels", and whose experimental units take on all the possible combinations of these levels. Such designs allow the investigator to study the effect of each factor on the response, as well as the effect of interactions between the factors on the response.

One major advantage of factorial design over OVAT is that it shows when one variable is much more important to the response than the others, which OVAT does not capture.

One disadvantage of factorial design is that the number of runs needed grows exponentially with the number of factors. This holds for **full factorial designs** (as explained here), but is mitigated by other designs such as **fractional factorial designs**.

The simplest factorial experiment has two levels for each of two factors. We often depict such a design as a square, or as a cube or hypercube when there are more factors, with the result of each experiment placed on the vertex given by the factor levels used.

Given 2 factors with 2 levels each, we call this a 2x2 experiment, or $2^2$. In general, the number of experiments is $\text{number of levels}^{\text{number of factors}}$.
So if we have 3 factors with 2 levels each, we need 8 experiments to capture all the relationships between them.
<div>
<img src="./images/factorial_design.png" width="200">
</div>
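As a small illustration of the run count, the sketch below enumerates a full factorial design with `itertools.product`. The helper name `full_factorial` and the coded levels -1/+1 are choices made for this example only.

```python
from itertools import product


def full_factorial(levels_per_factor):
    """Return every combination of levels, one tuple per experimental run."""
    return list(product(*levels_per_factor.values()))


# Three factors with two levels each, in coded units (-1 = low, +1 = high).
factors = {"A": [-1, 1], "B": [-1, 1], "C": [-1, 1]}
runs = full_factorial(factors)

for run in runs:
    print(dict(zip(factors, run)))
print(len(runs))  # 8, i.e. 2**3
```

The same helper also handles factors with different numbers of levels, in which case the run count is the product of the level counts.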



| A/B | 1        | 2        | 3        |
|-----------------|-----------------|-----------------|-----------------|
|1| $\mu_{11}$ | $\mu_{12}$ | $\mu_{13}$ |
|2| $\mu_{21}$ | $\mu_{22}$ | $\mu_{23}$ |

#### Cell mean 
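The expected response to a given treatment combination (an experiment run at one particular combination of factor levels) is called a cell mean. In the table above, $\mu_{ij}$ is the cell mean for level $i$ of A and level $j$ of B.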

#### Contrast 
A contrast in cell means is a linear combination of cell means in which the coefficients sum to 0. Imagine a table describing the results of the factorial experiment, with each result given as one $\mu_{ij}$. One contrast might be
$$\mu_{11} - \mu_{12}$$

This contrast compares the mean responses of the treatment combinations 11 and 12. The coefficients are 1 and -1, which sum to zero.

As an example, take the contrast 
$$\mu_{11}+\mu_{12}+\mu_{13}-\mu_{21}-\mu_{22}-\mu_{23}$$

This compares the response from the two levels of A. The main effect of A is said to be absent if this expression equals 0. 
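The main-effect contrast for A can be evaluated numerically. The sketch below uses made-up cell means for the 2x3 table, so the numbers are purely illustrative.

```python
import numpy as np

# Hypothetical cell means: rows are levels of A (1, 2), columns are levels of B (1, 2, 3).
mu = np.array([[10.0, 12.0, 14.0],
               [13.0, 15.0, 17.0]])

# Coefficients of the main-effect contrast for A, laid out like the table.
coef_A = np.array([[ 1,  1,  1],
                   [-1, -1, -1]])

# A contrast's coefficients must sum to zero.
coef_sum = int(coef_A.sum())

# mu_11 + mu_12 + mu_13 - mu_21 - mu_22 - mu_23
contrast_A = float((coef_A * mu).sum())
print(coef_sum, contrast_A)
```

Here the contrast is -9, so with these invented cell means the main effect of A would be present.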

#### Interaction 
Interaction is the lack of additivity between factors. The contrasts
$$\mu_{11}-\mu_{12}-\mu_{21}+\mu_{22}$$ 
$$\mu_{11}-\mu_{13}-\mu_{21}+\mu_{23}$$
belong to the AxB interaction. The interaction is absent if both expressions are 0. 
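The two interaction contrasts can be checked on an additive and a non-additive table. Both tables below are invented for illustration, and `interaction_contrasts` is a hypothetical helper name.

```python
import numpy as np


def interaction_contrasts(mu):
    """Return the two AxB contrasts for a 2x3 table of cell means."""
    c1 = mu[0, 0] - mu[0, 1] - mu[1, 0] + mu[1, 1]
    c2 = mu[0, 0] - mu[0, 2] - mu[1, 0] + mu[1, 2]
    return float(c1), float(c2)


# Additive table: moving from A=1 to A=2 adds 3 at every level of B.
additive = np.array([[10.0, 12.0, 14.0],
                     [13.0, 15.0, 17.0]])

# Same table, but the cell (A=2, B=3) responds more strongly than additivity predicts.
non_additive = additive.copy()
non_additive[1, 2] = 20.0

print(interaction_contrasts(additive))
print(interaction_contrasts(non_additive))
```

For the additive table both contrasts are zero. In the modified table the second contrast becomes nonzero, which signals an AxB interaction.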

#### Effect in a 2-level factorial design 

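$$\text{Effect}(\delta y) = \frac{\sum{y_+}}{n_+} - \frac{\sum{y_-}}{n_-}$$

Here $y_+$ and $y_-$ are the responses from the runs where the factor is at its high and low level, and $n_+$ and $n_-$ are the numbers of such runs.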
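As a worked instance of the effect formula, the sketch below computes the effects in a $2^2$ design. The responses are invented, and taking the interaction column as the product of the coded A and B columns is the usual convention for two-level designs, assumed here.

```python
import numpy as np

# Hypothetical 2^2 design in coded units (-1 = low, +1 = high) with made-up responses.
A = np.array([-1,  1, -1,  1])
B = np.array([-1, -1,  1,  1])
y = np.array([20.0, 30.0, 25.0, 45.0])


def effect(column, y):
    """Mean response at the high level minus mean response at the low level."""
    return float(y[column == 1].mean() - y[column == -1].mean())


effect_A = effect(A, y)
effect_B = effect(B, y)
effect_AB = effect(A * B, y)  # interaction column: elementwise product of A and B
print(effect_A, effect_B, effect_AB)
```

With these numbers the effects are 15, 10 and 5 for A, B and AxB.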

As we can see, the information is carried by the coefficients of the contrasts, so we often summarise these as column vectors. 
For the example table above, we might get 

| cell | A  | B   | B   | AxB | AxB |
|------|----|-----|-----|-----|-----|
| 11   | 1  | 1   | 0   | 1   |1    |
| 12   | 1  | -1  | 1   | -1  |0    |
| 13   | 1  | 0   | -1  | 0   |-1   |
| 21   | -1 | 1   | 0   | -1  |-1   |
| 22   | -1 | -1  | 1   | 1   |0    |
| 23   | -1 | 0   | -1  | 0   |1    |

The columns of such a table are called contrast vectors; their components add to zero. Each effect is determined by both the pattern of components in its columns and the number of columns. 
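The contrast vectors can be stored as the columns of a matrix and applied to all cell means at once. The matrix below copies the table, while the cell means are hypothetical values chosen for illustration.

```python
import numpy as np

# Rows: cells 11, 12, 13, 21, 22, 23. Columns: A, B, B, AxB, AxB.
contrasts = np.array([
    [ 1,  1,  0,  1,  1],
    [ 1, -1,  1, -1,  0],
    [ 1,  0, -1,  0, -1],
    [-1,  1,  0, -1, -1],
    [-1, -1,  1,  1,  0],
    [-1,  0, -1,  0,  1],
])

# Every contrast vector sums to zero.
column_sums = contrasts.sum(axis=0)

# Hypothetical cell means in the same row order as the table.
mu = np.array([10.0, 12.0, 14.0, 13.0, 15.0, 17.0])

# One value per contrast column.
values = contrasts.T @ mu
print(column_sums, values)
```

Because the example cell means are additive, the two AxB values are zero while the A and B values are not.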




### Fractional Factorial Design

We carefully pick a subset of the experiments from a full factorial design. The subset is chosen to exploit the sparsity-of-effects principle. In short, it takes advantage of the fact that many experiments in the full factorial design are redundant. 

The sparsity-of-effects principle states that higher-order interactions are typically negligible, so the main effects and two-factor interactions are the most significant. 

We express the fractional design as 
$$\text{levels}^{\text{number of factors} - \text{size of the fraction of full factorial used}}$$
often denoted 
$$l^{k - p}$$

Here p determines the size of the fraction of the full factorial used: the design contains $1/l^p$ of the full factorial runs. We call p the number of generators, that is, relationships that determine the intentionally confounded effects that reduce the number of runs needed. In a two-level design, each generator halves the number of runs required.
Say we have a 5-factor experiment with 2 generators, giving $2^{5-2} = 8$ experiments. We can imagine carrying out a full factorial in A, B and C, and then setting the levels of the remaining factors D and E from interactions of A, B and C. For example, $D = AB$ and $E = AC$. These are the generators.
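The $2^{5-2}$ example can be sketched as follows: build the full factorial in A, B and C, then derive the D and E columns from the generators. Coded levels -1/+1 are assumed, so an interaction column is simply an elementwise product.

```python
from itertools import product

import numpy as np

# Full factorial in the three base factors A, B, C: 2**3 = 8 runs.
base = np.array(list(product([-1, 1], repeat=3)))
A, B, C = base.T

# Generators: D is confounded with the AB interaction, E with the AC interaction.
D = A * B
E = A * C

design = np.column_stack([A, B, C, D, E])
print(design)
print(design.shape)  # (8, 5): five factors in eight runs instead of 32
```

Each row is one run and gives the level of all five factors. The price of the saving is that D cannot be separated from AB, nor E from AC.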

In factorial designs one rarely encounters more than 2 levels per factor. 


### Optimal experimental design / Optimization designs 

In DoE, optimal designs are designs that are optimal with respect to some statistical criterion. They allow parameters to be estimated without bias and with minimum variance. In contrast, a non-optimal design requires more runs to estimate the same parameters with the same precision. 

Some statistical criteria:

* A-optimality ("average"): seeks to minimize the trace of the inverse of the information matrix. This minimizes the average variance of the estimated regression coefficients. 

* C-optimality: minimizes the variance of a best linear unbiased estimator of a predetermined linear combination of the model parameters.
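Both criteria can be compared on two candidate designs. This is a minimal sketch assuming a straight-line model $y = b_0 + b_1 x$ with equal error variance, so that the information matrix is proportional to $X^T X$. The helper names and the two four-run designs are hypothetical.

```python
import numpy as np


def model_matrix(x):
    """Model matrix for y = b0 + b1 * x: an intercept column plus x."""
    x = np.asarray(x, dtype=float)
    return np.column_stack([np.ones_like(x), x])


def a_criterion(X):
    """Trace of the inverse information matrix (X^T X)^-1; lower is better."""
    return float(np.trace(np.linalg.inv(X.T @ X)))


def c_criterion(X, c):
    """c^T (X^T X)^-1 c, proportional to the variance of the estimate of c^T beta."""
    c = np.asarray(c, dtype=float)
    return float(c @ np.linalg.inv(X.T @ X) @ c)


# Two candidate designs with four runs each on the interval [-1, 1].
endpoints = model_matrix([-1, -1, 1, 1])
spread = model_matrix([-1, -1 / 3, 1 / 3, 1])

slope = [0, 1]  # the linear combination that picks out b1
print(a_criterion(endpoints), a_criterion(spread))
print(c_criterion(endpoints, slope), c_criterion(spread, slope))
```

Under these assumptions, placing the runs at the endpoints gives the lower value on both criteria, so it estimates the coefficients with less variance from the same number of runs.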
