# Chapter 6 — Two-Level Factorial Designs ($2^k$ Factorials)

> Based on Douglas C. Montgomery, *Design and Analysis of Experiments*, and lecture slides.

---

## 1. Introduction

A **two-level factorial design ($2^k$)** is a special case of the general factorial design:

- $k$ factors, each at **two levels** (low = “−”, high = “+”)  
- Response assumed **approximately linear** between chosen levels  
- Factors can be **quantitative** or **qualitative**  
- **Extremely common** in industrial experiments  
- Acts as the **building block** for more advanced designs  
  (fractional factorials, response-surface, etc.)

---

## 2. The Simplest Case: The $2^2$ Design

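- Two factors (A and B), each at two levels → $2^2 = 4$ treatment combinations:
  $(1),\; a,\; b,\; ab$  
- A lowercase letter marks a factor at its high level; $(1)$ means both factors are at their low level  
- “−” and “+” denote low and high levels  
- Geometrically → corners of a **square**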

**Example:**  
Chemical process  
A = reactant concentration, B = catalyst amount, $y$ = recovery  

---

## 3. Analysis Procedure

1. **Estimate factor effects**
2. **Formulate the model**
   - with replication → fit full model  
   - without replication → use **normal probability plot**
3. **Perform ANOVA**
4. **Refine model**
5. **Check residuals**
6. **Interpret results**

---

## 4. Estimation of Factor Effects

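For $n$ replicates per treatment combination, let $(1)$, $a$, $b$, and $ab$ denote the **totals** of the $n$ observations at each combination:

$$
\text{Effect of A} =
\frac{1}{2n}\left[(a + ab) -
\big((1) + b\big)\right]
$$

$$
\text{Effect of B} =
\frac{1}{2n}\left[(b + ab) -
\big((1) + a\big)\right]
$$

$$
\text{Interaction AB} =
\frac{1}{2n}\left[\big(ab + (1)\big) -
(a + b)\right]
$$

Equivalently, each effect is half the same signed sum of the cell means, for example $A = \tfrac{1}{2}\left[(\bar y_a + \bar y_{ab}) - (\bar y_{(1)} + \bar y_b)\right]$.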

**Interpretation**
- **Magnitude** → strength of effect  
- **Sign** → direction (+ increases, − decreases)
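
As a rough illustration of the effect formulas, the sketch below computes the three estimates from treatment totals (sums over the replicates). The totals are assumed values that are not listed in this summary; they were chosen to be consistent with the sums of squares in Section 5 and the regression coefficients in Section 7. The function name `effects_2x2` is hypothetical.

```python
# Treatment totals over n = 3 replicates.
# ASSUMPTION: these totals are not given in the notes; they are chosen to
# agree with the SS values and regression coefficients quoted later.
n = 3
totals = {"(1)": 80, "a": 100, "b": 60, "ab": 90}


def effects_2x2(totals, n):
    """Return the A, B and AB effect estimates of a replicated 2^2 design."""
    one, a, b, ab = (totals[key] for key in ("(1)", "a", "b", "ab"))
    return {
        "A": (ab + a - b - one) / (2 * n),   # high A minus low A
        "B": (ab + b - a - one) / (2 * n),   # high B minus low B
        "AB": (ab + one - a - b) / (2 * n),  # diagonal minus off-diagonal
    }


effects = effects_2x2(totals, n)
for name, value in effects.items():
    print(f"{name:>2}: {value:+.2f}")
```

With these totals, A is positive and B is negative, which matches the signs of the coefficients in the coded regression model.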

---

## 5. Sum of Squares via Contrasts

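A **contrast** is the signed sum of treatment totals inside the brackets of an effect formula, for example $\text{Contrast}_A = ab + a - b - (1)$. In the $2^2$ design with $n$ replicates, each effect has one degree of freedom and

$$
SS = \frac{(\text{Contrast})^2}{4n}
$$

### Example (n = 3)

Percent contribution is $SS / SS_T$, with $SS_T = 323.00$.

| Factor | Contrast | SS | % Contribution |
|:--|--:|--:|--:|
| A | 50 | 208.33 | 64.5 % |
| B | −30 | 75.00 | 23.2 % |
| AB | 10 | 8.33 | 2.6 % |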
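
The sum-of-squares formula can be checked with a few lines of Python. This is a minimal sketch: the contrast values are an assumption (they are the values implied by the SS column and by $n = 3$), and the total sum of squares is taken from the ANOVA table in Section 6.

```python
n = 3

# ASSUMPTION: contrasts implied by SS = contrast^2 / (4n) and the signs of
# the fitted regression coefficients; they are not listed explicitly.
contrasts = {"A": 50, "B": -30, "AB": 10}
ss_total = 323.00  # total sum of squares from the ANOVA table

# One degree of freedom per effect in a 2^2 design.
ss = {name: c ** 2 / (4 * n) for name, c in contrasts.items()}
pct = {name: 100 * value / ss_total for name, value in ss.items()}

for name in contrasts:
    print(f"{name:>2}: SS = {ss[name]:7.2f}  ({pct[name]:.1f} % of SS_T)")
```

The remaining share of $SS_T$ belongs to error, so the percentages for A, B, and AB do not sum to 100.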

---

## 6. ANOVA Table Example

| Source | SS | df | MS | F | p |
|--|--|--|--|--|--|
| Model | 291.67 | 3 | 97.22 | 24.82 | 0.0002 |
| A | 208.33 | 1 | 208.33 | 53.19 | < 0.0001 |
| B | 75.00 | 1 | 75.00 | 19.15 | 0.0024 |
| AB | 8.33 | 1 | 8.33 | 2.13 | 0.1828 |
| Error | 31.33 | 8 | 3.92 |  |  |
| Total | 323.00 | 11 |  |  |  |

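The overall model is significant ($p = 0.0002$). Both main effects are significant (A: $p < 0.0001$; B: $p = 0.0024$), while the AB interaction is not ($p = 0.1828$) and can be dropped from the model.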
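
The $F$ column follows directly from the SS and df columns. The sketch below recomputes it from the tabulated values; p-values are left out because the Python standard library has no $F$ distribution, so they would need a statistics package or a table.

```python
# Values copied from the ANOVA table above.
ss_effects = {"A": 208.33, "B": 75.00, "AB": 8.33}
ss_model, df_model = 291.67, 3
ss_error, df_error = 31.33, 8

ms_error = ss_error / df_error  # mean square error

# Each effect has 1 degree of freedom, so MS_effect equals SS_effect.
f_ratios = {name: value / ms_error for name, value in ss_effects.items()}
f_model = (ss_model / df_model) / ms_error

print(f"MS_E = {ms_error:.2f}")
print(f"Model: F = {f_model:.2f}")
for name, f in f_ratios.items():
    print(f"{name:>5}: F = {f:.2f}")
```

Small differences in the last digit against the table come from rounding the SS values to two decimals.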

---

## 7. Regression Model (Coded Variables)

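With the nonsignificant AB term dropped, the fitted model in coded variables ($A, B = \pm 1$) is

$$
\hat y = 27.5 + 4.17A - 2.5B
$$

The intercept is the grand average of all observations, and each coefficient is half the corresponding effect estimate, because an effect measures the change from $-1$ to $+1$ (two coded units).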

In actual variables:

$$
\text{Conversion} = 18.33 + 0.8333(\text{Conc}) - 5(\text{Catalyst})
$$

---

## 8. Conversion Between Coded and Natural Variables

$$
X_A = 
\frac{\text{Conc} -
\frac{\text{Conc}_{\text{high}} + \text{Conc}_{\text{low}}}{2}}
{\frac{\text{Conc}_{\text{high}} - \text{Conc}_{\text{low}}}{2}}
$$

Example: Low = 15, High = 25  
→ Conc = 15 ⇒ $X_A=-1$; Conc = 25 ⇒ $X_A=+1$; Conc = 20 ⇒ $X_A=0$.
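
The coding formula and the two regression models can be cross-checked numerically. This is a sketch under one assumption: the catalyst levels (low = 1, high = 2) are not stated in these notes and are inferred from the coefficients of the natural-variable model. The function names are hypothetical.

```python
def to_coded(x, low, high):
    """Map a natural value onto the coded scale where low -> -1, high -> +1."""
    center = (high + low) / 2
    half_range = (high - low) / 2
    return (x - center) / half_range


def to_natural(x_coded, low, high):
    """Inverse of to_coded."""
    return (high + low) / 2 + x_coded * (high - low) / 2


def predict_coded(conc, catalyst):
    """Coded-variable model from Section 7."""
    x_a = to_coded(conc, 15, 25)
    x_b = to_coded(catalyst, 1, 2)  # ASSUMPTION: catalyst levels 1 and 2
    return 27.5 + 4.17 * x_a - 2.5 * x_b


def predict_natural(conc, catalyst):
    """Natural-variable model from Section 7."""
    return 18.33 + 0.8333 * conc - 5 * catalyst


for conc, catalyst in [(15, 1), (25, 1), (20, 1.5), (25, 2)]:
    print(conc, catalyst,
          round(predict_coded(conc, catalyst), 2),
          round(predict_natural(conc, catalyst), 2))
```

The two predictions agree up to the rounding of the published coefficients.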

---

## 9. The $2^3$ Factorial Design

Three factors → eight runs (cube geometry)

$$
A = \frac{1}{4n}\!\left[(a+ab+ac+abc)
-\big((1)+b+c+bc\big)\right]
$$

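$$
AB = \frac{1}{4n}\!\left[\big(abc+ab+c+(1)\big)
-\big(bc+b+ac+a\big)\right]
$$

Here the lowercase symbols are treatment totals over the $n$ replicates. Each effect is the difference between the average response at the “+” level and at the “−” level of the corresponding column; for AB, the “+” runs are those where A and B have the same sign.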
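
One way to avoid sign mistakes in these formulas is to generate the runs in standard order and multiply the factor columns. The sketch below does this for the $2^3$ design; the helper names are hypothetical.

```python
from itertools import product


def standard_order(k):
    """Rows of -1/+1 levels for a 2^k design; the first factor varies fastest."""
    return [tuple(reversed(levels)) for levels in product((-1, 1), repeat=k)]


def label(row, names="abc"):
    """Treatment label: letters of the factors at their high level, or (1)."""
    text = "".join(name for name, level in zip(names, row) if level == 1)
    return text or "(1)"


def contrast_sign(row, term, names="ABC"):
    """Sign of a run in the contrast for a term such as 'A' or 'AB'."""
    sign = 1
    for letter in term:
        sign *= row[names.index(letter)]
    return sign


rows = standard_order(3)
labels = [label(row) for row in rows]

plus = [lab for lab, row in zip(labels, rows) if contrast_sign(row, "AB") == 1]
minus = [lab for lab, row in zip(labels, rows) if contrast_sign(row, "AB") == -1]

print("standard order:", labels)
print("AB plus runs:  ", plus)
print("AB minus runs: ", minus)
```

The AB effect is then the sum of the "plus" totals minus the sum of the "minus" totals, divided by $4n$.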

---

## 10. Yates Algorithm (Shortcut Method)

Systematic procedure to obtain all effects and SS.

**Steps**

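1. Arrange the treatment totals in **standard order**: $(1), a, b, ab, c, ac, bc, abc, \dots$
2. Build a new column: the top half holds the sums of successive pairs, the bottom half holds the differences of the same pairs (second entry minus first)  
3. Repeat step 2 on each new column, $k$ times in total; the final column holds the grand total followed by the contrasts in standard order  
4. Divide each contrast by $n\,2^{k-1}$ to get the effect  
5. Compute $SS_i = (\text{contrast}_i)^2 / (n\,2^k) = (\text{effect}_i)^2 × 2^{k-2} n$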

**Advantages**
- Fast manual calculation  
- Gives the same effect estimates and sums of squares that packages such as Minitab and Design-Expert report
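
A minimal sketch of the algorithm, using repeated pairwise sums and differences on treatment totals in standard order. The example totals are assumed values for the $2^2$ chemical-process example (they are not listed in these notes but are consistent with its sums of squares), and the function name `yates` is hypothetical.

```python
def yates(totals):
    """Apply Yates' algorithm to treatment totals given in standard order.

    Returns [grand total, contrast A, contrast B, contrast AB, ...].
    """
    size = len(totals)
    k = size.bit_length() - 1
    if size != 2 ** k:
        raise ValueError("number of totals must be a power of two")

    column = list(totals)
    for _ in range(k):
        # Top half: sums of successive pairs.
        sums = [column[i] + column[i + 1] for i in range(0, size, 2)]
        # Bottom half: second entry minus first entry of each pair.
        diffs = [column[i + 1] - column[i] for i in range(0, size, 2)]
        column = sums + diffs
    return column


n, k = 3, 2
totals = [80, 100, 60, 90]  # ASSUMED totals for (1), a, b, ab

result = yates(totals)
contrasts = result[1:]
effects = [c / (n * 2 ** (k - 1)) for c in contrasts]
sums_of_squares = [c ** 2 / (n * 2 ** k) for c in contrasts]

print(result)  # grand total, then contrasts for A, B, AB
```

For these totals the final column is `[330, 50, -30, 10]`: the grand total followed by the A, B, and AB contrasts.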

---

## 11. Plasma Etching Experiment (Example)

| Factor | Contrast | Effect | SS | % Contribution |
|--|--:|--:|--:|--:|
| A | −813 | −101.6 | 41 310.6 | 7.8 |
| B | 59 | 7.4 | 217.6 | 0.0 |
| C | 2449 | 306.1 | 374 850.1 | 70.5 |
| AB | −199 | −24.9 | 2 475.1 | 0.5 |
| AC | −1229 | −153.6 | 94 402.6 | 17.8 |
| BC | −17 | −2.1 | 18.1 | 0.0 |
| ABC | 45 | 5.6 | 126.6 | 0.0 |

**Main finding:** RF power (C) dominates (about 70 % of the total sum of squares); the AC interaction is next (about 18 %), followed by A (about 8 %). The remaining effects are negligible.
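
The Effect and SS columns can be recomputed from the contrasts. The sketch below assumes $n = 2$ replicates, which is not stated in the table but is the value implied by the ratio of each contrast to its effect.

```python
n, k = 2, 3  # ASSUMPTION: n = 2, inferred from contrast / effect = n * 2^(k-1) = 8

# Contrasts copied from the table above.
contrasts = {"A": -813, "B": 59, "C": 2449, "AB": -199,
             "AC": -1229, "BC": -17, "ABC": 45}

effects = {term: c / (n * 2 ** (k - 1)) for term, c in contrasts.items()}
ss = {term: c ** 2 / (n * 2 ** k) for term, c in contrasts.items()}

# Rank the terms by sum of squares, largest first.
ranked = sorted(ss, key=ss.get, reverse=True)

for term in ranked:
    print(f"{term:>3}: effect = {effects[term]:+9.3f}   SS = {ss[term]:12.4f}")
```

Note that each effect carries the sign of its contrast, so A, AB, AC, and BC are negative.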

---

## 12. Orthogonality Properties

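- Every column (except I) has equal numbers of “+” and “−”  
- The sum of the elementwise products of any two distinct columns is 0, so the effect estimates are uncorrelated  
- Interaction columns are elementwise products: $A × B = AB$, $B × C = BC$, etc.; any column times itself gives I  
- Together these properties make the $2^k$ an **orthogonal design**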
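
These properties are easy to verify by brute force. The sketch below builds all seven effect columns of a $2^3$ design and checks balance and pairwise orthogonality; variable names are illustrative only.

```python
from itertools import combinations, product
from math import prod

k = 3
factors = "ABC"

# Runs in standard order: the first factor varies fastest.
runs = [tuple(reversed(levels)) for levels in product((-1, 1), repeat=k)]

# One column per main effect and interaction, built as elementwise products.
columns = {}
for size in range(1, k + 1):
    for term in combinations(range(k), size):
        name = "".join(factors[i] for i in term)
        columns[name] = [prod(run[i] for i in term) for run in runs]

# Balance: every column has as many +1 as -1 entries.
balanced = all(sum(col) == 0 for col in columns.values())

# Orthogonality: dot product of any two distinct columns is zero.
orthogonal = all(
    sum(x * y for x, y in zip(columns[p], columns[q])) == 0
    for p, q in combinations(columns, 2)
)

print(sorted(columns), balanced, orthogonal)
```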

---

## 13. Diagnostics and Model Refinement

- Drop nonsignificant terms  
- Check residual plots for:
  - constant variance  
  - normality  
  - independence  
- Visualize via **cube**, **contour**, and **response-surface** plots  

---

## 14. Unreplicated $2^k$ Designs

- One observation per corner of the cube  
- Also called “single replicate” $2^k$ design  
- Used when resources are limited  

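**Issue:** With one observation per run and the full model fitted, no degrees of freedom remain for error, so $\sigma^2$ cannot be estimated and the usual $F$-tests are unavailable.

**Remedies**

- Pool high-order interactions as error (sparsity-of-effects principle)  
- **Normal probability plot** of the effect estimates (Daniel, 1959): negligible effects fall along a straight line, active effects fall off it  
- **Lenth’s method** (approximate t-test based on a robust pseudo standard error)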
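
A minimal sketch of Lenth's method follows. The effect estimates are hypothetical values made up for illustration (fifteen effects, as in a $2^4$ design), not data from any example in these notes. The margin of error uses the tabulated $t_{0.975,\,5} = 2.571$, since Lenth's rule takes $m/3$ degrees of freedom for $m$ effects.

```python
from statistics import median


def lenth_pse(effects):
    """Lenth's pseudo standard error (PSE) for a list of effect estimates."""
    abs_effects = [abs(e) for e in effects]
    s0 = 1.5 * median(abs_effects)
    # Drop effects that look active before re-estimating the scale.
    trimmed = [e for e in abs_effects if e < 2.5 * s0]
    return 1.5 * median(trimmed)


# HYPOTHETICAL effect estimates for illustration only.
effects = {
    "A": 20.0, "B": 3.0, "C": 10.0, "D": 15.0,
    "AB": 0.5, "AC": -18.0, "AD": 16.0, "BC": 2.0, "BD": -0.5, "CD": -1.0,
    "ABC": 2.0, "ABD": 4.0, "ACD": -1.5, "BCD": -2.5, "ABCD": 1.5,
}

pse = lenth_pse(list(effects.values()))

# Margin of error: t quantile (0.975, m/3 = 5 df) taken from a t table.
t_quantile = 2.571
margin_of_error = t_quantile * pse

active = [name for name, e in effects.items() if abs(e) > margin_of_error]
print(f"PSE = {pse:.3f}, ME = {margin_of_error:.3f}, active: {active}")
```

Effects whose absolute value exceeds the margin of error are flagged as likely active; the rest are treated as noise.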

---

## 15. Resin Plant Experiment ($2^4$)

Factors: A = Temperature, B = Pressure, C = Mole ratio, D = Stirring rate  
Response: Filtration rate

Significant effects: A, C, D main effects; AC and AD interactions.

$$
\hat y =
70.06 + 10.81A + 4.94C + 7.31D
- 9.06AC + 8.31AD
$$

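$R^2 = 0.97$: the reduced model accounts for about 97 % of the variability in filtration rate. Pressure (B) does not appear in the model, so it has no detectable effect over the range studied.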
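
Because the model contains interactions, the best setting of one factor depends on the others. The sketch below evaluates the fitted equation at the eight corners of the A, C, D cube; the function name is hypothetical, and "best" here simply means the highest predicted filtration rate.

```python
from itertools import product


def filtration_rate(a, c, d):
    """Fitted model in coded units (each factor is -1 or +1 at the corners)."""
    return (70.06 + 10.81 * a + 4.94 * c + 7.31 * d
            - 9.06 * a * c + 8.31 * a * d)


# Predicted response at every corner of the A, C, D cube.
corners = {
    (a, c, d): filtration_rate(a, c, d)
    for a, c, d in product((-1, 1), repeat=3)
}

best = max(corners, key=corners.get)
for corner, value in sorted(corners.items(), key=lambda item: -item[1]):
    print(corner, round(value, 2))
print("highest predicted rate at (A, C, D) =", best)
```

The highest prediction occurs with A high, C low, and D high: the negative AC coefficient means C helps only when A is low.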

---

## 16. The Drilling Experiment (Example 6-3)

A = Load, B = Flow, C = Speed, D = Mud Type  
Response = Advance rate

The large effects are B, C, and D. Residuals from the model in the original units show nonconstant variance (heteroscedasticity), which suggests **transforming** the response.

---

## 17. Box–Cox Power Transformation

Purpose:
- Stabilize variance  
- Induce normality  
- Simplify model  

**Transformation:**
$$
y^{(\lambda)} =
\begin{cases}
\dfrac{y^\lambda-1}{\lambda}, & \lambda\ne0\\[6pt]
\ln y, & \lambda=0
\end{cases}
$$

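The Box–Cox plot gives a point estimate of $\lambda$ and a 95 % confidence interval.  
If the interval contains $\lambda = 1$, no transformation is needed.

For the drilling experiment, the log transform ($\lambda = 0$) gives the fitted model
$$
\widehat{\ln y} = 1.60 + 0.58B + 0.29C + 0.16D
$$
with $R^2 = 0.98$.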
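
A small sketch of the transformation as written above, together with a back-transformed prediction from the log model. The function names are hypothetical, and the prediction is only as accurate as the rounded coefficients.

```python
import math


def box_cox(y, lam):
    """Box-Cox power transformation of a positive response value."""
    if y <= 0:
        raise ValueError("Box-Cox requires a positive response")
    if lam == 0:
        return math.log(y)
    return (y ** lam - 1) / lam


def predicted_advance_rate(b, c, d):
    """Back-transform the fitted log model to the original units."""
    return math.exp(1.60 + 0.58 * b + 0.29 * c + 0.16 * d)


print(box_cox(4.0, 0.5))        # square-root-type transform
print(box_cox(math.e, 0))       # log transform
print(round(predicted_advance_rate(1, 1, 1), 2))  # all three factors high
```

As $\lambda \to 0$ the power form approaches $\ln y$, which is why the log is used for the $\lambda = 0$ case.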

---

## 18. Center Points in $2^k$ Designs

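Adding replicated runs at the design center (all coded factors = 0):
- Gives a **pure-error estimate** of $\sigma^2$ without replicating the factorial runs
- Tests for **curvature** (pure quadratic effects)
- Indicates whether a first-order model is adequate or a second-order model is needed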

**Hypotheses**
$$
H_0:\;\text{no curvature}
\qquad
H_1:\;\text{curvature exists}
$$

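$$
SS_{\text{curv}} =
\frac{n_F\,n_C\,(\bar y_F - \bar y_C)^2}{n_F + n_C},
\qquad df_{\text{curv}} = 1
$$

where $\bar y_F$ and $\bar y_C$ are the averages of the $n_F$ factorial runs and the $n_C$ center runs.

$$
F = 
\frac{SS_{\text{curv}}/df_{\text{curv}}}
{SS_{\text{PE}}/df_{\text{PE}}}
$$

Here $SS_{\text{PE}}$ is the pure-error sum of squares; when the center runs are the only replicates, $df_{\text{PE}} = n_C - 1$.

If the test is significant, augment the design with **axial runs** to form a **central composite design (CCD)**, which supports a second-order model.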
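
The curvature test can be sketched in a few lines. It uses the single-degree-of-freedom sum of squares $SS_{\text{curv}} = n_F n_C (\bar y_F - \bar y_C)^2 / (n_F + n_C)$ and estimates pure error from the center runs alone. The response values are illustrative assumptions for an unreplicated $2^2$ design with five center points; they are not taken from these notes.

```python
from statistics import mean

# ILLUSTRATIVE data (assumed): four factorial runs and five center runs.
factorial = [39.3, 40.0, 40.9, 41.5]
center = [40.3, 40.5, 40.7, 40.2, 40.6]

n_f, n_c = len(factorial), len(center)
ybar_f, ybar_c = mean(factorial), mean(center)

# Single-degree-of-freedom sum of squares for pure quadratic curvature.
ss_curv = n_f * n_c * (ybar_f - ybar_c) ** 2 / (n_f + n_c)

# Pure error comes only from the replicated center runs.
ss_pe = sum((y - ybar_c) ** 2 for y in center)
df_pe = n_c - 1
ms_pe = ss_pe / df_pe

f_curv = (ss_curv / 1) / ms_pe

# Compare with the tabulated critical value F(0.05; 1, 4) = 7.71.
print(f"SS_curv = {ss_curv:.4f}, MS_PE = {ms_pe:.4f}, F = {f_curv:.3f}")
print("curvature significant at 5 %:", f_curv > 7.71)
```

With these numbers the factorial and center averages are nearly equal, so $F$ is far below the critical value and a first-order model would be judged adequate.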

---

## 19. Example 6-6 (Center Points)

A $2^2$ design augmented with center points: the curvature test is not significant, so a first-order model is adequate.

---

## 20. Practical Guidelines for Center Points

- Use current operating condition as center point  
- 3–6 center runs are enough  
- Check for time trends and abnormal runs  
- Center points not meaningful for purely qualitative factors  

---

## 21. Key Formulas Summary

| Concept | Formula |
|--|--|
| Effect estimate | $\displaystyle \text{Effect}=\frac{\text{Contrast}}{2^{k-1}n}=\frac{1}{2^{k-1}n}\sum(\text{sign}\times \text{treatment total})$ |
| Sum of Squares | $\displaystyle SS=\frac{(\text{Contrast})^2}{2^k n}$ |
| $F$-value | $\displaystyle F=\frac{MS_{\text{effect}}}{MS_E}$ |
| Model (coded) | $\displaystyle \hat y=b_0+\sum b_i x_i+\sum b_{ij}x_i x_j$ |
| Box–Cox | $\displaystyle y^{(\lambda)}=\frac{y^\lambda-1}{\lambda}\ (\lambda\ne0),\quad \ln y\ (\lambda=0)$ |
| Curvature test | $\displaystyle F=\frac{SS_{\text{curv}}/df_{\text{curv}}}{SS_{\text{PE}}/df_{\text{PE}}}$ |

---

## 22. Key Takeaways

- $2^k$ designs estimate all main effects and interactions of $k$ factors with the fewest runs of any complete factorial  
- **Main effects + interactions** easily interpreted  
- **Orthogonality** → independent effect estimates  
- **Unreplicated** designs require graphical/approximate testing  
- **Center points** → curvature detection  
- **Transformations** → variance stabilization & normality  
- Foundation for fractional factorial and response-surface methods  

---

**End of Chapter 6 Summary**
