# Hypothesis Testing in Econometrics

# Properties of OLS Estimator
The Ordinary Least Squares (OLS) estimator is unbiased and supports valid inference only when the regression model satisfies certain assumptions. The six assumptions below are the standard ones for the classical linear regression model.

### 1. **Linearity**
   - The model is linear in the parameters ($ \beta_0, \beta_1 $). As a consequence, the OLS estimator is a linear function of the observed dependent variable ($ Y $).
   - The regression model takes the form:
     $$
     Y = \beta_0 + \beta_1 X + \epsilon
     $$

### 2. **Full Rank**
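   - The independent variables ($ X $) must be linearly independent: no regressor is an exact linear combination of the others.
   - Equivalently, there is no perfect multicollinearity among the predictors. High but imperfect correlation does not violate this assumption, although it inflates the variances of the estimates.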
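
The full-rank condition can be checked numerically. The following is a minimal sketch, assuming NumPy is available; the helper name `has_full_column_rank` and the simulated regressors are hypothetical and only serve the illustration.

```python
import numpy as np


def has_full_column_rank(X):
    """Return True when the columns of X are linearly independent."""
    X = np.asarray(X, dtype=float)
    return bool(np.linalg.matrix_rank(X) == X.shape[1])


# Simulated regressors (an assumption made for illustration only).
rng = np.random.default_rng(0)
n = 50
x1 = rng.normal(size=n)
x2 = rng.normal(size=n)

# Intercept plus two unrelated regressors: full column rank.
X_ok = np.column_stack([np.ones(n), x1, x2])
# Third column is an exact multiple of the second: perfect multicollinearity.
X_bad = np.column_stack([np.ones(n), x1, 2.0 * x1])

print(has_full_column_rank(X_ok))
print(has_full_column_rank(X_bad))
```

When the check fails, at least one coefficient cannot be identified separately from the others, so a regressor has to be dropped or redefined before OLS can be computed.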

### 3. **Regression Model**
   - The relationship between $ Y $ and $ X $ is correctly specified and no relevant variables are omitted, so the error term has zero mean given the regressors: $ E(\epsilon \mid X) = 0 $.

### 4. **Spherical Errors**
   - Errors ($ \epsilon $) must have constant variance (homoscedasticity) and no autocorrelation.
   - Mathematically:
     $$
     Var(\epsilon) = \sigma^2 I
     $$

### 5. **Non-Stochastic Regressors**
   - The regressors ($ X $) are fixed or independent of the error term ($ \epsilon $).

### 6. **Normal Errors**
   - Errors ($ \epsilon $) are normally distributed:
     $$
     \epsilon \sim N(0, \sigma^2)
     $$
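   - Required for the $ t $- and $ F $-statistics to follow exact $ t $ and $ F $ distributions in finite samples. In large samples the tests remain approximately valid without normality.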
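
To show where the normality assumption enters, the sketch below computes $ t $-statistics for the null hypotheses $ \beta_j = 0 $. It assumes NumPy and uses the standard matrix formulas $ \hat{\beta} = (X'X)^{-1} X'Y $ and $ s^2 (X'X)^{-1} $ for the estimated covariance. The function name `ols_t_statistics` and the simulated data are hypothetical. Under normal errors, each statistic follows a $ t $ distribution with $ n - k $ degrees of freedom when its null hypothesis is true.

```python
import numpy as np


def ols_t_statistics(X, y):
    """Return OLS coefficients, standard errors, and t-statistics for H0: beta_j = 0."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n, k = X.shape
    XtX_inv = np.linalg.inv(X.T @ X)
    beta_hat = XtX_inv @ X.T @ y
    residuals = y - X @ beta_hat
    s2 = residuals @ residuals / (n - k)      # unbiased estimate of sigma^2
    std_err = np.sqrt(s2 * np.diag(XtX_inv))
    return beta_hat, std_err, beta_hat / std_err


# Simulated data (an assumption for illustration): the last regressor
# has no true effect on y.
rng = np.random.default_rng(7)
n = 200
X = np.column_stack([np.ones(n), rng.normal(size=n), rng.normal(size=n)])
y = X @ np.array([1.0, 2.0, 0.0]) + rng.normal(size=n)

beta_hat, std_err, t_stats = ols_t_statistics(X, y)
print(np.round(t_stats, 2))
```

A large absolute $ t $-statistic is evidence against the corresponding null hypothesis. Turning it into a p-value requires the $ t $ distribution's tail probabilities, which this sketch leaves out.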

---

# Gauss-Markov Theorem

The **Gauss-Markov Theorem** is a fundamental result in statistics and econometrics. It states that under certain assumptions, the **Ordinary Least Squares (OLS)** estimator is the **Best Linear Unbiased Estimator (BLUE)** for the coefficients in a linear regression model.



## Key Terms in the Gauss-Markov Theorem
- **Best**: The estimator has the smallest variance among all linear unbiased estimators.
- **Linear**: The estimator is a linear function of the observed data.
- **Unbiased**: The expected value of the estimator equals the true value of the parameter:
  $$
  E(\hat{\beta}) = \beta
  $$
- **Estimator**: A rule or formula to calculate estimates for unknown parameters.

---

## Assumptions for the Gauss-Markov Theorem
For the OLS estimator to be BLUE, the following assumptions must hold:

1. **Linearity of the Model**:
   - The regression model is:
     $$
     Y = X\beta + \epsilon
     $$
     Where:
     - $ Y $: Dependent variable (vector of size $ n \times 1 $).
     - $ X $: Independent variables (matrix of size $ n \times k $).
     - $ \beta $: Coefficient vector ($ k \times 1 $).
     - $ \epsilon $: Error term ($ n \times 1 $).

2. **Full Rank of $ X $**:
   - The columns of $ X $ are linearly independent, so $ X $ has rank $ k $ and the matrix $ X'X $ is invertible.

3. **Zero Mean of Errors**:
   - The error term has an expected value of zero:
     $$
     E(\epsilon) = 0
     $$

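4. **Homoscedasticity (Equal Variance of Errors)**:
   - Every error has the same variance:
     $$
     Var(\epsilon_i) = \sigma^2 \quad \text{for all } i
     $$

5. **No Autocorrelation**:
   - The errors are uncorrelated:
     $$
     Cov(\epsilon_i, \epsilon_j) = 0 \quad \text{for } i \neq j
     $$
   - Together, assumptions 4 and 5 give the spherical error covariance matrix:
     $$
     Var(\epsilon) = \sigma^2 I
     $$
     Where $ I $ is the $ n \times n $ identity matrix.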

6. **Non-Stochastic Regressors**:
   - The independent variables $ X $ are fixed in repeated samples (or independent of the errors).

---

## Statement of the Gauss-Markov Theorem
Under these assumptions:
- The OLS estimator:
  $$
  \hat{\beta} = (X'X)^{-1} X'Y
  $$
  is **BLUE**:
  - **Best**: It has the smallest variance among all linear unbiased estimators.
  - **Linear**: It is a linear function of $ Y $.
  - **Unbiased**: Its expected value equals the true value of the parameter ($ E(\hat{\beta}) = \beta $).
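
The estimator formula can be evaluated directly. This is a minimal sketch, assuming NumPy; the function name `ols`, the coefficient values, and the simulated data are hypothetical. It solves the normal equations $ (X'X)\hat{\beta} = X'Y $ rather than forming the inverse explicitly, and compares the result with NumPy's least-squares routine.

```python
import numpy as np


def ols(X, y):
    """Solve the normal equations (X'X) b = X'y for b."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    return np.linalg.solve(X.T @ X, X.T @ y)


# Simulated data (an assumption for illustration): intercept plus two regressors.
rng = np.random.default_rng(42)
n = 200
beta_true = np.array([1.0, 2.0, -0.5])
X = np.column_stack([np.ones(n), rng.normal(size=(n, 2))])
y = X @ beta_true + rng.normal(scale=0.5, size=n)

beta_hat = ols(X, y)
beta_lstsq = np.linalg.lstsq(X, y, rcond=None)[0]

print("normal equations:", beta_hat)
print("numpy lstsq:     ", beta_lstsq)
```

Both routes give the same coefficients when $ X $ has full column rank. For badly conditioned design matrices, `np.linalg.lstsq` is usually the numerically safer choice.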

---

## Intuition Behind the Gauss-Markov Theorem
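1. **Unbiasedness**:
   - The sampling distribution of the OLS estimator is centered on the true coefficients. Substituting $ Y = X\beta + \epsilon $ into the estimator gives $ \hat{\beta} = \beta + (X'X)^{-1} X'\epsilon $, so $ E(\hat{\beta}) = \beta $ whenever $ E(\epsilon) = 0 $ and $ X $ is fixed.

2. **Efficiency**:
   - Among all linear unbiased estimators, OLS has the smallest variance, so no other estimator in that class is more precise. Biased or nonlinear estimators fall outside this comparison and can have smaller variance.

3. **Practical Impact**:
   - When the assumptions hold, OLS is the natural default for estimating a linear model, which is why it is a cornerstone of regression analysis.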
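
A small Monte Carlo experiment makes both ideas concrete. The sketch below assumes NumPy; the sample size, coefficients, and the competing "endpoint" estimator are hypothetical choices made for illustration. The design matrix is held fixed, new errors are drawn in each replication, and the OLS slope is compared with another linear unbiased slope estimator that uses only two observations.

```python
import numpy as np

rng = np.random.default_rng(1)
n, reps, sigma = 100, 5000, 1.0
beta_true = np.array([1.0, 2.0])

# The design matrix is drawn once and then held fixed across replications.
x = rng.uniform(0.0, 10.0, size=n)
X = np.column_stack([np.ones(n), x])
XtX_inv = np.linalg.inv(X.T @ X)

# Each row of Y is one simulated sample of the dependent variable.
eps = rng.normal(scale=sigma, size=(reps, n))
Y = X @ beta_true + eps

# Row r holds the OLS estimate for sample r: ((X'X)^{-1} X' y_r)'.
ols_estimates = Y @ X @ XtX_inv
ols_slopes = ols_estimates[:, 1]

# A competing linear unbiased slope estimator: the line through the two
# observations with the smallest and largest x.
lo, hi = np.argmin(x), np.argmax(x)
endpoint_slopes = (Y[:, hi] - Y[:, lo]) / (x[hi] - x[lo])

print("mean OLS estimate:", ols_estimates.mean(axis=0))
print("theoretical covariance:\n", sigma**2 * XtX_inv)
print("empirical covariance:\n", np.cov(ols_estimates, rowvar=False))
print("slope variance, OLS vs endpoints:", ols_slopes.var(), endpoint_slopes.var())
```

Both slope estimators average out close to the true slope, but the endpoint estimator discards most of the sample and so has a much larger variance, which is what "best" rules out for OLS.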

---

## What Happens If Assumptions Are Violated?
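1. **Homoscedasticity Violated**:
   - If the error variance is not constant (heteroscedasticity), OLS is still unbiased but no longer efficient, and the conventional standard errors are invalid.
   - Remedies include **Generalized Least Squares (GLS)**, which restores efficiency when the variance structure is known or can be estimated, and heteroscedasticity-robust standard errors, which correct inference while keeping the OLS estimates.

2. **Autocorrelation**:
   - If errors are correlated, OLS is unbiased but not efficient, and the conventional standard errors are again invalid. Time-series models or corrections like **Newey-West standard errors** are used.

3. **Model Misspecification**:
   - If the functional form is wrong, or a relevant variable that is correlated with the included regressors is omitted, the OLS estimator becomes biased.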
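
Two of these violations can be illustrated by simulation. The sketch below assumes NumPy; every number in it (error standard deviation proportional to $ x $, the coefficients, the correlation between regressors) is a hypothetical choice. The first part treats the error variances as known, so GLS reduces to weighted least squares. The second part omits a regressor that is correlated with the included one.

```python
import numpy as np

rng = np.random.default_rng(2)

# --- Heteroscedasticity: the error standard deviation grows with x ---
n, reps = 100, 3000
beta_true = np.array([1.0, 2.0])
x = rng.uniform(1.0, 10.0, size=n)
X = np.column_stack([np.ones(n), x])
sd = 0.5 * x                                   # assumed known error sd per observation
Y = X @ beta_true + rng.normal(size=(reps, n)) * sd

# Row r of each array is the estimate from simulated sample r.
ols_est = Y @ X @ np.linalg.inv(X.T @ X)

# With known variances, GLS is OLS after dividing each row by its error sd.
Xw = X / sd[:, None]
Yw = Y / sd
gls_est = Yw @ Xw @ np.linalg.inv(Xw.T @ Xw)

print("mean slope, OLS and GLS:", ols_est[:, 1].mean(), gls_est[:, 1].mean())
print("slope variance, OLS and GLS:", ols_est[:, 1].var(), gls_est[:, 1].var())

# --- Omitted variable: x2 affects y and is correlated with x1 ---
m = 50_000
x1 = rng.normal(size=m)
x2 = 0.5 * x1 + rng.normal(scale=0.5, size=m)
y = 1.0 + 2.0 * x1 + 3.0 * x2 + rng.normal(size=m)

X_short = np.column_stack([np.ones(m), x1])    # x2 left out
X_long = np.column_stack([np.ones(m), x1, x2])
b_short = np.linalg.lstsq(X_short, y, rcond=None)[0]
b_long = np.linalg.lstsq(X_long, y, rcond=None)[0]

# In large samples the short-regression slope is close to 2 + 3 * 0.5 = 3.5.
print("slope on x1, x2 omitted: ", b_short[1])
print("slope on x1, x2 included:", b_long[1])
```

In the first part both estimators are centered on the true slope, but GLS has the smaller variance. In the second part the short regression attributes part of the effect of the omitted variable to the included one, so its slope is biased away from the true value of 2.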

---

## Conclusion
The Gauss-Markov Theorem justifies the use of OLS in linear regression analysis: under the specified assumptions, no other linear unbiased estimator has a smaller variance. The result does not cover biased or nonlinear estimators, and it does not require normally distributed errors. When the assumptions are violated, other methods or adjustments must be employed.
