# The McCall Model

## Setup

<a id='index-0'></a>
An unemployed agent receives in each period a job offer at wage $ w_t $.

In this lecture, we adopt the following simple environment:

- The offer sequence $ \{w_t\}_{t \geq 0} $ is IID, with $ q(w) $ being the probability of observing wage $ w $ in a finite set $ \mathbb{W} $.  
- The agent observes $ w_t $ at the start of $ t $.  
- The agent knows that $ \{w_t\} $ is IID with common distribution $ q $ and can use this when computing expectations.  
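As a concrete illustration of this environment, the following minimal sketch builds a finite wage set and a probability mass function on it, then draws an IID offer sequence. The grid, the uniform choice of $ q $, and the names `w_vals`, `q` and `offers` are assumptions made for the example only.

```python
import numpy as np

# Hypothetical example values, not taken from the text
n = 5
w_vals = np.linspace(10.0, 60.0, n)   # finite wage set W = {w_1, ..., w_n}
q = np.full(n, 1.0 / n)               # q(w): here a uniform pmf on the grid

rng = np.random.default_rng(seed=0)

# IID draws w_0, ..., w_9 from the common distribution q
offers = rng.choice(w_vals, size=10, p=q)
```

Any other probability vector on the grid would serve equally well, as long as its entries are nonnegative and sum to one.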



At time $ t $, our agent has two choices:

1. Accept the offer and work permanently at constant wage $ w_t $.  
1. Reject the offer, receive unemployment compensation $ c $, and reconsider next period.  


The agent is infinitely lived and aims to maximize the expected discounted
sum of earnings

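$$
\mathbb{E} \sum_{t=0}^{\infty} \beta^t y_t
$$

Here $ \beta \in (0, 1) $ is the discount factor and $ y_t $ is income at time $ t $, equal to the accepted wage when employed and to unemployment compensation $ c $ when unemployed.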
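To see what the discounted sum looks like along a single realized path, consider an agent who rejects offers for $ k $ periods (earning $ c $ each period) and then accepts a wage $ w $ forever. The sketch below compares the closed-form value of that path with a long truncated sum. The function names and parameter values are hypothetical, and the expectation in the objective would average such path values over the random offers.

```python
def value_of_path(k, w, c, beta):
    """Discounted income from k periods at c, followed by w forever."""
    unemployed = c * (1 - beta**k) / (1 - beta)   # sum over t < k of beta^t * c
    employed = beta**k * w / (1 - beta)           # sum over t >= k of beta^t * w
    return unemployed + employed


def truncated_sum(k, w, c, beta, horizon=2_000):
    """Direct (truncated) evaluation of sum_t beta^t * y_t on the same path."""
    total = 0.0
    for t in range(horizon):
        y_t = c if t < k else w
        total += beta**t * y_t
    return total


# Hypothetical parameter values for illustration
k, w, c, beta = 3, 40.0, 25.0, 0.95
closed_form = value_of_path(k, w, c, beta)
approximation = truncated_sum(k, w, c, beta)
```

With $ \beta < 1 $, the truncation error shrinks geometrically in the horizon, so the two numbers agree to high precision.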

### The Value Function

In order to optimally trade off current and future rewards, we need to think about two things:

1. the current payoffs we get from different choices  
1. the different states that those choices will lead to in the next period  

Let $ v^*(w) $ denote the value of the objective function when an unemployed agent holding offer $ w $ makes optimal decisions now and at all future points in time.

This function $ v^* $ must satisfy the
recursion


<a id='equation-odu-pv'></a>
$$
v^*(w)
= \max \left\{
        \frac{w}{1 - \beta}, \, c + \beta
        \sum_{w' \in \mathbb{W}} v^*(w') q (w')
    \right\} \tag{1}
$$

for every possible $ w $ in $ \mathbb{W} $.

In this recursion:

- the first term inside the max operation is the lifetime payoff from accepting the current offer, since  


$$
\frac{w}{1 - \beta} = w + \beta w + \beta^2 w + \cdots
$$

- the second term inside the max operation is the **continuation value**, which is the lifetime payoff from rejecting the current offer and then behaving optimally in all subsequent periods  
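The two terms just described can be evaluated numerically for any candidate function $ v $ stored as a vector over the wage set. The following is a minimal sketch; the function name `bellman_terms`, the parameter values, and the candidate `v_guess` (which is not claimed to equal $ v^* $) are all assumptions made for illustration.

```python
import numpy as np


def bellman_terms(w, v, q, c, beta):
    """Return (value of accepting w, continuation value) for a candidate v."""
    accept = w / (1 - beta)                  # w + beta*w + beta^2*w + ...
    continuation = c + beta * np.dot(v, q)   # c + beta * sum_w' v(w') q(w')
    return accept, continuation


# Hypothetical example values
w_vals = np.array([10.0, 30.0, 50.0])
q = np.array([0.2, 0.5, 0.3])
c, beta = 25.0, 0.9

# Placeholder candidate: the value of accepting each offer immediately
v_guess = w_vals / (1 - beta)

accept, continuation = bellman_terms(30.0, v_guess, q, c, beta)
rhs = max(accept, continuation)   # right-hand side of the recursion at w = 30
```

Note that the continuation value does not depend on the current offer $ w $, only on $ v $, $ q $, $ c $ and $ \beta $.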




### The Optimal Policy


The **policy** is, in general, a map from states to actions.

Given *any* $ w $, we can read off the corresponding best choice (accept or
reject) by picking the max on the right-hand side of [(1)](#equation-odu-pv).

Thus, we have a map from $ \mathbb W $ to $ \{0, 1\} $, with 1 meaning accept and 0 meaning reject.

We can write the policy as follows

$$
\sigma(w) := \mathbf{1}
    \left\{
        \frac{w}{1 - \beta} \geq c + \beta \sum_{w' \in \mathbb W}
        v^*(w') q (w')
    \right\}
$$

Here $ \mathbf{1}\{ P \} = 1 $ if statement $ P $ is true and equals 0 otherwise.

We can also write this as

$$
\sigma(w) := \mathbf{1} \{ w \geq \bar w \}
$$

where


<a id='equation-reswage'></a>
$$
\bar w := (1 - \beta) \left\{ c + \beta \sum_{w' \in \mathbb W} v^*(w') q (w') \right\} \tag{2}
$$

Here $ \bar w $ (called the **reservation wage**) is a constant depending on
$ \beta, c $ and the wage distribution.

The agent should accept if and only if the current wage offer is at least as large as the reservation wage.
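The mapping from a value function to a reservation wage and a policy in [(2)](#equation-reswage) can be sketched as follows. The function names and numbers are hypothetical, and the vector `v` below is only a placeholder standing in for $ v^* $, so the resulting numbers illustrate the formulas rather than the optimal policy.

```python
import numpy as np


def reservation_wage(v, q, c, beta):
    """Reservation wage: (1 - beta) times the continuation value implied by v."""
    h = c + beta * np.dot(v, q)
    return (1 - beta) * h


def policy(w, w_bar):
    """sigma(w) = 1{w >= w_bar}: 1 means accept, 0 means reject."""
    return (np.asarray(w) >= w_bar).astype(int)


# Hypothetical example values; v is a placeholder, not the true v*
w_vals = np.array([10.0, 30.0, 50.0])
q = np.array([0.2, 0.5, 0.3])
c, beta = 25.0, 0.9
v = w_vals / (1 - beta)

w_bar = reservation_wage(v, q, c, beta)
sigma = policy(w_vals, w_bar)

# The same policy, written in terms of lifetime values instead of wages
h = c + beta * np.dot(v, q)
sigma_alt = (w_vals / (1 - beta) >= h).astype(int)
```

Both `sigma` and `sigma_alt` encode the same decision rule, since multiplying both sides of the inequality by $ 1 - \beta > 0 $ does not change it.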


## Computation

We need to compute the value function at each possible state $ w \in \mathbb W $.

Let’s set

$$
\mathbb W := \{w_1, \ldots, w_n  \}
    \quad \text{and} \quad
    v^*(i) := v^*(w_i)
$$

The value function is then represented by the vector
$ v^* = (v^*(i))_{i=1}^n $.

In view of [(1)](#equation-odu-pv), and writing $ q(j) := q(w_j) $, this vector satisfies the nonlinear system of equations


<a id='equation-odu-pv2'></a>
$$
v^*(i)
= \max \left\{
        \frac{w_i}{1 - \beta}, \, c + \beta \sum_{1 \leq j \leq n}
            v^*(j) q (j)
    \right\}
\quad
\text{for } i = 1, \ldots, n \tag{3}
$$
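One possible way to solve a system of this form is successive approximation: start from a guess, apply the right-hand side repeatedly, and stop when the vector no longer changes. The sketch below takes that approach. The choice of method, the initial guess, the tolerance, the function names `T` and `solve_by_iteration`, and the parameter values are all assumptions made for illustration.

```python
import numpy as np


def T(v, w_vals, q, c, beta):
    """Apply the right-hand side of the system to every index i at once."""
    accept = w_vals / (1 - beta)
    continuation = c + beta * np.dot(v, q)   # the same scalar for every i
    return np.maximum(accept, continuation)


def solve_by_iteration(w_vals, q, c, beta, tol=1e-10, max_iter=10_000):
    """Iterate v <- T(v) until successive iterates differ by less than tol."""
    v = w_vals / (1 - beta)   # initial guess: value of accepting every offer
    for _ in range(max_iter):
        v_new = T(v, w_vals, q, c, beta)
        if np.max(np.abs(v_new - v)) < tol:
            return v_new
        v = v_new
    raise RuntimeError("iteration did not converge")


# Hypothetical example values
w_vals = np.array([10.0, 30.0, 50.0])
q = np.array([0.2, 0.5, 0.3])
c, beta = 25.0, 0.9

v_star = solve_by_iteration(w_vals, q, c, beta)

# A solution of the system should satisfy T(v) = v up to numerical error
residual = np.max(np.abs(T(v_star, w_vals, q, c, beta) - v_star))
```

A vector solves the system exactly when it is a fixed point of `T`, so a small `residual` is a direct check on the computed answer.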