# A Simplified Kuno Huisman Problem

We examine a simplified version of the problem presented in the first half of the recent guest lecture by Kuno Huisman, and we see how it is a special case of the MDP setup we discussed in class.

### Overview of the problem

LG is considering whether to invest in a new product variety, namely a new type of LCD monitor. We abstract away from the problem of choosing the optimal screen size and focus on whether to invest and, if so, when. There is a finite number of periods $T$ in which investment can take place, after which investment is worthless (superior technologies will have been developed, so there will be no demand for our product). The revenue from investing decreases over time, which can be due to the entry of competitors (we do not model the competitors for now). There is uncertainty regarding the investment cost. If the decision maker chooses not to invest, she moves to the next period. If she invests, she makes no further decisions and receives an expected discounted payoff.

In MDP notation:

- **State space**: $ \mathcal{X} = \{ x^1, x^2 \} $, corresponding to low and high investment costs, respectively.

- ***Binomial* transition probabilities**: $ \Pi $, where an element is $\pi_{ij}=\Pr(x_{t+1}=x^j \ \big| \ x_{t}=x^i )$.


- **Action space**: $ \mathcal{A} = \{ 0,1 \} $, corresponding to not investing and investing, respectively.

- **Markov decision rule**: $ \alpha_t : \mathcal{X} \rightarrow \mathcal{A} $. Taken over all periods, the rules form a complete contingent plan: an action for every state in every time period.

- **Flow utility**

$$
u_t(0, x_t) = 0
$$

$$
u_t(1, x_t) = t^{-p} - x_t
$$

where $p$ is a parameter governing how fast the revenue from investing decreases.
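
The recursion that backward induction solves is not written out in the list above. A sketch of it, assuming a discount factor $\rho$ and a value of zero after period $T$ (both are taken from the code, where they appear as `rho` and as the terminal row of `v`), would be:

$$
v_t(x^i) = \max\left\{ t^{-p} - x^i, \ \rho \sum_{j=1}^{2} \pi_{ij} \, v_{t+1}(x^j) \right\}, \qquad v_{T+1}(x^i) = 0
$$

The first term is the payoff from investing now and the second is the discounted expected value of waiting. Under this reading, $\alpha_t(x^i) = 1$ when the first term is strictly larger.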







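```matlab
rho = 0.9;        % discount factor
tCheck = 10;      % number of periods T in which investment is possible
x = [0.2; 0.5];   % state space (low and high investment cost)

% Rows are periods, columns are states (low cost, high cost)
UInvest = nan(tCheck, 2);    % payoff from investing
UNot    = nan(tCheck, 2);    % discounted expected value of waiting
action  = nan(tCheck, 2);    % 1 if investing is optimal, 0 otherwise
v       = nan(tCheck+1, 2);  % value function

% Terminal condition: investment is worthless after period tCheck
v(tCheck+1, :) = 0;

p1 = 0.8; % Probability of transitioning from low cost state to low cost state
p2 = 0.2; % Probability of transitioning from high cost state to low cost state

% Transition matrix (named Pi so it does not shadow the built-in constant pi)
Pi = [ p1, 1-p1;
       p2, 1-p2 ];

power = 0.8;  % p in the text: how fast revenue decreases over time

% Backward induction from the last period to the first
for t = tCheck:-1:1
    UInvest(t, :) = (1/t)^power - x';
    UNot(t, :)    = rho * ( Pi * v(t+1, :)' )';
    action(t, :)  = UInvest(t, :) > UNot(t, :);
    v(t, :)       = max([UInvest(t, :); UNot(t, :)]);
end

display(action)
display(UInvest)
display(UNot)
```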
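
As a sanity check on the output, the last period can be worked by hand with the parameter values from the code ($T = 10$, $p = 0.8$, costs $0.2$ and $0.5$). The figures below are rounded to three decimals:

$$
10^{-0.8} \approx 0.158, \qquad u_{10}(1, x^1) \approx 0.158 - 0.2 = -0.042, \qquad u_{10}(1, x^2) \approx 0.158 - 0.5 = -0.342
$$

Both payoffs are below the value of waiting, which is zero in the last period, so the last row of `action` should be zero in both states. At the other end, the first-period payoffs from investing are $1 - 0.2 = 0.8$ and $1 - 0.5 = 0.5$. Waiting can yield at most $\rho \, (2^{-0.8} - 0.2) \approx 0.9 \times 0.374 \approx 0.337$, so the first row of `action` should be one in both states.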