At the end of each production cycle (e.g. seasonal) a candy production line must decide
whether to keep a machinery again or replace it with a new one. A machinery at cycle t has
a corresponding efficiency state
- Write down the Bellman optimality equation of the value function.
- Find the replacement policy that maximizes the expected long term cumulative profits using Value Iteration to solve the problem (Policy Iteration and Linear Programming methods can be considered too)
- Test the sensitivity of the optimal policies to different problem parameters