Question: Greg's PPO algorithm obtained 90% of the possible reward on the CoinRun environment. CoinRun's maximum reward is half as much as the maximum ProcGen reward of 240. How much reward did Greg's PPO algorithm get? Think carefully first, then make a decision:
CoinRun's maximum reward is half of ProcGen's maximum reward: 240 / 2 = 120. Greg's PPO algorithm obtained 90% of CoinRun's maximum reward: 120 * 0.9 = 108. So the answer is 108.
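The arithmetic can be checked with a short Python sketch. The function and variable names here are illustrative assumptions, not part of the question; only the numbers 240, one half, and 90% come from it. Integer arithmetic is used so the result is exact.

```python
# Values taken from the question; all names are hypothetical.
PROCGEN_MAX_REWARD = 240   # maximum ProcGen reward
PERCENT_OBTAINED = 90      # share of CoinRun's maximum that Greg's PPO reached


def coinrun_reward(procgen_max: int, percent: int) -> int:
    """Return the reward obtained on CoinRun.

    Assumes CoinRun's maximum is exactly half of the ProcGen maximum,
    as stated in the question.
    """
    coinrun_max = procgen_max // 2       # 240 / 2 = 120
    return coinrun_max * percent // 100  # 120 * 90 / 100 = 108


reward = coinrun_reward(PROCGEN_MAX_REWARD, PERCENT_OBTAINED)
print(reward)  # 108
```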