Just want to know if it makes sense to apply the tech in https://github.com/rlcode/reinforcement-learning/blob/master/2-cartpole/3-reinforce/cartpole_reinforce.py#L45 to a3c implementations, for cartpole and breakout? Thanks.