Re-implemented with Gluon.
# Usage
- train from scratch
  python FlappyBirdQDN.py
- train from a checkpoint (saved after 30,000+ timesteps)
  python FlappyBirdQDN.py traind.params
- the bird starts playing well after about 20,000 steps
- Block converges faster than HybridBlock? With HybridBlock, the bird starts playing well after about 30,000 steps
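
The two invocations under Usage differ only in the optional checkpoint argument. A minimal sketch of how a script could distinguish them is shown below. It assumes the checkpoint path is read from the first positional argument; `parse_checkpoint` is a hypothetical helper for illustration and is not taken from FlappyBirdQDN.py.

```python
import sys


def parse_checkpoint(argv):
    """Return the checkpoint path given on the command line, or None.

    Hypothetical helper: argv[0] is the script name, and an optional
    argv[1] is treated as the path to a saved .params file.
    """
    return argv[1] if len(argv) > 1 else None


if __name__ == "__main__":
    checkpoint = parse_checkpoint(sys.argv)
    if checkpoint is None:
        # No argument given: start with freshly initialized weights.
        print("training from scratch")
    else:
        # Assumption: a Gluon network would restore its weights at this
        # point, for example with net.load_parameters(checkpoint).
        print("resuming from", checkpoint)
```

Keeping the argument handling in a small function makes the "from scratch" and "from a checkpoint" paths easy to check without starting a training run.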