https://github.com/deepmind/dsprites-dataset Disentangle everything (dSprites); maybe a good example of how to build a video classification task.
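One way the video idea might work, as a rough sketch rather than anything the dSprites authors provide: every image is identified by six latent factors, so a "clip" can be assembled by holding five of them fixed and sweeping the sixth. The factor sizes and their ordering (color, shape, scale, orientation, posX, posY, with posY varying fastest) follow the dataset README. The helper names `latent_to_index` and `sweep_x_clip` are hypothetical, and treating the shape class as the clip label is an assumption.

```python
# Latent factor sizes as listed in the dSprites README, in order:
# color, shape, scale, orientation, posX, posY.
LATENT_SIZES = (1, 3, 6, 40, 32, 32)


def latent_to_index(latents):
    """Map one class index per factor to a flat image index.

    Assumes row-major ordering over the factors (posY changes fastest).
    """
    if len(latents) != len(LATENT_SIZES):
        raise ValueError("expected one value per latent factor")
    index = 0
    for value, size in zip(latents, LATENT_SIZES):
        if not 0 <= value < size:
            raise ValueError(f"latent value {value} out of range [0, {size})")
        index = index * size + value
    return index


def sweep_x_clip(shape, scale, orientation, pos_y):
    """Hypothetical helper: image indices for a clip in which the sprite
    moves left to right while every other factor stays fixed."""
    return [
        latent_to_index((0, shape, scale, orientation, pos_x, pos_y))
        for pos_x in range(LATENT_SIZES[4])
    ]


if __name__ == "__main__":
    clip = sweep_x_clip(shape=1, scale=0, orientation=0, pos_y=5)
    print(len(clip), clip[:3])
```

After loading the dataset's `.npz` file with NumPy, indexing its `imgs` array with such a list should give a stack of 64x64 frames, one clip per label. Sweeping orientation or scale instead of posX would give other motion classes to distinguish.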