We should rename the datasets.get_* function to datasets.get_* in order to be more sklearn compliant. Also it should be possible to give the rng as input as in sklearn.