Join GitHub today
GitHub is home to over 28 million developers working together to host and review code, manage projects, and build software together.Sign up
Random sampling #1410
The main usecase is to randomly downsample to a cached dataset, and then use that as the basis for faster exploration and expression building.
So far, the Python and SQL backends are implemented; still have to implement the Pandas / Dask backend.