datasets
is a python package that enables users to quickly build complex Tensorflow datasets. The tool offers flexibility to import out-of-memory datasets and apply image augmentation functions in real time.
datasets
API borrows heavily from ImageDataGenerator
, making it nearly a drop-in replacement. However, TFImageDataset
class is approximately 5-fold faster than the ImageDataGenerator
.
The latest stable version can be installed directly from github:
git clone https://github.com/beringresearch/datasets/
cd datasets
python3 install --editable .
Check out example notebook to get started with the package.