A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
Updated Oct 9, 2024 - Python
Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.
[CVPR 2021: Oral] In this work, we show that high-frequency Fourier spectrum decay discrepancies are not inherent characteristics of existing CNN-based generative models.
DALLE-tools provides useful dataset utilities to improve your workflow with WebDatasets.
High-level API for tar-based datasets
This repo is the officially released code for FoPro (AAAI-2023)
Scripts to collect data from CARLA and save it in WebDataset format
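The projects above share the tar-based sharded layout, in which the files belonging to one sample sit next to each other in a tar archive and share a basename. The following is a minimal sketch of that idea using only the Python standard library, not the API of any listed repository. The function names `write_shard` and `read_shard`, the `__key__` field as used here, and the sample contents are illustrative assumptions, and the key is simply taken as everything before the first dot in the member name.

```python
import io
import tarfile


def write_shard(samples):
    """Pack samples into an in-memory tar archive (hypothetical helper).

    Each sample is a dict with a "__key__" string plus {extension: bytes} entries.
    """
    buf = io.BytesIO()
    with tarfile.open(fileobj=buf, mode="w") as tar:
        for sample in samples:
            key = sample["__key__"]
            for ext, payload in sample.items():
                if ext == "__key__":
                    continue
                # One tar member per field, named "<key>.<ext>".
                info = tarfile.TarInfo(name=f"{key}.{ext}")
                info.size = len(payload)
                tar.addfile(info, io.BytesIO(payload))
    return buf.getvalue()


def read_shard(data):
    """Yield one dict per sample by grouping consecutive members with the same key."""
    current = None
    with tarfile.open(fileobj=io.BytesIO(data), mode="r") as tar:
        for member in tar:
            if not member.isfile():
                continue
            # Assumption: the key is everything before the first dot.
            key, _, ext = member.name.partition(".")
            if current is None or current["__key__"] != key:
                if current is not None:
                    yield current
                current = {"__key__": key}
            current[ext] = tar.extractfile(member).read()
    if current is not None:
        yield current


# Illustrative data only: two text/label samples.
shard = write_shard([
    {"__key__": "sample0000", "txt": b"a cat", "cls": b"0"},
    {"__key__": "sample0001", "txt": b"a dog", "cls": b"1"},
])
samples = list(read_shard(shard))
```

Because samples are grouped by adjacency, a reader like this can stream a shard sequentially without an index, which is the main reason tar archives suit large training sets.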