DeepSpeedDataLoader is a wrapper around the torch DataLoader and should, at a minimum, match its semantics. Right now this property is not well tested, and discrepancies have already been identified, such as #167. In my estimation, a fair bit of work is required to achieve this goal, including the following items:
- Support batching and non-batching modes.
- Unit tests to validate expected behavior.
- Performance comparison between DeepSpeedDataLoader and torch DataLoader.
- Some documentation of DeepSpeedDataLoader, including any semantic deviation from torch DataLoader.
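
As a starting point for the unit tests item, here is a minimal sketch of a loader-agnostic equivalence check. The helper names `first_mismatch` and `_to_plain` are hypothetical and not part of DeepSpeed or torch. The helper only assumes that both loaders are iterable and that yielded items are either plain Python containers or objects with a `tolist()` method (as torch tensors have).

```python
from itertools import zip_longest

# Sentinel marking that one loader ran out of items before the other.
_MISSING = object()


def _to_plain(item):
    """Convert tensors/arrays to nested Python lists so items compare with ==.

    Note: tuples and lists are both normalized to lists, so this check
    ignores the tuple/list distinction (an assumption of this sketch).
    """
    if hasattr(item, "tolist"):
        return item.tolist()
    if isinstance(item, (list, tuple)):
        return [_to_plain(x) for x in item]
    if isinstance(item, dict):
        return {key: _to_plain(value) for key, value in item.items()}
    return item


def first_mismatch(reference_loader, candidate_loader):
    """Return the index of the first differing item, or None if the loaders agree."""
    pairs = zip_longest(reference_loader, candidate_loader, fillvalue=_MISSING)
    for index, (expected, actual) in enumerate(pairs):
        if expected is _MISSING or actual is _MISSING:
            return index  # one loader yielded fewer items than the other
        if _to_plain(expected) != _to_plain(actual):
            return index
    return None
```

In a real test, `reference_loader` could be a torch `DataLoader` over a small deterministic dataset with shuffling disabled, once with an integer `batch_size` and once with `batch_size=None` (which disables automatic batching in torch), and `candidate_loader` would be a DeepSpeedDataLoader built over the same dataset. A test would then assert that `first_mismatch(...)` returns `None` in both modes.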