Hi ,
After encountering problems during training with my custom dataset using Deepspeed Initialize , I have narrowed the problem to the fact that since the number of images in my dataset is not divisible by the batch size , Deepspeed is throwing an error. This is easily resolvable in torch Dataloader by setting drop_last = True , the Deepspeed.pt.Dataloader source code however does not use this argument at all .
Could this be added to the source please ?
If not , please suggest an alternative means for training