Index out of error #115

Zumbalamambo · 2021-02-14T00:02:50Z

It throws the following error on training

self_supervised/simclr/simclr_module.py", line 249, in optimizer_step
    param_group["lr"] = self.lr_schedule[self.trainer.global_step]
IndexError: index 900 is out of bounds for axis 0 with size 900

The text was updated successfully, but these errors were encountered:

aribornstein · 2021-02-14T09:08:12Z

@ananyahjha93

akihironitta · 2021-02-14T14:34:34Z

Maybe the same as Lightning-Universe/lightning-bolts#436?

Zumbalamambo · 2021-02-14T22:00:46Z

Still the same problem even after I have set the max_epochs :(

ananyahjha93 · 2021-02-25T02:38:51Z

@Zumbalamambo are you using the simclr/swav script from bolts? If yes, can you post the num samples in your dataset, your batch size, accelerator count and then max epochs?

pengbohua · 2021-04-18T04:13:20Z

I met the same issue when I tried to reproduce SwAV pretraining on CIFAR10.

# data
batch_size = 2048
dm = CIFAR10DataModule(data_dir='./data/', batch_size=batch_size, normalize=True)
# loaders are contained in the DataModule which are self consistent

parser = argparse.ArgumentParser('SwAV CIFAR-10')
parser = SwAV.add_model_specific_args(parser)

args = parser.parse_args('')

# model
args.gpus = 1
args.arch = 'resnet18'
args.hidden_mlp = 1024
args.max_epochs = 100
args.dataset = dm
args.batch_size = batch_size
args.size_crops = [32, 16]
args.maxpool1 = False
args.nmb_crops = [2, 1]
args.gaussian_blur = False
args.num_samples = dm.num_samples
dm.train_transforms = SwAVTrainDataTransform(
    size_crops=args.size_crops,
    nmb_crops=args.nmb_crops,
    gaussian_blur=args.gaussian_blur
)

dm.val_transforms = SwAVEvalDataTransform(
    size_crops=args.size_crops,
    nmb_crops=args.nmb_crops,
    gaussian_blur=args.gaussian_blur
)
dm.test_transforms = SwAVEvalDataTransform(
    size_crops=args.size_crops,
    nmb_crops=args.nmb_crops,
    gaussian_blur=args.gaussian_blur
)
print('hypers', args)

#logger 
from pytorch_lightning.loggers import TensorBoardLogger, CSVLogger

csv_logger = CSVLogger("/content/drive/MyDrive/contrastive_learning/Swav/logs", name="SwAV-CIFAR10")
model = SwAV(
**args.__dict__
)


# fit
trainer = pl.Trainer(max_epochs=args.max_epochs, gpus=1, precision=16, logger=csv_logger, callbacks=[EarlyStopping(monitor='val_loss')])
trainer.fit(model, datamodule=dm)


#error message
usr/local/lib/python3.7/dist-packages/pytorch_lightning/trainer/training_loop.py in optimizer_step(self, optimizer, opt_idx, batch_idx, train_step_and_backward_closure)
    431             on_tpu=self.trainer._device_type == DeviceType.TPU and _TPU_AVAILABLE,
    432             using_native_amp=using_native_amp,
--> 433             using_lbfgs=is_lbfgs,
    434         )
    435 

/usr/local/lib/python3.7/dist-packages/pl_bolts/models/self_supervised/swav/swav_module.py in optimizer_step(self, epoch, batch_idx, optimizer, optimizer_idx, optimizer_closure, on_tpu, using_native_amp, using_lbfgs)
    329         # adjust LR of optim contained within LARSWrapper
    330         for param_group in optimizer.param_groups:
--> 331             param_group["lr"] = self.lr_schedule[self.trainer.global_step]
    332 
    333         # from lightning

IndexError: index 1900 is out of bounds for axis 0 with size 1900

edgarriba · 2021-05-03T10:34:07Z

@Zumbalamambo are you still having those issues ?
BTW, what version of flash do you use ?

tarunn2799 · 2021-05-29T13:31:25Z

@edgarriba I'm facing the same issue, when I'm trying to train a custom dataset. I'm running 4 gpus, and a batch size of 2048.

ananyahjha93 · 2021-08-16T20:30:10Z

This has been fixed in bolts master.

Zumbalamambo added bug / fix Something isn't working help wanted Extra attention is needed labels Feb 14, 2021

edenlightning assigned ananyahjha93 Feb 16, 2021

edenlightning added this to the 0.2 milestone Mar 22, 2021

edenlightning modified the milestones: 0.2, 0.3 Apr 19, 2021

edenlightning added the waiting on author label May 10, 2021

ethanwharris modified the milestones: 0.3, 0.3.x Jun 9, 2021

Borda modified the milestones: 0.3.x, 0.4 Aug 3, 2021

ananyahjha93 closed this as completed Aug 16, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Index out of error #115

Index out of error #115

Zumbalamambo commented Feb 14, 2021

aribornstein commented Feb 14, 2021

akihironitta commented Feb 14, 2021

Zumbalamambo commented Feb 14, 2021

ananyahjha93 commented Feb 25, 2021

pengbohua commented Apr 18, 2021 •

edited

Loading

edgarriba commented May 3, 2021

tarunn2799 commented May 29, 2021

ananyahjha93 commented Aug 16, 2021

Index out of error #115

Index out of error #115

Comments

Zumbalamambo commented Feb 14, 2021

aribornstein commented Feb 14, 2021

akihironitta commented Feb 14, 2021

Zumbalamambo commented Feb 14, 2021

ananyahjha93 commented Feb 25, 2021

pengbohua commented Apr 18, 2021 • edited Loading

edgarriba commented May 3, 2021

tarunn2799 commented May 29, 2021

ananyahjha93 commented Aug 16, 2021

pengbohua commented Apr 18, 2021 •

edited

Loading