Named tuples converted to regular tuples when sent to the GPU #1588

Closed
nathanbreitsch opened this issue Apr 24, 2020 · 2 comments · Fixed by #1589
Labels: bug (Something isn't working), help wanted (Open to be worked on)

Comments

@nathanbreitsch (Contributor)

🐛 Bug

Named tuples returned from a Dataset get converted to regular tuples when sent to the GPU.
This happens because isinstance(instance_of_a_named_tuple, tuple) evaluates to True in distrib_parts.py:
https://github.com/PyTorchLightning/pytorch-lightning/blob/67d5f4dc392250d23bfeb11aba45e919a99ff1c0/pytorch_lightning/trainer/distrib_parts.py#L463
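
For illustration, here is the pitfall in isolation (a standalone snippet, not Lightning code):

from collections import namedtuple

Point = namedtuple('Point', ['x', 'y'])
p = Point(1.0, 2.0)

# Namedtuples subclass tuple, so a plain isinstance check matches them:
print(isinstance(p, tuple))  # True

# Rebuilding the container as a generic tuple silently drops the field names:
rebuilt = tuple(v for v in p)
print(type(rebuilt))         # <class 'tuple'> -- rebuilt.x raises AttributeError

# Rebuilding with the original type keeps them:
kept = type(p)(*(v for v in p))
print(kept.x, kept.y)        # 1.0 2.0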

To Reproduce

import pytorch_lightning as pl
from collections import namedtuple
import torch
import numpy

NamedTupleDemoInput = namedtuple('DemoInput', ['x1', 'x2', 'y'])

class NamedTupleDemoDataset(torch.utils.data.Dataset):
    """Map-style dataset that returns a namedtuple per sample."""

    def __len__(self):
        return 30000

    def __getitem__(self, index):
        x1 = numpy.random.uniform(0, 100)
        x2 = numpy.random.uniform(0, 100)
        y = 2 * x1 + 3 * x2 + numpy.random.normal(0, 0.05)
        return NamedTupleDemoInput(x1, x2, y)

class WeightedSum(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.a = torch.nn.Parameter(torch.zeros(1))
        self.b = torch.nn.Parameter(torch.zeros(1))

    def forward(self, x1, x2):
        return self.a * x1 + self.b * x2

class NamedTupleDemo(pl.LightningModule):

    def __init__(self):
        super().__init__()
        self.model = WeightedSum()

    def forward(self, x1, x2):
        return self.model(x1, x2)

    def train_dataloader(self):
        return torch.utils.data.DataLoader(NamedTupleDemoDataset(), batch_size=128)

    def training_step(self, batch, batch_index):
        # Fails on GPU: by this point `batch` has been turned into a plain tuple.
        yhat = self.forward(batch.x1, batch.x2)
        return {'loss': torch.nn.functional.mse_loss(batch.y, yhat)}

    def configure_optimizers(self):
        return torch.optim.Adam(self.parameters(), lr=1e-2)

if __name__ == '__main__':
    module = NamedTupleDemo()
    pl.Trainer(max_epochs=20, gpus=1).fit(module)
    print(f'a={float(module.model.a)} b={float(module.model.b)}')

Traceback (most recent call last):
  File "demo.py", line 48, in <module>
    pl.Trainer(max_epochs=20, gpus=1).fit(module)
  File "/home/n/repos/pytorch-lightning/pytorch_lightning/trainer/trainer.py", line 749, in fit
    self.single_gpu_train(model)
  File "/home/n/repos/pytorch-lightning/pytorch_lightning/trainer/distrib_parts.py", line 491, in single_gpu_train
    self.run_pretrain_routine(model)
  File "/home/n/repos/pytorch-lightning/pytorch_lightning/trainer/trainer.py", line 910, in run_pretrain_routine
    self.train()
  File "/home/n/repos/pytorch-lightning/pytorch_lightning/trainer/training_loop.py", line 384, in train
    self.run_training_epoch()
  File "/home/n/repos/pytorch-lightning/pytorch_lightning/trainer/training_loop.py", line 456, in run_training_epoch
    _outputs = self.run_training_batch(batch, batch_idx)
  File "/home/n/repos/pytorch-lightning/pytorch_lightning/trainer/training_loop.py", line 633, in run_training_batch
    loss, batch_output = optimizer_closure()
  File "/home/n/repos/pytorch-lightning/pytorch_lightning/trainer/training_loop.py", line 597, in optimizer_closure
    output_dict = self.training_forward(split_batch, batch_idx, opt_idx, self.hiddens)
  File "/home/n/repos/pytorch-lightning/pytorch_lightning/trainer/training_loop.py", line 770, in training_forward
    output = self.model.training_step(*args)
  File "demo.py", line 40, in training_step
    yhat = self.forward(batch.x1, batch.x2)
AttributeError: 'tuple' object has no attribute 'x1'

Expected behavior

Namedtuples returned from the dataset should keep their original fields.
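
For reference, here is a minimal sketch of what a namedtuple-aware transfer could look like (a hypothetical helper, not the actual code from #1589; it assumes recursion over common containers and uses the _fields attribute that namedtuples expose):

import torch

def move_to_device(batch, device):
    # Hypothetical sketch; names and structure are illustrative only.
    if isinstance(batch, torch.Tensor):
        return batch.to(device)
    # Check for namedtuples *before* plain tuples: they subclass tuple but
    # must be rebuilt with their own type to keep their field names.
    if isinstance(batch, tuple) and hasattr(batch, '_fields'):
        return type(batch)(*(move_to_device(x, device) for x in batch))
    if isinstance(batch, (list, tuple)):
        return type(batch)(move_to_device(x, device) for x in batch)
    if isinstance(batch, dict):
        return {k: move_to_device(v, device) for k, v in batch.items()}
    return batch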

Environment

  • CUDA:
    - GPU: GeForce RTX 2080 Ti
    - available: True
    - version: 10.2
  • Packages:
    - numpy: 1.18.3
    - pyTorch_debug: False
    - pyTorch_version: 1.5.0
    - pytorch-lightning: 0.7.4rc5
    - tensorboard: 2.2.1
    - tqdm: 4.45.0
  • System:
    - OS: Linux
    - architecture: 64bit, ELF
    - processor:
    - python: 3.8.2
    - version: #1 SMP PREEMPT Sun, 05 Apr 2020 05:13:14 +0000
@nathanbreitsch added the bug (Something isn't working) and help wanted (Open to be worked on) labels on Apr 24, 2020
@github-actions (Contributor)

Hi! Thanks for your contribution, great first issue!

@Vozf (Contributor)

Vozf commented Mar 1, 2021

I am having similar trouble with a multi-GPU setup. Was this fixed for multiple GPUs in the PR? If not, I believe this issue should be reopened.
In my case everything works fine on a single GPU, but with 2 GPUs I get the error
AttributeError: 'tuple' object has no attribute 'image'
The object on the failing line should be a namedtuple, not a plain tuple.
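
A possible explanation (an assumption on my part, not verified against the DataParallel source): if the multi-GPU scatter splits batches with a zip-based rebuild, the namedtuple type is lost, because zip always yields plain tuples regardless of the input type:

from collections import namedtuple

Batch = namedtuple('Batch', ['image', 'label'])
b = Batch(image=['img0', 'img1'], label=[0, 1])

# Splitting the batch per sample with zip(*...) produces plain tuples,
# so shard.image would raise AttributeError on each device:
shards = list(zip(*b))
print(type(shards[0]))  # <class 'tuple'>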
