Update auto_reg_nn.sample_mask_indices() to be default device-aware #3344

cafletezbrant · 2024-03-21T23:33:10Z

Hi Pyro team, thank you for making such a useful and cool library. I encountered a small bug with an easy fix and wanted to share.

As described in my Pyro forum post, there is a device mismatch in auto_reg_nn.sample_mask_indices(). The line

indices = torch.linspace(1, input_dim, steps=hidden_dim, device="cpu").to(
    torch.Tensor().device
)

creates tensors on the CPU even when torch.set_default_device('cuda') is used. I believe this is because torch.Tensor is an alias for torch.FloatTensor, which is not the same as torch.cuda.FloatTensor. Minimum working example (from the Pyro docs):

import torch
import pyro
from pyro.nn import AutoRegressiveNN

torch.set_default_device('cuda')

x = torch.randn(100, 10)
print(x.device)
# cuda:0
print(torch.Tensor().device)
# cpu
print(torch.tensor(0.).device)
# cuda:0
arn = AutoRegressiveNN(10, [50], param_dims=[1])
p = arn(x)

Instantiating an AutoRegressiveNN object fails with the error

RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu!

The proposed fix is to replace torch.Tensor().device with torch.tensor(0.0).device (lower case tensor; adding a simple value since torch.tensor() expects data). Then the object can be instantiated. This change is the sole element in this PR.

martinjankowiak · 2024-03-21T23:58:46Z

thanks @cafletezbrant

this is pretty old code...

wouldn't this be sufficient? torch.linspace(1, input_dim, steps=hidden_dim)

cafletezbrant · 2024-03-22T00:01:33Z

@martinjankowiak pretty old as in almost deprecated? Or just not recently updated?

Also yes, I just tested, your proposal also works, can update to that if you'd prefer.
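
For reference, a minimal sketch comparing the two idioms discussed above. It assumes PyTorch 2.0 or later, uses the torch.device context manager instead of torch.set_default_device so the global default is left untouched, and falls back to the "meta" device when no GPU is available (an assumption made only so the sketch runs anywhere). The helper name linspace_devices is hypothetical.

```python
import torch


def linspace_devices(input_dim, hidden_dim, device):
    """Return the devices produced by the old and the simplified idiom."""
    with torch.device(device):
        # Old idiom: build on CPU, then move to the device of an empty
        # legacy tensor, which may not follow the default device.
        old = torch.linspace(1, input_dim, steps=hidden_dim, device="cpu").to(
            torch.Tensor().device
        )
        # Simplified idiom: the factory function follows the default device.
        new = torch.linspace(1, input_dim, steps=hidden_dim)
    return old.device, new.device


# Fall back to the "meta" device when CUDA is unavailable (assumption).
device = "cuda" if torch.cuda.is_available() else "meta"
old_device, new_device = linspace_devices(10, 50, device)
print("old idiom:", old_device)
print("simplified idiom:", new_device)
```

The simplified call should land on whatever default device is active, which is why it avoids the mismatch.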

martinjankowiak · 2024-03-22T00:07:55Z

yes please use the simpler version, thanks!

> pretty old as in almost deprecated? Or just not recently updated?

not recently updated and therefore oldish pytorch idioms

martinjankowiak · 2024-03-22T00:09:45Z

does arn.to(...) work as expected?

cafletezbrant · 2024-03-22T00:20:00Z

Yes, arn.to() works as expected:

arn.to('cpu')
next(arn.parameters()).is_cuda
# False
p = arn(x.cpu())
p.device
# device(type='cpu')
arn.to('cuda')
next(arn.parameters()).is_cuda
# True
p = arn(x)
p.device
# device(type='cuda', index=0)
p[0, 0:5]
# tensor([-0.2749,  0.0823,  0.1205, -0.1107,  0.1880], device='cuda:0',
#       grad_fn=<SliceBackward0>)

I've pushed the requested simpler version. I was asking about age because if this is relatively unused code, I might expect to stub my toe a few more times, which might turn into one or more additional PRs.

martinjankowiak · 2024-03-22T00:27:46Z

not sure what your goals are but there are certainly more up-to-date normalizing flows libraries out there, some of which have some amount of pyro integration, see e.g. https://github.com/pyro-ppl/pyro/blob/dev/pyro/contrib/zuko.py

cafletezbrant · 2024-03-22T00:36:16Z

Ah interesting, is that a more recommended way to do things [1]? I was just trying to test out whether an NF would help my model fit (i.e. I am not sure if it will), which is why I was originally trying to use an AutoGuide. I suppose the way forward would be to simply write a guide using e.g. Zuko for the parameters I'm trying to estimate via NF and add that to my AutoGuideList?

[1] Just to be clear, I meant no criticism of the state of this code base, only that if it is less maintained than other parts, I might be posting here again.

martinjankowiak · 2024-03-22T17:57:28Z

i think using the machinery in pyro is probably a reasonable place to start but if you want to explore a more diverse and/or more recent set of flows it may be a good idea to explore other pytorch-based flow libraries like zuko

cafletezbrant · 2024-03-22T18:53:23Z

Got it, thanks for the pointer. I'll explore the built-in work first and see where that goes.

martinjankowiak · 2024-03-23T15:30:49Z

looks like you deleted before i could merge

cafletezbrant · 2024-03-23T15:40:33Z

Sorry, brainfart! I will fix on Monday

Update auto_reg_nn.py to be device-aware

b17df9b

Update auto_reg_nn.py

e93a7e6

update per discussion with @martinjankowiak

martinjankowiak approved these changes Mar 22, 2024

cafletezbrant closed this by deleting the head repository Mar 23, 2024

cafletezbrant mentioned this pull request Mar 25, 2024

Auto regressive nn on gpu - revisited #3346

Merged
