Adjoint operations move Jacobian from GPU to CPU #88

yashsavani · 2024-03-20T21:58:00Z

🐛 Bug

The adjoint operations in CoLA are moving the Jacobian tensor from the GPU to the CPU, which can lead to performance issues and inconsistencies.

To reproduce

** Code snippet to reproduce **

import torch
import cola

dev = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")

x = torch.randn(100).to(dev)
fn = torch.nn.Sequential(torch.nn.Linear(100, 64), torch.nn.Linear(64, 100)).to(dev)

J = cola.ops.Jacobian(fn, x)
print(J.device, J.T.device, J.H.device, cola.ops.Adjoint(J).device)

** Stack trace/error message **

cuda:0 cpu cpu cpu

Expected Behavior

Output should look like:

cuda:0 cuda:0 cuda:0 cuda:0

System information

Please complete the following information:

0.0.6.dev11+gf3c5494
2.1.2
Springdale Open Enterprise Linux 8.6 (Modena)

Additional context

Possibly an issue here
https://github.com/wilson-labs/cola/blob/main/cola/ops/operators.py#L361
where the device is not being used

The text was updated successfully, but these errors were encountered:

AndPotap · 2024-08-28T16:53:17Z

Thank you for pointing out this bad allocation of devices. Also thank you for the concise and well though code snippet to reproduce. I've just added a fix on #97.

yashsavani added the bug Something isn't working label Mar 20, 2024

AndPotap closed this as completed Aug 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adjoint operations move Jacobian from GPU to CPU #88

Adjoint operations move Jacobian from GPU to CPU #88

yashsavani commented Mar 20, 2024

AndPotap commented Aug 28, 2024

Adjoint operations move Jacobian from GPU to CPU #88

Adjoint operations move Jacobian from GPU to CPU #88

Comments

yashsavani commented Mar 20, 2024

🐛 Bug

To reproduce

Expected Behavior

System information

Additional context

AndPotap commented Aug 28, 2024