Description
TL;DR
PyTorch is planning a BC-breaking change in torch.load to flip the default for weights_only from None (i.e. False) to True (and has added a warning to this effect in torch 2.4 :) ). This will break loading of tensors serialized while on an XLA device.
Context
Instead of using the default Unpickler provided by pickle, torch.load(weights_only=True) uses a custom Unpickler that restricts the GLOBALs allowed in the checkpoint (classes and functions) to those here (i.e. those required to build state_dicts).
The purpose of this is to address the risk of remote code execution when using torch.load.
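As a rough illustration (file names here are made up), a checkpoint that contains only tensors and standard containers, such as a plain state_dict, loads fine under weights_only=True without any extra setup:

```python
import torch

# A plain state_dict only needs the default-allowed globals,
# so it loads under weights_only=True without any allowlisting.
model = torch.nn.Linear(4, 2)
torch.save(model.state_dict(), "linear.pt")
state = torch.load("linear.pt", weights_only=True)
model.load_state_dict(state)
```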
Another feature is that users can allowlist certain globals using add_safe_globals (in torch 2.4) or the safe_globals context manager (in torch nightly); a simple example:
```python
import torch
from torch.serialization import safe_globals

# A user-defined tensor subclass is not in the default allowlist.
class MyTensor(torch.Tensor):
    pass

t = MyTensor(torch.randn(2, 3))
torch.save(t, "ckpt.pt")

# This fails, saying that MyTensor is not an allowed GLOBAL
# t1 = torch.load("ckpt.pt", weights_only=True)

# This succeeds
with safe_globals([MyTensor]):
    torch.load("ckpt.pt", weights_only=True)
```

How this affects XLA
Notably, XLA uses a special path that relies on numpy for serialization/deserialization (see here). However, we have decided not to include the numpy GLOBALs required for unpickling in the default allowlist, as we do not control the codepaths numpy implements for pickling (see the relevant GLOBALs here).
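As a stopgap, users who trust their checkpoints could in principle allowlist the numpy globals themselves. This is only a sketch: the exact set of globals needed depends on the numpy version, so treat the list below as an assumption rather than an official recommendation.

```python
import numpy as np
import torch
from torch.serialization import add_safe_globals

# Hypothetical workaround: explicitly trust the numpy reconstruction
# machinery so an XLA/numpy-serialized checkpoint can be unpickled
# under weights_only=True. The required globals can vary across numpy
# versions; this list is an assumption, not an official recipe.
add_safe_globals([np.core.multiarray._reconstruct, np.ndarray, np.dtype])

ckpt = torch.load("xla_ckpt.pt", weights_only=True)
```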
Ask
Opening this issue to figure out the best way to move forward on the above and make the flip as smooth as possible!
Ideally, it would be good if the path for serializing XLA tensors could be refactored to not use numpy, and we would definitely accept a PR that implements this!
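One possible shape for such a refactor (purely a sketch under my own assumptions, not how torch_xla is structured today; the helper names are made up) is to move XLA tensors to CPU before handing the object to the standard torch serialization path:

```python
import torch

def _to_cpu(obj):
    # Hypothetical helper: recursively move tensors to CPU so that
    # torch.save uses the standard (numpy-free) tensor serialization path.
    if isinstance(obj, torch.Tensor):
        return obj.cpu()
    if isinstance(obj, dict):
        return {k: _to_cpu(v) for k, v in obj.items()}
    if isinstance(obj, (list, tuple)):
        return type(obj)(_to_cpu(v) for v in obj)
    return obj

def save(obj, path):
    # Sketch of a numpy-free save entry point for XLA tensors.
    torch.save(_to_cpu(obj), path)
```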
Separately, for existing checkpoints that were saved through the numpy path, I imagine something will need to be done as well.
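For example, a one-off migration could re-save old checkpoints in a trusted environment. A minimal sketch, assuming the checkpoint is a state_dict-like mapping of tensors (paths are illustrative):

```python
import torch

# Illustrative migration, in a trusted environment only: load a legacy
# checkpoint with weights_only=False, move tensors to CPU, and re-save
# so that future loads can use weights_only=True.
legacy = torch.load("old_xla_ckpt.pt", weights_only=False)
cpu_state = {k: v.cpu() if isinstance(v, torch.Tensor) else v
             for k, v in legacy.items()}
torch.save(cpu_state, "migrated_ckpt.pt")
state = torch.load("migrated_ckpt.pt", weights_only=True)
```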
cc @JackCaoG