Segmentation Fault for vmapped function accessing BatchedTensor.data #97161

@fdraxler

🐛 Describe the bug

The following code should either print the data contained in the BatchedTensor, or raise an error if accessing .data on a BatchedTensor is considered unsafe behavior (as .item() does). Instead, the code fails with a segmentation fault.

import torch

def foo(x):
    y = x.data  # <- Segmentation Fault
    print(y)
    return x

torch.func.vmap(foo)(torch.randn(3, 3))
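
For comparison, here is a minimal sketch of the behavior I would expect, based on how .item() is handled inside vmap (the exact error message is an assumption and may differ between releases):

import torch

def bar(x):
    # Inside vmap, .item() on a BatchedTensor does not crash; it is rejected
    # with a RuntimeError explaining that the operation is unsupported
    # (exact wording is an assumption here).
    return x.sum().item()

try:
    torch.func.vmap(bar)(torch.randn(3, 3))
except RuntimeError as e:
    print(f"RuntimeError: {e}")

# I would expect x.data to behave similarly: either return the underlying
# values or raise a RuntimeError, instead of segfaulting.

As a possible workaround in the meantime, using x.detach() instead of x.data might avoid the crash, assuming .detach() is supported on BatchedTensors, though I have not verified that it matches the semantics of .data here.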

Versions

PyTorch version: 2.0.0
Is debug build: False
CUDA used to build PyTorch: 11.8
ROCM used to build PyTorch: N/A

OS: Ubuntu 22.04.2 LTS (x86_64)
GCC version: (Ubuntu 11.3.0-1ubuntu1~22.04) 11.3.0
Clang version: Could not collect
CMake version: Could not collect
Libc version: glibc-2.35

Python version: 3.10.9 | packaged by conda-forge | (main, Feb 2 2023, 20:20:04) [GCC 11.3.0] (64-bit runtime)
Python platform: Linux-5.15.0-67-generic-x86_64-with-glibc2.35
Is CUDA available: True
CUDA runtime version: 12.1.66
CUDA_MODULE_LOADING set to: LAZY
GPU models and configuration: GPU 0: NVIDIA GeForce RTX 2080
Nvidia driver version: 470.161.03
cuDNN version: Could not collect
HIP runtime version: N/A
MIOpen runtime version: N/A
Is XNNPACK available: True

CPU:
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Address sizes: 43 bits physical, 48 bits virtual
Byte Order: Little Endian
CPU(s): 16
On-line CPU(s) list: 0-15
Vendor ID: AuthenticAMD
Model name: AMD Ryzen 7 3700X 8-Core Processor
CPU family: 23
Model: 113
Thread(s) per core: 2
Core(s) per socket: 8
Socket(s): 1
Stepping: 0
Frequency boost: enabled
CPU max MHz: 4426.1709
CPU min MHz: 2200.0000
BogoMIPS: 7186.22
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf rapl pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb cat_l3 cdp_l3 hw_pstate ssbd mba ibpb stibp vmmcall fsgsbase bmi1 avx2 smep bmi2 cqm rdt_a rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local clzero irperf xsaveerptr rdpru wbnoinvd arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold avic v_vmsave_vmload vgif v_spec_ctrl umip rdpid overflow_recov succor smca sme sev sev_es
Virtualization: AMD-V
L1d cache: 256 KiB (8 instances)
L1i cache: 256 KiB (8 instances)
L2 cache: 4 MiB (8 instances)
L3 cache: 32 MiB (2 instances)
NUMA node(s): 1
NUMA node0 CPU(s): 0-15
Vulnerability Itlb multihit: Not affected
Vulnerability L1tf: Not affected
Vulnerability Mds: Not affected
Vulnerability Meltdown: Not affected
Vulnerability Mmio stale data: Not affected
Vulnerability Retbleed: Mitigation; untrained return thunk; SMT enabled with STIBP protection
Vulnerability Spec store bypass: Mitigation; Speculative Store Bypass disabled via prctl and seccomp
Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization
Vulnerability Spectre v2: Mitigation; Retpolines, IBPB conditional, STIBP always-on, RSB filling, PBRSB-eIBRS Not affected
Vulnerability Srbds: Not affected
Vulnerability Tsx async abort: Not affected

Versions of relevant libraries:
[pip3] numpy==1.23.5
[pip3] pytorch-ignite==0.4.11
[pip3] pytorch-lightning==1.9.3
[pip3] torch==2.0.0
[pip3] torchaudio==2.0.0
[pip3] torchmetrics==0.9.2
[pip3] torchvision==0.15.0
[pip3] triton==2.0.0
[conda] blas 2.116 mkl conda-forge
[conda] blas-devel 3.9.0 16_linux64_mkl conda-forge
[conda] cudatoolkit 11.7.0 hd8887f6_10 nvidia
[conda] ffmpeg 4.3 hf484d3e_0 pytorch
[conda] ignite 0.4.11 py_0 pytorch
[conda] libblas 3.9.0 16_linux64_mkl conda-forge
[conda] libcblas 3.9.0 16_linux64_mkl conda-forge
[conda] liblapack 3.9.0 16_linux64_mkl conda-forge
[conda] liblapacke 3.9.0 16_linux64_mkl conda-forge
[conda] mkl 2022.1.0 h84fe81f_915 conda-forge
[conda] mkl-devel 2022.1.0 ha770c72_916 conda-forge
[conda] mkl-include 2022.1.0 h84fe81f_915 conda-forge
[conda] numexpr 2.8.0 mkl_py310h0afd4a5_2 conda-forge
[conda] numpy 1.23.5 py310h53a5b5f_0 conda-forge
[conda] pytorch 2.0.0 py3.10_cuda11.8_cudnn8.7.0_0 pytorch
[conda] pytorch-cuda 11.8 h7e8668a_3 pytorch
[conda] pytorch-lightning 1.9.3 pypi_0 pypi
[conda] pytorch-mutex 1.0 cuda pytorch
[conda] torchaudio 2.0.0 py310_cu118 pytorch
[conda] torchmetrics 0.9.2 pypi_0 pypi
[conda] torchtriton 2.0.0 py310 pytorch
[conda] torchvision 0.15.0 py310_cu118 pytorch

cc @ezyang @gchanan @zou3519 @Chillee @samdow @soumith @kshitij12345 @janeyx99

Labels

actionable, high priority, module: functorch (pertaining to torch.func or pytorch/functorch), triaged (this issue has been looked at by a team member, and triaged and prioritized into an appropriate module)
