
[functorch] fix batching rule for dropout #92975

Closed
wants to merge 3 commits

Conversation

kshitij12345
Collaborator

@kshitij12345 kshitij12345 commented Jan 25, 2023

Fixes #92283

The repro now works:

import torch
import torch.func
import torch.nn as nn

x = torch.randn(3, device='cuda')
y = torch.randn(1, 3, device='cuda')

def fn(x, y):
    # Previously the output of dropout was incorrectly [B, 3] (B=1), so `mean(1)` failed.
    # After the fix the output is [B, 1, 3] and `mean(1)` works.
    return x + nn.functional.dropout(y, 0.3).mean(1)


o = torch.func.vmap(fn, in_dims=(0, None), randomness='different')(x, y)

NOTE:
native_dropout_batching_rule(const Tensor& tensor, double p, c10::optional<bool> train) was called only for CUDA tensors. Hence this issue affected CUDA tensors but not CPU tensors.
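To illustrate the shape issue without torch, here is a minimal pure-Python sketch (not the actual functorch code): under vmap with randomness='different', the physical input to the dropout rule already carries a leading batch dimension B, so for the repro's y of logical shape [1, 3] the physical shape is [B, 1, 3]. A correct rule samples an independent mask over the full physical shape and preserves it; collapsing the output to [B, 3] is exactly what made `mean(1)` fail. The helpers below (`shape`, `dropout_physical`) are hypothetical, using nested lists as stand-ins for tensors.

```python
import random

def shape(t):
    # Shape of a nested-list "tensor" (assumes rectangular, non-empty lists).
    s = []
    while isinstance(t, list):
        s.append(len(t))
        t = t[0]
    return tuple(s)

def dropout_physical(t, p, rng):
    # Inverted dropout applied elementwise: each scalar is independently
    # zeroed with probability p, survivors scaled by 1/(1-p).
    # Crucially, the full physical shape (including the batch dim) is kept.
    if isinstance(t, list):
        return [dropout_physical(x, p, rng) for x in t]
    return 0.0 if rng.random() < p else t / (1.0 - p)

rng = random.Random(0)
B = 4
y = [[[1.0, 2.0, 3.0]] for _ in range(B)]   # physical shape (B, 1, 3)
out = dropout_physical(y, 0.3, rng)
assert shape(out) == (B, 1, 3)              # batch dim preserved, mean over dim 1 stays valid
```

With the shape preserved as [B, 1, 3], reducing over dim 1 yields [B, 3], which broadcasts cleanly against the vmapped x in the repro.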

Ref:

Tensor dropout(const Tensor& input, double p, bool train) {
  auto result = [&]() {
    NoNamesGuard guard;
    if (train && is_fused_kernel_acceptable(input, p)) {
      return std::get<0>(at::native_dropout(input, p, train));
    }
    return _dropout<false>(input, p, train);
  }();
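The dispatch above explains why the bug was CUDA-only. A simplified Python sketch of that decision (the `dropout_dispatch` helper is hypothetical, and `is_fused_kernel_acceptable` is assumed here to require a CUDA input, matching the NOTE above; the real ATen check also inspects p and training mode):

```python
def dropout_dispatch(is_cuda: bool, train: bool, p: float) -> str:
    # Sketch of the C++ lambda above: the fused path is only acceptable
    # for CUDA inputs with a nontrivial dropout probability.
    fused_ok = is_cuda and 0.0 < p < 1.0
    if train and fused_ok:
        return "at::native_dropout"   # hits native_dropout_batching_rule under vmap
    return "_dropout"                 # decomposed path; CPU tensors land here
```

So `dropout_dispatch(True, True, 0.3)` takes the `at::native_dropout` branch (the buggy batching rule), while CPU tensors always fall through to `_dropout` and were never affected.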

@pytorch-bot

pytorch-bot bot commented Jan 25, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/92975

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 6acfa4c:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@kshitij12345 kshitij12345 added the release notes: functorch release notes category; Pertaining to torch.func or pytorch/functorch label Jan 25, 2023
@kshitij12345 kshitij12345 marked this pull request as ready for review January 25, 2023 16:04
@Skylion007 Skylion007 self-requested a review January 25, 2023 17:37
Collaborator

@Skylion007 Skylion007 left a comment


Looks good to me now.

@kshitij12345
Collaborator Author

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Jan 26, 2023
@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

Successfully merging this pull request may close these issues.

Randomness 'different' results in weird behavior of Dropout
5 participants