DeBERTa/DeBERTa-v2/SEW Support for torch 1.11 #16043
Conversation
```python
logger = logging.get_logger(__name__)

convert_to_dtype = not version.parse(torch.__version__) < version.parse("1.11")
```
Maybe a better name for that value would be something more specific to its purpose, but then the name starts being long and @sgugger gets angry, wdyt?
```diff
- convert_to_dtype = not version.parse(torch.__version__) < version.parse("1.11")
+ convert_softmax_tensor_to_dtype = not version.parse(torch.__version__) < version.parse("1.11")
```
I'd maybe just add a `do_convert...` to make it a bit clearer it's a flag and not a function
don't mind the name after :-)
patrickvonplaten left a comment:
Thanks!
sgugger left a comment:
For the torch int div we wrote our own function that does the test internally. I think we should do this the same way and write our own `_softmax_backward_data` inside `pytorch_utils` which will do the test internally, then import this one here.
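The suggested pattern could look roughly like the following — a minimal runnable sketch, not the actual implementation. The flag name `is_torch_less_than_1_11` and the wrapper signature come from the diff below in this conversation; `TORCH_VERSION` is a hypothetical stand-in for `torch.__version__` so the sketch runs without torch installed:

```python
from packaging import version

# Hypothetical stand-in for torch.__version__ (assumption for this sketch).
TORCH_VERSION = "1.10.2"

# Computed once at import time; the torch version cannot change at runtime.
is_torch_less_than_1_11 = version.parse(TORCH_VERSION) < version.parse("1.11")

def softmax_backward_data(parent, grad_output, output, dim, self):
    """Call `_softmax_backward_data` with the signature the installed torch
    expects, hiding the version test from the modeling code."""
    if is_torch_less_than_1_11:
        # torch <= 1.10: the last argument is the input tensor itself.
        return parent._softmax_backward_data(grad_output, output, dim, self)
    # torch >= 1.11: the last argument is the input's dtype.
    return parent._softmax_backward_data(grad_output, output, dim, self.dtype)
```

Modeling code then imports this one wrapper and never branches on the torch version itself, mirroring the `torch_int_div` helper.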
Addressed your comment @sgugger, could you do a second review? As seen with Sylvain offline, I've moved out the
```python
is_torch_less_than_1_8 = version.parse(torch.__version__) < version.parse("1.8.0")
is_torch_less_than_1_11 = version.parse(torch.__version__) < version.parse("1.11")
```
The torch version cannot change during runtime, so this is harmless
```python
return torch.div(tensor1, tensor2, rounding_mode="floor")
```

```python
def softmax_backward_data(parent, grad_output, output, dim, self):
```
The `self` argument comes from the signature of the PyTorch function, which is identical.
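As a brief aside on the `torch_int_div` snippet above: `rounding_mode="floor"` is spelled out because floor division and truncation disagree on negative operands (the `rounding_mode` argument itself only exists from torch 1.8, hence the `is_torch_less_than_1_8` flag). A plain-Python illustration of the distinction, no torch needed:

```python
# Floor vs. truncation rounding on negative operands:
a, b = -7, 2

floor_div = a // b       # floor: rounds toward negative infinity
trunc_div = int(a / b)   # truncation: rounds toward zero

print(floor_div)  # -4
print(trunc_div)  # -3
```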
@sgugger @LysandreJik Thanks for your awesome work on building this immensely valuable ecosystem (and community!).

I don't think we have a patch planned. We will have 4.18 released probably next week instead :-)

AWESOME! looking forward to it!
Note that this evaluates to True for pre-releases such as '1.11.0a0+b6df043', so the error is still present.
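To make the pitfall concrete, here is a small sketch with `packaging`. PEP 440 orders pre-releases before the final release, so a 1.11 nightly — which already ships the new API — still satisfies the "less than 1.11" check; comparing on `base_version` is one possible workaround (the fix actually chosen in `transformers` may differ):

```python
from packaging import version

nightly = version.parse("1.11.0a0+b6df043")

# Pre-releases compare as older than the final release, so the
# "less than 1.11" check wrongly takes the pre-1.11 code path here.
print(nightly < version.parse("1.11"))  # True

# Possible workaround: strip the pre-release/local segments first.
base = version.parse(nightly.base_version)  # "1.11.0"
print(base < version.parse("1.11"))  # False
```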
The internal torch method `_softmax_backward_data` changed API between 1.10 and 1.11: its last argument went from being a tensor to being a dtype. This PR updates the concerned models so that they are correctly supported.
Torch 1.11: https://github.com/pytorch/pytorch/blame/e47a5a64bbf4d388b70397e3237f9d5710ee4c9c/tools/autograd/derivatives.yaml#L1861
Before: https://github.com/pytorch/pytorch/blame/768cfaa8f86bf7c7b0af441d1536f060274c27a0/tools/autograd/derivatives.yaml#L1704