
[PyTorch] Fix module types (LazyLinear -> Linear) #2225

Merged
merged 1 commit into from Jul 31, 2022

Conversation

@atgctg (Contributor) commented Jul 30, 2022

Description of changes:

In 6.3 Parameter Initialization, the LazyLinear modules surprisingly already have the type <class 'torch.nn.modules.linear.Linear'> during init, so the in-place updates in the PyTorch examples are silently skipped.

This PR fixes that by making the if statements check for the correct nn.Linear type.

Not applied:

def init_constant(module):
    if type(module) == nn.LazyLinear: # this is False
        nn.init.constant_(module.weight, 1)
        nn.init.zeros_(module.bias)
net.apply(init_constant)
net[0].weight.data[0], net[0].bias.data[0]
(tensor([ 0.1594,  0.9592, -1.4913,  0.8780]), tensor(0.))

Applied:

def init_constant(module):
    if type(module) == nn.Linear: # this is True
        nn.init.constant_(module.weight, 1)
        nn.init.zeros_(module.bias)
net.apply(init_constant)
net[0].weight.data[0], net[0].bias.data[0]
(tensor([1., 1., 1., 1.]), tensor(0.))

By submitting this pull request, I confirm that you can use, modify,
copy, and redistribute this contribution, under the terms of your
choice.

@atgctg atgctg changed the title Fix PyTorch module types (LazyLinear -> Linear) [PyTorch] Fix module types (LazyLinear -> Linear) Jul 30, 2022
@d2l-bot (Member) commented Jul 30, 2022

Job d2l-en/PR-2225/1 is complete.
Check the results at http://preview.d2l.ai/d2l-en/PR-2225/

@d2l-bot (Member) commented Jul 30, 2022

Job d2l-en/PR-2225/2 is complete.
Check the results at http://preview.d2l.ai/d2l-en/PR-2225/

@d2l-bot (Member) commented Jul 30, 2022

Job d2l-en/PR-2225/3 is complete.
Check the results at http://preview.d2l.ai/d2l-en/PR-2225/

@d2l-bot (Member) commented Jul 30, 2022

Job d2l-en/PR-2225/4 is complete.
Check the results at http://preview.d2l.ai/d2l-en/PR-2225/

@AnirudhDagar (Member) left a comment

Hi @atgctg, thanks for the PR. Great catch with this bug, which was overlooked in #2038. It is actually not surprising that the lazy layers have the torch.nn.modules.linear.Linear type: the PyTorch developers designed the lazy modules to convert to their non-lazy analogues after the first dry run (parameter initialization). See the docs.
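The class-swapping behavior described above can be sketched without torch. The classes below are illustrative stand-ins, not the real torch.nn API; in PyTorch the same effect is produced by LazyModuleMixin reassigning a module's __class__ after shapes are inferred:

```python
# Torch-free analogue of how PyTorch lazy modules morph into their
# concrete counterparts. The names Linear / LazyLinear / first_forward
# are hypothetical stand-ins for this sketch, not the real torch API.

class Linear:
    """Concrete layer (stand-in for nn.Linear)."""
    def __init__(self, out_features):
        self.out_features = out_features

class LazyLinear(Linear):
    """Lazy variant that becomes Linear after its first dry run."""
    cls_to_become = Linear

    def first_forward(self, in_features):
        # Infer the input shape on the first call, then swap classes,
        # mirroring what torch's lazy-module machinery does.
        self.in_features = in_features
        self.__class__ = self.cls_to_become

layer = LazyLinear(out_features=8)
assert type(layer) is LazyLinear       # exact type check matches the lazy class
layer.first_forward(in_features=4)
assert type(layer) is Linear           # after the dry run, it is Linear
assert type(layer) is not LazyLinear   # ...so `type(m) == LazyLinear` is False
```

This is why the book's init functions, which run after the dry run, must compare against nn.Linear rather than nn.LazyLinear.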

@AnirudhDagar AnirudhDagar merged commit 489ab4a into d2l-ai:master Jul 31, 2022
@astonzhang (Member) commented

@atgctg thanks. Could you send another PR to replace your GitHub id with your name in our acknowledgements:
https://github.com/d2l-ai/d2l-en/blob/master/chapter_preface/index.md
