NewRandnInit doesn't work as expected when mean=0.0 #48

FelixHo · 2021-07-20T04:50:53Z

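package main

import (
	"fmt"

	"github.com/sugarme/gotch"
	"github.com/sugarme/gotch/nn"
)

func main() {
	// Request a 3x5 tensor initialized from a normal distribution
	// with mean=0.0 and stdev=1.0 on the CPU.
	mat3x5 := nn.NewRandnInit(0.0, 1.0).InitTensor([]int64{3, 5}, gotch.CPU)
	fmt.Printf("%v", mat3x5)
}

Output: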

   1.000     1.000     1.000     1.000     1.000
   1.000     1.000     1.000     1.000     1.000
   1.000     1.000     1.000     1.000     1.000

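According to the line of code below, the random draw is multiplied by mean and stdev is added to it. When mean=0 the random term vanishes, so every element is initialized to stdev (1.0 in the example above).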

gotch/nn/init.go

Line 81 in 96b0967

data[i] = float32(rand.NormFloat64()*r.mean + r.stdev)

This is different from PyTorch's randn function:

import torch

""" Returns a tensor filled with random numbers from a normal distribution
with mean `0` and variance `1`"""

torch.randn(3, 5)

############### output ###########################
tensor([[ 0.1769, -2.0933, -0.8882,  0.0051,  0.9833],
        [-0.6342,  0.4093,  0.6266,  0.3935,  0.2045],
        [ 0.3055, -0.4522, -1.7044,  1.8426,  0.4553]])

I know MustRandn can be used as a workaround, but NewRandnInit is the default initialization method for Embedding, so this directly affects the initial embedding weights.

Is this a bug or is it designed this way for some reason?
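To make the question concrete, here is a minimal sketch that compares the expression quoted from init.go with the usual way of drawing a normal sample (scale a standard normal draw by stdev, then shift by mean). It uses only math/rand and does not call gotch. The names buggySample, fixedSample and fillNormal are hypothetical helpers made up for this illustration, and the "fixed" formula is the textbook one, not necessarily what the library ends up using.

```go
package main

import (
	"fmt"
	"math/rand"
)

// buggySample mirrors the expression quoted from init.go:
// the standard normal draw is scaled by mean and shifted by stdev.
func buggySample(mean, stdev float64) float32 {
	return float32(rand.NormFloat64()*mean + stdev)
}

// fixedSample uses the usual formula for N(mean, stdev^2):
// scale the standard normal draw by stdev, then shift by mean.
// This is an assumption about the intended behavior.
func fixedSample(mean, stdev float64) float32 {
	return float32(rand.NormFloat64()*stdev + mean)
}

// fillNormal (hypothetical helper) fills a slice of n values
// using the given sampling function.
func fillNormal(n int, mean, stdev float64, sample func(mean, stdev float64) float32) []float32 {
	data := make([]float32, n)
	for i := range data {
		data[i] = sample(mean, stdev)
	}
	return data
}

func main() {
	// With mean=0 the random term is multiplied by zero,
	// so every element equals stdev.
	fmt.Println(fillNormal(5, 0.0, 1.0, buggySample)) // prints "[1 1 1 1 1]"

	// With the operands swapped, the values vary around 0.
	fmt.Println(fillNormal(5, 0.0, 1.0, fixedSample))
}
```

The first line of output reproduces the all-ones matrix shown above; the second line changes from run to run.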


sugarme · 2021-07-20T14:10:38Z

@FelixHo,

Hopefully it's fixed now. Thanks for pointing this out.

sugarme added a commit that referenced this issue Jul 20, 2021

fixed #45 #48 RandInit

731513a

sugarme mentioned this issue Jul 20, 2021

fixed #45 #48 RandInit #49

Merged

sugarme added a commit that referenced this issue Jul 20, 2021

Merge pull request #49 from sugarme/init

37f3a59

fixed #45 #48 RandInit

sugarme added the bug label Jul 20, 2021

sugarme closed this as completed Jul 20, 2021
