nn.Embedding layer initialization #63

Open
ivanstepanovftw opened this issue Feb 7, 2024 · 0 comments
Comments

@ivanstepanovftw
I have a question regarding the nn.Embedding layer. It is a layer that mimics F.one_hot + nn.Linear(..., bias=False) but is implemented as a lookup table, an order of magnitude faster than the one-hot + linear combination.
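For reference, the two paths produce the same output. A minimal sketch of the equivalence (the sizes and index values here are arbitrary, chosen only for illustration):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

num_embeddings, embedding_dim = 10, 4  # arbitrary sizes for illustration
embedding = nn.Embedding(num_embeddings, embedding_dim)
idx = torch.tensor([3, 7, 1])

# Lookup-table path
out_lookup = embedding(idx)

# Equivalent one-hot + linear path: the embedding weight acts as the
# weight of a bias-free linear layer applied to one-hot vectors
one_hot = F.one_hot(idx, num_classes=num_embeddings).float()
out_linear = one_hot @ embedding.weight

assert torch.allclose(out_lookup, out_linear)
```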

By default, nn.Embedding is initialized from a normal distribution (mean 0, std 1). However, the first layer of SIREN expects uniformly distributed input on the interval [-1, 1].

  1. Should I initialize nn.Embedding with embedding.weight.uniform_(-1, 1) to match SIREN's expected input distribution?
  2. Can I use nn.Embedding as the first layer of SIREN, initialized as the paper proposes for the first layer - embedding.weight.uniform_(-1 / in_features, 1 / in_features) - to avoid two consecutive linear layers with no non-linearity between them? (Both options are sketched below.)
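A minimal sketch of what the two options would look like in code. This only restates the question, not a confirmed answer; the sizes are hypothetical, and in option 2, in_features is taken to be the number of embeddings, since the lookup replaces a bias-free linear layer over one-hot inputs of that width:

```python
import torch
import torch.nn as nn

num_tokens, hidden = 1024, 256  # hypothetical sizes

# Option 1: match SIREN's expected input distribution on [-1, 1]
emb1 = nn.Embedding(num_tokens, hidden)
with torch.no_grad():
    emb1.weight.uniform_(-1.0, 1.0)

# Option 2: treat the embedding itself as SIREN's first linear layer,
# using the paper's first-layer init U(-1/in_features, 1/in_features);
# here in_features = num_tokens (the width of the one-hot input)
emb2 = nn.Embedding(num_tokens, hidden)
with torch.no_grad():
    emb2.weight.uniform_(-1.0 / num_tokens, 1.0 / num_tokens)

# Under option 2 the sine non-linearity is applied to the lookup output,
# with omega_0 = 30 as in the SIREN paper
omega_0 = 30.0
x = torch.sin(omega_0 * emb2(torch.tensor([0, 1, 2])))
```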