I notice that there is no bias and no subtraction of mean in layer norm.
I understand no bias but I'm confused about the meaning of computing variance without subtraction of mean.
Normally, we compute variance, for example:

But it's different here. Why is that?
Hoping some explanation.