You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi ! I read Yarin Gal's paper and I did not understand how the weight regulariser and dropout regulariser are initialized. The author provided a formula, but it is not very clear (e.g what means prior length scale ? and which value to assign for this variable ?). Could you explain how you find the values used to inizialize the weight regulariser and the dropout regulariser ?
The text was updated successfully, but these errors were encountered:
It's been a little while since I looked at this, but from what I can tell this factors are talked about in section 4.4 of the paper. The length scale is talked about in more detail in appendix D of the paper and the method of determination is different for the different cases.
In the UCI case, it seems it was chosen based on validation data, and for MNIST a grid search was carried out - the results of which are shown in Figure 11.
Hi ! I read Yarin Gal's paper and I did not understand how the weight regulariser and dropout regulariser are initialized. The author provided a formula, but it is not very clear (e.g what means prior length scale ? and which value to assign for this variable ?). Could you explain how you find the values used to inizialize the weight regulariser and the dropout regulariser ?
The text was updated successfully, but these errors were encountered: