The multi-class model coded in section 93 does a surprisingly good job even without any training, and even better without `nn.ReLU`. No matter the seed used for initialization, at epoch 0, the test accuracy is always above 70%. Why is that? I can't get it down to about 50%. I observed that in the notebook I coded along, but it's the same in the notebook from your repository.
It must be something to do with the order in which the code is run. I suggest making a clean copy in your own notebook; see my example. For following the flow of Daniel's video the notebook is pretty good, but once you start experimenting, you quickly get lost in all those cells.
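To rule out cell-ordering effects, here is a minimal standalone sketch that scores a freshly initialized model for several seeds. The dataset settings (4 classes, 2 features, `cluster_std=1.5`), the layer sizes, and the helper name `untrained_accuracy` are my assumptions, not necessarily what the notebook uses, so adjust them to match yours.

```python
import torch
from torch import nn
from sklearn.datasets import make_blobs

# Assumed settings (not taken from the notebook): 4 classes, 2 features.
NUM_CLASSES, NUM_FEATURES = 4, 2

X, y = make_blobs(n_samples=1000, n_features=NUM_FEATURES, centers=NUM_CLASSES,
                  cluster_std=1.5, random_state=42)
X = torch.from_numpy(X).type(torch.float)
y = torch.from_numpy(y).type(torch.long)


def untrained_accuracy(seed: int) -> float:
    """Accuracy in percent of a freshly initialized model, before any training."""
    torch.manual_seed(seed)  # seed right before the model is created
    model = nn.Sequential(   # hypothetical architecture without nn.ReLU
        nn.Linear(NUM_FEATURES, 8),
        nn.Linear(8, 8),
        nn.Linear(8, NUM_CLASSES),
    )
    model.eval()
    with torch.inference_mode():
        preds = model(X).argmax(dim=1)
    return (preds == y).float().mean().item() * 100


accuracies = [untrained_accuracy(seed) for seed in range(10)]
for seed, acc in enumerate(accuracies):
    print(f"seed {seed}: {acc:.1f}%")
```

Because the model is rebuilt inside the function, nothing from an earlier training cell can leak into the epoch-0 number. With four balanced classes, uniform random guessing would average about 25%, but an untrained linear model is not a uniform guesser, so individual seeds can land well above or below that.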
When I decrease the learning rate, I can clearly see the accuracy climb from below 20% to 100%.
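A rough sketch of that experiment, so the climb is visible epoch by epoch. The blob data is a hypothetical stand-in generated directly in PyTorch, and `lr=0.01`, the layer sizes, and the 200 epochs are assumed values, not the ones from my notebook.

```python
import torch
from torch import nn

torch.manual_seed(42)

# Hypothetical stand-in data: 4 Gaussian blobs in 2D, 100 points each.
centers = torch.tensor([[-5.0, -5.0], [-5.0, 5.0], [5.0, -5.0], [5.0, 5.0]])
y = torch.arange(4).repeat_interleave(100)
X = centers[y] + torch.randn(400, 2)

model = nn.Sequential(nn.Linear(2, 8), nn.Linear(8, 8), nn.Linear(8, 4))
loss_fn = nn.CrossEntropyLoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)  # assumed small learning rate

history = []  # accuracy in percent, measured before each update
for epoch in range(200):
    model.train()
    logits = model(X)
    loss = loss_fn(logits, y)
    acc = (logits.argmax(dim=1) == y).float().mean().item() * 100
    history.append(acc)  # history[0] is the untrained model

    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

    if epoch % 20 == 0:
        print(f"epoch {epoch:3d} | loss {loss.item():.4f} | acc {acc:.1f}%")
```

With a small step size the model improves gradually, so the early epochs are not skipped over in a single update. For brevity this sketch scores the training data only; a proper comparison would hold out a test split.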
You could also try making the dataset more difficult: increase `cluster_std` to 4 or so. Then you can clearly see how the model is struggling.
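To get a feel for how much harder the data becomes, here is a small sketch that measures class overlap directly, without any model. It assumes the data comes from scikit-learn's `make_blobs`; the helper `nearest_center_accuracy` and the sample counts are hypothetical choices for illustration.

```python
import numpy as np
from sklearn.datasets import make_blobs


def nearest_center_accuracy(cluster_std: float) -> float:
    """Percent of points whose nearest true blob center matches their label."""
    X, y, centers = make_blobs(n_samples=1000, n_features=2, centers=4,
                               cluster_std=cluster_std, random_state=42,
                               return_centers=True)
    # Pairwise distances, shape (n_samples, n_centers).
    dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
    return float((dists.argmin(axis=1) == y).mean() * 100)


acc_easy = nearest_center_accuracy(1.5)
acc_hard = nearest_center_accuracy(4.0)
print(f"cluster_std=1.5: {acc_easy:.1f}% of points are closest to their own center")
print(f"cluster_std=4.0: {acc_hard:.1f}% of points are closest to their own center")
```

The lower this number, the more the blobs overlap. It is only a rough proxy for how well a simple classifier can do, but it shows why accuracy stops reaching 100% once the spread grows.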