BatchNorm_Before_vs_After

In deep learning, enhancing the training and convergence of deep neural networks has been a long-standing challenge. One technique that has demonstrated remarkable effectiveness in stabilizing and accelerating the training process is Batch Normalization. There are two distinct approaches to integrating Batch Normalization within a neural network architecture - before and after the activation functions in a neural network.

I used TensorFlow for the implementation of these two strategies.

After my implementation, I found out that the disparity in performance between the two implementations is minimal. Following training for 10 epochs, the "after activation" approach achieves a training accuracy of 0.8917, while the "before activation" approach achieves a slightly lower training accuracy of 0.8812. Similarly, in terms of test accuracy, the "after activation" strategy achieves a score of 0.7100, while the "before activation" strategy scores slightly lower at 0.7049.

The observations may differ due to various factors including the number of training epochs, the complexity of the dataset, and the architecture of the neural network.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
tf_batchnorm.ipynb		tf_batchnorm.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BatchNorm_Before_vs_After

About

Languages

Si-ddhartha/BatchNorm_Before_vs_After

Folders and files

Latest commit

History

Repository files navigation

BatchNorm_Before_vs_After

About

Topics

Resources

Stars

Watchers

Forks

Languages