This repository contains all the codes for the MLP question in HW3 of CE-40959: Deep Learning Course, presented by Dr. Soleymani in Sharif University of Technology. In this question, a complete set of Jupyter Notebook and python scripts is prepared for examining almost all critical factors in designing a Multi-layer Perceptron model.
In the jupyter notebook, some experiemnts are designed to be applied on farsi OCR dataset. As its name implies, it is like famous MNIST dataset but it consists of images of handwritten digits in farsi. By running these experiments, you can figure out how factors of
- Regularization techniques
- Activation functions
- Number of layers and units
- Weights initialization
affect the result of learning process. In addition, at the end of each section of the notebook, some visualizations on model parameters are requested with proper explanation.
Alongside the notebook, there some python files with TODO sections filled with proper lines of code. For Each TODO section, a comprehensive description of the required code is provided.