Babysitting the learning process

In this talk, I will explain how to train a neural network and how to select its hyperparameters. The talk explores how to choose an architecture suited to your specific problem and dataset size for better performance and generalization, why proper weight initialization is important for training, and which sanity checks, such as overfitting a single batch, are useful for debugging model training.
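As a rough illustration of the single-batch sanity check mentioned above, here is a minimal sketch in plain Python. It is not taken from the talk: the toy batch, the function names (`batch_loss`, `overfit_single_batch`), the learning rate, and the step count are all assumptions, and a single sigmoid unit stands in for a real network. The idea is that a correctly wired training loop should start at a loss close to ln 2 for two balanced classes and then drive the loss on one small batch towards zero.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def batch_loss(w, b, xs, ys):
    """Mean binary cross-entropy of a single sigmoid unit on one batch."""
    total = 0.0
    for x, y in zip(xs, ys):
        p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)
        p = min(max(p, 1e-12), 1.0 - 1e-12)  # guard against log(0)
        total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total / len(xs)


def overfit_single_batch(xs, ys, lr=0.5, steps=2000):
    """Run full-batch gradient descent on one fixed batch and record the loss."""
    # Zero init is acceptable for a single unit; multi-layer networks need
    # small random weights so that hidden units do not stay identical.
    w = [0.0] * len(xs[0])
    b = 0.0
    losses = [batch_loss(w, b, xs, ys)]
    for _ in range(steps):
        grad_w = [0.0] * len(w)
        grad_b = 0.0
        for x, y in zip(xs, ys):
            p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)
            err = p - y  # gradient of cross-entropy w.r.t. the pre-activation
            for j, xj in enumerate(x):
                grad_w[j] += err * xj / len(xs)
            grad_b += err / len(xs)
        w = [wi - lr * gi for wi, gi in zip(w, grad_w)]
        b -= lr * grad_b
        losses.append(batch_loss(w, b, xs, ys))
    return w, b, losses


# Hypothetical toy batch: the label equals the first feature.
batch_x = [[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [1.0, 1.0]]
batch_y = [0, 0, 1, 1]

w, b, losses = overfit_single_batch(batch_x, batch_y)
print(f"initial loss {losses[0]:.4f}, final loss {losses[-1]:.4f}")
```

If the final loss on such a batch does not fall well below the initial value, the problem is more likely in the training code (loss, gradients, or update step) than in the hyperparameters.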
