Reduce_mean to be changed to reduce_sum for better results | Course 2 W3A1 #42

lovenya · 2023-06-24T15:04:40Z

To quote the notebook itself

"
In step 1, the compute_total_loss function will only take care of summing the losses from one mini-batch of samples. Then, as you train the model (in section 3.3) which will call this compute_total_loss function once per mini-batch, step 2 will be done by accumulating the sums from each of the mini-batches, and finishing it with the division by the total number of samples to get the final cost value.

Computing the "total loss" instead of "mean loss" in step 1 can make sure the final cost value to be consistent. For example, if the mini-batch size is 4 but there are just 5 samples in the whole batch, then the last mini-batch is going to have 1 sample only. Considering the 5 samples, losses to be [0, 1, 2, 3, 4] respectively, we know the final cost should be their average which is 2. Adopting the "total loss" approach will get us the same answer. However, the "mean loss" approach will first get us 1.5 and 4 for the two mini-batches, and then finally 2.75 after taking average of them, which is different from the desired result of 2. Therefore, the "total loss" approach is adopted here.

"

lovenya · 2023-06-28T08:32:03Z

Completed, thank you

lovenya closed this as completed Jun 28, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce_mean to be changed to reduce_sum for better results | Course 2 W3A1 #42

Reduce_mean to be changed to reduce_sum for better results | Course 2 W3A1 #42

lovenya commented Jun 24, 2023

lovenya commented Jun 28, 2023

Reduce_mean to be changed to reduce_sum for better results | Course 2 W3A1 #42

Reduce_mean to be changed to reduce_sum for better results | Course 2 W3A1 #42

Comments

lovenya commented Jun 24, 2023

lovenya commented Jun 28, 2023