in #3 for the suggestion of linearly decreasing the learning rate through training. note that the provided model was trained with the old solver configuration. in our experiements, this new solver configuration leads to model accuracy that is greater than or equal to the old configuration.
Latest commit 0bc03d9
Mar 26, 2016