
New Chapter: hyperparameter selection #50

Closed · 2 tasks · mehradans92 opened this issue Sep 8, 2021 · 6 comments

Labels: enhancement (New feature or request)

Comments

mehradans92 (Contributor):

  • Initial draft
  • Applied example
whitead (Owner) commented Sep 8, 2021:

https://docs.ray.io/en/latest/tune/index.html
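
For context, a minimal Tune search might look roughly like the sketch below - a toy objective rather than anything from the chapter, and the `Tuner` API assumes a recent Ray 2.x release:

```python
# Toy Ray Tune search: minimize (x - 2)^2 over x in [-10, 10].
from ray import tune

def objective(config):
    # Each trial returns its metric(s) as a dict.
    return {"loss": (config["x"] - 2) ** 2}

tuner = tune.Tuner(
    objective,
    param_space={"x": tune.uniform(-10.0, 10.0)},
    tune_config=tune.TuneConfig(metric="loss", mode="min", num_samples=20),
)
results = tuner.fit()
print(results.get_best_result().config)
```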

kjappelbaum (Contributor) commented:

A bit more far-fetched would be experiment tracking.

I'm thinking of tools like Weights & Biases, which 1) has tooling for hyperparameter sweeps and 2) can visualize some chemistry.
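
For illustration, a W&B sweep is configured roughly like this - hypothetical project name, placeholder metric, and it assumes you are logged in to W&B:

```python
# Sketch of a Weights & Biases random sweep over a learning rate.
import wandb

sweep_config = {
    "method": "random",
    "metric": {"name": "val_loss", "goal": "minimize"},
    "parameters": {"lr": {"min": 1e-4, "max": 1e-1}},
}

def train():
    with wandb.init():
        lr = wandb.config.lr
        # ... build and fit a model with this learning rate ...
        wandb.log({"val_loss": (lr - 0.01) ** 2})  # placeholder metric

sweep_id = wandb.sweep(sweep_config, project="hyperparam-demo")  # hypothetical project
wandb.agent(sweep_id, function=train, count=10)
```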

kjappelbaum (Contributor) commented:

As a framework, I really enjoy using Optuna.
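
A minimal Optuna study, as a sketch with a toy objective:

```python
# Toy Optuna study: minimize (x - 2)^2 over x in [-10, 10].
import optuna

def objective(trial):
    x = trial.suggest_float("x", -10.0, 10.0)
    return (x - 2) ** 2

study = optuna.create_study(direction="minimize")
study.optimize(objective, n_trials=100)
print(study.best_params)
```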

whitead added the enhancement (New feature or request) label on Oct 5, 2021
mehradans92 (Contributor, Author) commented:

@whitead Do you want to use this package (https://docs.ray.io/en/latest/tune/index.html) in this chapter?

whitead (Owner) commented Dec 6, 2021:

@mehradans92 I think that comment was just me sharing some existing tools. It would be better to stay as package-agnostic as possible, though.

whitead (Owner) commented Mar 30, 2022:

@mehradans92 I read through it briefly. Looks great - a lot of work clearly went into it, and I can tell it will be very helpful. A few proposed changes:

  • Take another look at the layers chapter; there is some overlapping material (e.g., dropout, regularization, hyperparameters).
  • Cite some papers on learning rate schedulers and maybe add some information on momentum, since it's related. Some have also mentioned warm starts, which I'm not familiar with - maybe mention it.
  • Fig 8.2 - does it need to be a movie? It can be distracting while reading. I can certainly see the benefit for 8.1.
  • Batch size - I'd love to get 1-2 citations here on batch size and its connection to randomness in estimating the gradient.
  • Dropout - can you cite the paper and add a bit more on where it should be applied (all layers?) and whether it should be combined with other regularization?
  • It is really critical to use validation data for a hyperparameter search - otherwise you're implicitly fitting to the testing data. See here. Emphasize this point strongly and early, and make sure the code/examples use the word validation, not test, for the search (see the sketch after this list).
  • On Keras, can you reduce the logging level (verbose=0) so the output isn't rendered in the chapter?
  • You've split up the code nicely, but it'd be great to have some discussion, maybe showing snippets of how the methods work, before going right into training.
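
To make the validation and logging points concrete, here is a rough sketch of what the search loop could look like - placeholder data and model, not the chapter's code, with an explicit validation split, a `Dropout` layer, and `verbose=0`:

```python
# Sketch: search hyperparameters against *validation* data; the test set
# stays untouched until the very end. Data and model are placeholders.
import numpy as np
import tensorflow as tf
from sklearn.model_selection import train_test_split

X = np.random.rand(1000, 16).astype("float32")  # placeholder features
y = np.random.rand(1000).astype("float32")      # placeholder labels

# Hold out a test set, then carve a validation set out of the training data.
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)
X_train, X_val, y_train, y_val = train_test_split(X_train, y_train, test_size=0.25)

def build_model(dropout_rate):
    model = tf.keras.Sequential([
        tf.keras.layers.Dense(64, activation="relu"),
        tf.keras.layers.Dropout(dropout_rate),  # the regularizer under study
        tf.keras.layers.Dense(1),
    ])
    model.compile(optimizer=tf.keras.optimizers.Adam(1e-3), loss="mse")
    return model

best_loss, best_rate = float("inf"), None
for dropout_rate in (0.0, 0.2, 0.5):
    model = build_model(dropout_rate)
    # verbose=0 keeps per-epoch log lines out of the rendered chapter.
    model.fit(X_train, y_train, validation_data=(X_val, y_val),
              epochs=10, verbose=0)
    val_loss = model.evaluate(X_val, y_val, verbose=0)
    if val_loss < best_loss:
        best_loss, best_rate = val_loss, dropout_rate

print("best dropout:", best_rate, "validation loss:", best_loss)
# Only now, with hyperparameters fixed, evaluate once on (X_test, y_test).
```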
