
Hyperparameter tuning - discrepancy between readme and code? #73

Open
maruker opened this issue Oct 19, 2019 · 0 comments

maruker commented Oct 19, 2019

The readme states

For probing tasks, we used an MLP with a Sigmoid nonlinearity and tuned the nhid (in [50, 100, 200]) and dropout (in [0.0, 0.1, 0.2]) on the dev set.

However, in the code it looks like the parameters supplied by the user are always used as-is: no tuning takes place and no predefined hyperparameters are loaded. Maybe I missed something?

Should I do hyperparameter tuning to get results that are comparable to the literature?
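For reference, here is a minimal sketch of the manual grid search the readme seems to describe, in case the harness really does not tune these itself. `train_and_eval_probe` is a hypothetical helper standing in for one training/evaluation run of the MLP probe with the given settings; it is assumed to return (dev accuracy, test accuracy).

```python
def tune_probe(train_and_eval_probe):
    """Grid-search nhid and dropout from the readme, selecting by dev accuracy.

    `train_and_eval_probe(nhid=..., dropout=...)` is a hypothetical helper
    assumed to train the MLP probe once and return (dev_acc, test_acc).
    """
    best = {"devacc": -1.0, "acc": None, "nhid": None, "dropout": None}
    for nhid in [50, 100, 200]:
        for dropout in [0.0, 0.1, 0.2]:
            devacc, acc = train_and_eval_probe(nhid=nhid, dropout=dropout)
            # Keep the configuration with the highest dev accuracy and
            # report its test accuracy, as tuning is done on the dev set.
            if devacc > best["devacc"]:
                best = {"devacc": devacc, "acc": acc,
                        "nhid": nhid, "dropout": dropout}
    return best
```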
