Fix issue #100 #102

Closed
wants to merge 1 commit into from

Conversation

jkterry1 (Contributor) commented Jun 2, 2021

Close #100

```diff
@@ -65,7 +65,7 @@
     choices=["halving", "median", "none"],
 )
 parser.add_argument("--n-startup-trials", help="Number of trials before using optuna sampler", type=int, default=10)
-parser.add_argument("--n-evaluations", help="Number of evaluations for hyperparameter optimization", type=int, default=20)
+parser.add_argument("--n-evaluations", help="Number of timesteps at which the environment is evaluated during for a single hyperparameter combination", type=int, default=20)
```
Member

that's not what it does...
For hyperparameter optimization, you give each trial a maximum budget of n_timesteps and to decide whether to prune or not, you evaluate each trial every n_timesteps // args.n_evaluations

btw, could you merge your two doc PRs together and update the changelog?
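
To make that schedule concrete, here is a tiny illustrative sketch (the numbers are made up, not the zoo's actual defaults):

```python
# Illustrative numbers only: each trial trains for at most n_timesteps and is
# evaluated every eval_freq steps, i.e. at most n_evaluations times in total.
n_timesteps = 100_000    # training budget per trial (--n-timesteps)
n_evaluations = 20       # evaluations per trial (--n-evaluations)

eval_freq = n_timesteps // n_evaluations  # -> evaluate every 5_000 steps
```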

Contributor Author

"you give each trial a maximum budget of n_timesteps and to decide whether to prune or not"

After reading through the code, either I'm misunderstanding you, I'm going crazy, or that's wrong. n_timesteps is how many timesteps an individual training run for a set of hyperparameters runs for, and that's what's used as the reference in pruning, e.g. model.learn(self.n_timesteps, callback=eval_callback)

However, my new sentence did have some grammar errors; this is what it probably should've said:
"Maximum number of training timesteps for each set of hyperparameters during optimization"

Is that better?
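
For reference, a minimal runnable sketch of the call pattern being described (the environment and numbers are purely illustrative, not the zoo's code):

```python
# Hypothetical sketch: n_timesteps is the training budget for a single set of
# hyperparameters; in the actual optimization the eval callback also reports
# results to Optuna and can stop training early (pruning).
import gym
from stable_baselines3 import PPO
from stable_baselines3.common.callbacks import EvalCallback

n_timesteps = 50_000                               # budget for this one trial
n_evaluations = 20
eval_callback = EvalCallback(gym.make("CartPole-v1"), eval_freq=n_timesteps // n_evaluations)

model = PPO("MlpPolicy", gym.make("CartPole-v1"))  # hyperparameters would be sampled per trial
model.learn(n_timesteps, callback=eval_callback)   # trains for at most n_timesteps
```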

Member

"Maximum number of training timesteps for each set of hyperparameters during optimization"

This is what n_timesteps is when doing hyperparameter optimization.

And using the TrialEvalCallback, we can stop each trial early by returning False according to the pruner:

if self.trial.should_prune():

which results in a trial doing fewer than n_timesteps steps.

And to decide whether to prune or not, you have regular evaluations every n_timesteps // args.n_evaluations, so a maximum of args.n_evaluations in total.

If you want, there is a self contained example in the optuna repo or here: https://github.com/araffin/rl-handson-rlvs21/blob/main/optuna/sb3_simple.py
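
For reference, a condensed sketch of such a trial-aware evaluation callback, simplified from the pattern in the linked example (the real implementation has more options):

```python
import optuna
from stable_baselines3.common.callbacks import EvalCallback


class TrialEvalCallback(EvalCallback):
    """Evaluate the agent periodically and report results to Optuna for pruning."""

    def __init__(self, eval_env, trial: optuna.Trial, n_eval_episodes: int = 5, eval_freq: int = 10_000, **kwargs):
        super().__init__(eval_env, n_eval_episodes=n_eval_episodes, eval_freq=eval_freq, **kwargs)
        self.trial = trial
        self.eval_idx = 0
        self.is_pruned = False

    def _on_step(self) -> bool:
        if self.eval_freq > 0 and self.n_calls % self.eval_freq == 0:
            # The parent class runs the evaluation and updates self.last_mean_reward
            super()._on_step()
            self.eval_idx += 1
            self.trial.report(self.last_mean_reward, self.eval_idx)
            # Returning False stops training early, so the trial does fewer than n_timesteps steps
            if self.trial.should_prune():
                self.is_pruned = True
                return False
        return True
```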

jkterry1 closed this Jun 9, 2021
jkterry1 mentioned this pull request Jun 9, 2021
jkterry1 deleted the patch-2 branch December 16, 2021 21:10

Successfully merging this pull request may close these issues.

[Feature Request] Documentation for n_evaluations flag should be improved