Skip to content

Commit e9cd8f5

Browse files
feat(a3c): use stepped learning rate decay
- a high learning rate works well up front, but doesn't smooth out really.
1 parent 15cc58a commit e9cd8f5

File tree

1 file changed

+4
-1
lines changed

1 file changed

+4
-1
lines changed

libraries/mathy_python/mathy/agents/policy_value_model.py

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -41,7 +41,10 @@ def __init__(
4141
super(PolicyValueModel, self).__init__(**kwargs)
4242
if args is None:
4343
args = BaseConfig()
44-
self.optimizer = tf.keras.optimizers.Adam(lr=args.lr)
44+
lr_schedule = tf.keras.optimizers.schedules.ExponentialDecay(
45+
args.lr, decay_steps=100000, decay_rate=0.96, staircase=True
46+
)
47+
self.optimizer = tf.keras.optimizers.Adam(learning_rate=lr_schedule)
4548
self.args = args
4649
self.predictions = predictions
4750
self.embedding = MathyEmbedding(self.args)

0 commit comments

Comments
 (0)