Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[RLlib] fix VTrace in impala_tf_policy to support Keras 3 #45562

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

RocketRider
Copy link
Contributor

Fix ValueError: Argument learning_rate should be float, or an instance of LearningRateSchedule, or a callable (that takes in the current iteration value and returns the corresponding learning rate value). Received instead: learning_rate=<tf.Variable 'lr:0' shape=() dtype=float32, numpy=0.0005>

Fixes: #45050

Fix ValueError: Argument `learning_rate` should be float, or an instance of LearningRateSchedule, or a callable (that takes in the current iteration value and returns the corresponding learning rate value). Received instead: learning_rate=<tf.Variable 'lr:0' shape=() dtype=float32, numpy=0.0005>

Signed-off-by: Michael Möbius <e-mail@rocketrider.eu>
@RocketRider RocketRider changed the title fix VTrace in impala_tf_policy to support Keras 3 [RLlib] fix VTrace in impala_tf_policy to support Keras 3 May 25, 2024
@anyscalesam anyscalesam added triage Needs triage (eg: priority, bug/not-bug, and owning component) rllib RLlib related issues labels May 29, 2024
@RocketRider
Copy link
Contributor Author

@ArturNiederfahrenhorst could you take a look? I would appreciate it! Would love to see Keras 3 support in rllib, and it is just two small PRs away ;)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
rllib RLlib related issues triage Needs triage (eg: priority, bug/not-bug, and owning component)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[RLlib] ValueError in initialization of ImpalaTF2Policy
2 participants