Skip to content
This repository has been archived by the owner on Oct 6, 2023. It is now read-only.

请教下,train-ppo.py中actor-model和reward模型同为reward模型? #7

Closed
glsoon opened this issue May 24, 2023 · 0 comments
Closed

Comments

@glsoon
Copy link

glsoon commented May 24, 2023

No description provided.

@glsoon glsoon closed this as completed May 24, 2023
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant