what are the best performing models? #3

Open
shouldsee opened this issue Sep 20, 2022 · 2 comments

Comments

@shouldsee
Collaborator

Thanks for sharing! I have a few questions:

  • What is the current workflow for implementing and testing new models?
  • Which metrics should be calculated for new models?
  • What are the best models so far?
@Cybernetic1
Owner

Hey, sorry I did not see your message.

  1. First we find an algorithm (such as PPO or Soft Actor-Critic) and adapt its code to solve Tic Tac Toe. The first version would use a board vector as input, and the second version would use a logic proposition embedding.
  2. For each game the highest score is 20. I just want to demonstrate convergence first.
  3. All the models tested so far -- based on naive policy gradient -- failed to converge to the highest score. Some of them reached close to the highest score but were unstable.
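
For illustration only, here is a minimal sketch of what a board-vector representation of Tic Tac Toe could look like. It is not taken from the repository: the encoding (+1, -1, 0) and the names `new_board`, `legal_moves`, and `winner` are assumptions made for this example. Scoring is left out on purpose, since this thread does not say how the score is computed.

```python
from typing import List, Optional

# Hypothetical encoding (an assumption, not the repository's):
# +1 for X, -1 for O, 0 for an empty square, squares indexed 0..8 row by row.
X, O, EMPTY = 1, -1, 0

WIN_LINES = [
    (0, 1, 2), (3, 4, 5), (6, 7, 8),  # rows
    (0, 3, 6), (1, 4, 7), (2, 5, 8),  # columns
    (0, 4, 8), (2, 4, 6),             # diagonals
]


def new_board() -> List[int]:
    """Return an empty board as a flat 9-element vector."""
    return [EMPTY] * 9


def legal_moves(board: List[int]) -> List[int]:
    """Indices of empty squares, i.e. the actions a policy may choose from."""
    return [i for i, v in enumerate(board) if v == EMPTY]


def winner(board: List[int]) -> Optional[int]:
    """Return X or O if that player holds a full line, otherwise None."""
    for a, b, c in WIN_LINES:
        if board[a] != EMPTY and board[a] == board[b] == board[c]:
            return board[a]
    return None
```

A vector like this could be fed directly to a policy network, with `legal_moves` used to mask out occupied squares before sampling an action.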

@shouldsee
Collaborator Author

OK, thanks. I guess the core question is: how is the score calculated? Which function did you use to calculate it?
