Dual head network like AlphaZero #560

b0n541 · 2023-06-08T20:08:54Z

I would like to use KotlinDL to create a AlphaZero like dual head network for my game.

Unfortunately I haven't found any hint on how to accomplish that.

Is this already possible or would it be a new feature to be implemented?

zaleslaw · 2023-06-09T09:17:33Z

Hi @b0n541 if you want to create an AlphaZero, could you please share some notes about Keras/PyTorch implementation for that model?

Probably after that I could say more about is it possible or not

b0n541 · 2023-06-09T17:04:57Z

I have found the following information:

https://medium.com/applied-data-science/alphago-zero-explained-in-one-diagram-365f5abf67e0
https://github.com/deepmind/open_spiel/blob/master/docs/alpha_zero.md#model
https://github.com/deepmind/open_spiel/blob/master/open_spiel/python/algorithms/alpha_zero/model.py

What we would need is a way to split the network after the first couple of layers into two stacks of following layers with dedicated outputs for the game value prediction and the move prediction.

EgoLLC · 2024-05-12T19:23:11Z

Do you mean reinforcement learning?

Can I implement reinforcement learning in Kotlin DL to play chess or any other 1v1 board game where the model will play against itself?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dual head network like AlphaZero #560

Dual head network like AlphaZero #560

b0n541 commented Jun 8, 2023

zaleslaw commented Jun 9, 2023

b0n541 commented Jun 9, 2023

EgoLLC commented May 12, 2024

Dual head network like AlphaZero #560

Dual head network like AlphaZero #560

Comments

b0n541 commented Jun 8, 2023

zaleslaw commented Jun 9, 2023

b0n541 commented Jun 9, 2023

EgoLLC commented May 12, 2024