Partial Python rewrite #361

taylorhansen · 2023-05-09T22:26:50Z

With the growing complexity of the RL/ML side of this project, the training code is starting to hit the limits of the current TFJS/Node capabilities, having to re-implement some frameworks/algorithms from scratch (training loop, attention layers, etc) and hitting bugs and performance issues. Part of this project will be rewritten in Python in order to better take advantage of deep learning libraries and resources.

Since the simulator used by this project is written in JavaScript, inevitably part of code will have to remain in that language, while connecting it to the main Python training script (or inference server in PsBot workflow) via an interop library. After some iteration the best place to draw the JS-Python barrier seems to be at the BattleAgent level, where a JSON describing the battle state can be sent to Python to calculate inferences, do ML, etc. and then send action responses back. Most of the PsBot and BattleState modules will remain unchanged and can still take full advantage of the modular PS libraries that are currently being used, while inference and training code will be rewritten in Python and further features/improvements built on top of that.

scheibo · 2023-05-21T19:47:37Z

Since the simulator used by this project is written in JavaScript, inevitably part of code will have to remain in that language, while connecting it to the main Python training script (or inference server in PsBot workflow) via an interop library

I'm not sure about the scope this project is interested in (Gen 4? All gens?), but https://github.com/pkmn/engine (which has a Python driver, https://github.com/AnnikaCodes/PyKMN) is a more accurate and much faster engine for Gen 1 (and aims to eventually support further old gens) and may be worth considering.

taylorhansen · 2023-05-23T21:38:12Z

Thanks for bringing this to my attention!

Currently only Gen 4 Random Battles are being supported for now since it's simple enough but not too far removed from modern gen mechanics so that once I confirm a good ML algorithm and a stable underlying framework I can start writing Gen 5-9 and/or doubles versions. It seems that this libpkmn only supports Gen 1 so I can't use it right now, but once it's more fully featured for higher gens I'd be interested in switching out @pkmn/sim for it to try and speed up model training.

Thanks for telling me about this library and I'll keep watch for new updates.

taylorhansen added enhancement Something should be changed ai Has to do with the AI training Has to do with the training script labels May 9, 2023

taylorhansen self-assigned this May 9, 2023

taylorhansen closed this as completed in 4e89b5b Jun 19, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Partial Python rewrite #361

Partial Python rewrite #361

taylorhansen commented May 9, 2023 •

edited

Loading

scheibo commented May 21, 2023

taylorhansen commented May 23, 2023

Partial Python rewrite #361

Partial Python rewrite #361

Comments

taylorhansen commented May 9, 2023 • edited Loading

scheibo commented May 21, 2023

taylorhansen commented May 23, 2023

taylorhansen commented May 9, 2023 •

edited

Loading