A self-play reinforcement learning system for learning to battle on Pokémon Showdown. This code was used in the paper *A Self-Play Policy Optimization Approach To Battling Pokémon*.
There are three pieces of functionality that make this system work:

- Handling communications with the outside world. Currently it is able to connect to a generic Pokémon Showdown server using the websocket interface. It can also battle over the stdio interface provided by `./pokemon-showdown simulate-battle` (a sketch of this interface follows this list).
- Processing Showdown server messages into a structured format. We repurpose the Showdown client code to create a headless client -- one that does not interact with DOM elements and does not assume it is running on a web page. To have the resulting JavaScript code work with Python, we embed a V8 runtime (using Python Mini Racer) in our program (a sketch of this embedding follows the build instructions below).
- Performing an action in-game and learning from its mistakes. The main decision maker is a neural network. See the paper for high-level details and a schematic of the neural network architecture.
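To make the stdio interface concrete, here is a minimal sketch (not part of this codebase) that drives a battle from Python, assuming a checkout of the Pokémon Showdown repository (downloaded by the install script below). The `>start` and `>player` commands come from Showdown's simulator protocol, documented in `sim/SIM-PROTOCOL.md` of the Showdown repository; the player names and the fixed read loop are illustrative only.

```python
import subprocess

# Spawn the simulator; it speaks a line-oriented protocol over stdio.
proc = subprocess.Popen(
    ["./pokemon-showdown", "simulate-battle"],
    stdin=subprocess.PIPE,
    stdout=subprocess.PIPE,
    text=True,
)

def send(command):
    proc.stdin.write(command + "\n")
    proc.stdin.flush()

send('>start {"formatid":"gen7randombattle"}')
send('>player p1 {"name":"Alice"}')
send('>player p2 {"name":"Bob"}')

# Output arrives as newline-delimited blocks ("update", "sideupdate", ...),
# including |request| messages that ask each player for a decision.
for _ in range(40):
    line = proc.stdout.readline()
    if not line:
        break
    print(line.rstrip())
```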
Most development happens in a local Docker container. To set that up:
```
docker build . --tag metagrok:latest
```
This Docker image pins the versions of three main components:
- Python environment and conda packages
- Node.js version
- Pokémon Showdown server commit SHA
Any update to one of these components requires a rebuild of the Docker container. In particular, no source code (i.e. nothing in the `metagrok/` or `js/` directories) is frozen.

For convenience during development, any new conda packages that are installed should simply be appended to `scripts/install-more-conda-packages.sh`.
To develop code in this environment, run in a terminal window:
```
docker run -it -v $(pwd):/root/workspace --entrypoint /bin/bash metagrok:latest

# inside the Docker instance
(metagrok) [root@6a31f1d57424]# cd workspace
```
The first thing to do is to set up the correct Node version, download both Showdown repos, and create the headless client:
```
scripts/install.sh
scripts/compile-headless-client.sh
```

This creates the headless client at `build/engine.js`.
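To illustrate how the headless client is consumed, here is a minimal sketch of the V8 embedding, using the `build/engine.js` produced by the step above. The entry-point name is hypothetical; the real exported functions live in the `js/` sources.

```python
from py_mini_racer import MiniRacer

ctx = MiniRacer()

# Load the compiled headless client into the embedded V8 runtime.
with open("build/engine.js") as f:
    ctx.eval(f.read())

# Hypothetical call into the client; the actual exported names differ.
# The idea is to hand raw protocol text to JS and get structured state back.
# state = ctx.call("Engine.receive", "|request|{...}")
```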
The rest of this section contains common things one might want to do with a Showdown bot.
Pit two players against each other locally:

```
./rp metagrok/exe/head2head.py \
    --format gen7randombattle \
    --p1 ,metagrok.pkmn.engine.baselines.MostDamageMovePlayerTypeAware \
    --p2 metagrok.pkmn.models.v3_capacity.QuadCapacity:static/sample-v3-quad-model.pytorch \
    --num-matches 10
```
Evaluate a model against opponents on a Showdown server:

```
# First, set up a Showdown server on localhost:8000.
# host.docker.internal is Docker's DNS name for the host machine.
./rp metagrok/exe/smogon_eval.py \
    metagrok.pkmn.models.v3_capacity.QuadCapacity:static/sample-v3-quad-model.pytorch \
    --num-matches 8 \
    --max-concurrent 4 \
    --host host.docker.internal --port 8000
```
Run the integrated self-play reinforcement learning script on an experiment spec:

```
./rp metagrok/exe/integrated_rl_script.py \
    expts/XX-test.json \
    data/test-integrated-rl-script
```
Yes, there are some unit tests, though code coverage is woefully low. In the Docker environment, run `nose2` to execute all of the Python unit tests.
There's a lot of jank in this code, so there's a lot to do. A few things that come to mind for me:

- Make it so that we can have a different `dex` directory per model
- Fix `challenge_bot.py` (an agent on a Showdown server that only responds to challenges)
- Refactor the code so that this project makes sense as a package on PyPI
A less trivial (but way more exciting) improvement would be to make this work with doubles (and thereby work with VGC formats)!
Please open an issue with a proposed plan before starting to do any work!