Standard RL benchmarks are often fixed, making it difficult to observe how strategies scale for different complexities. CARTerpillar is an extension of the classic CartPole environment to multiple carts, where the number of carts
This is a gym-based implementation of the CARTerpillar environment. The env defined in carterpillar.py strictly adheres to the gym standards to help users adopt it as fast as possible.
To experiment with it, you just need to have the gymnasium library installed. Then, you can clone this repo and start using it as follows:
from gymnasium.wrappers import TimeLimit
from CARTerpillar import CARTerpillarEnv
env = CARTerpillar(n_carts=5)
env = TimeLimit(env, max_episode_steps=500)
state, _ = env.reset()
state, reward, done, info = env.step(env.action_space.sample())
