Evaluating the behavioural changes of the OpenAI cart pole task solved with a Full Microbial Genetic Algorithm
We optimised a Full Microbial Genetic Algorithm (FMGA) to solve the OpenAI cart pole task: finding a genotype that could keep the pole up for 500 episodes.
Having found a correlation between the number of epochs, the population size, and the fitness of the best individual, we trained an FMGA with 50 individuals over 1000 epochs to get the fittest controller.
We examined how the position, velocity, angle and rotation rate of the most fit controller at different numbers of epochs change. We found patterns in the failures of controllers, such as an increased velocity before falling. We also found that the fittest controllers keep a consistent pattern in each observed behaviour until the last episodes.