Algorithm used: Q-learning (TD(0))
State space: {strings of 0's, 1's and 2's}
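
As a rough illustration of how a one-step Q-learning (TD(0)) update could look with string-encoded states, here is a minimal sketch. It is not taken from this repository: the names `Q`, `q_update`, `ALPHA`, and `GAMMA`, the hyperparameter values, and the short example state strings are all hypothetical, and the meaning of the digits 0, 1, and 2 is left open.

```python
from collections import defaultdict

ALPHA = 0.5  # learning rate (assumed value, not from the source)
GAMMA = 0.9  # discount factor (assumed value, not from the source)

# Q-table keyed by (state, action); unseen pairs default to 0.0.
# States are strings over the characters '0', '1' and '2'.
Q = defaultdict(float)


def q_update(state, action, reward, next_state, next_actions):
    """Apply one Q-learning (TD(0)) update and return the new Q-value."""
    # Greedy value of the next state; 0.0 when no actions remain (terminal).
    best_next = max((Q[(next_state, a)] for a in next_actions), default=0.0)
    td_target = reward + GAMMA * best_next
    # Move the current estimate a fraction ALPHA toward the TD target.
    Q[(state, action)] += ALPHA * (td_target - Q[(state, action)])
    return Q[(state, action)]
```

For example, `q_update("000", 1, 1.0, "010", [])` treats `"010"` as terminal and moves `Q[("000", 1)]` halfway from 0 toward the reward. Using a dictionary keyed by the state string avoids enumerating the state space up front, which is one common choice for tabular methods of this kind.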