Value-Based

TD
mushroom_rl.algorithms.value.td

Batch TD
mushroom_rl.algorithms.value.batch_td

DQN
mushroom_rl.algorithms.value.dqn