notebook training curve claims #9

dylwil3 · 2023-06-06T16:55:06Z

Some of the claims in the notebook about training graphs looking the same don't seem to be quite right. For example, the PBRS wrapper that aims for 'zero' value does, in fact, make an improvement in the small lake training. More confusingly, the 'initializing Q table' training and PBRS training don't seem to be the same- even though a paper claims they should be identical. Is this a seeding issue only or something more?

dylwil3 added the bug Something isn't working label Jun 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

notebook training curve claims #9

notebook training curve claims #9

dylwil3 commented Jun 6, 2023

notebook training curve claims #9

notebook training curve claims #9

Comments

dylwil3 commented Jun 6, 2023