My PPO RL model to get feedback
Feedback-based Reinforcement Learning models receive feedback in order to improve the output based on the feedback. Here an outer feedback is designed to do so.
I have implemented the model in a Visual Studio Code notebook.