[Docs] Should manually-defined agent behavior be allowed on the leaderboards? #2021

trevormcinroe · 2020-08-11T16:57:29Z

There are a few solutions on the leaderboard with 0 episodes before solve.

Observing the code in these solutions shows that the agent behavior is simply a static, manually-defined policy. For example:

class Agent:
   def decide(self, observation):
       position, velocity, angle, angle_velocity = observation
       action = int(3. * angle + angle_velocity > 0.)
       return action

This algorithm is obviously not using any tenets of reinforcement learning, or even machine learning in general.
Even if this policy was jotted down from an actual RL-learned policy, it is not genuine to say that it was learned in 0 episodes.

The leaderboard itself is for fun, yes. But you could also argue that it is a place for RL beginners to read through code and learn.

This leaderboard is community-driven, so I ask the community: should solutions like this be allowed to occupy the top spot on the leaderboard?

jkterry1 · 2021-08-21T02:42:46Z

It should be allowed. It's instructive for people to see useful things.

trevormcinroe changed the title ~~Should manually-defined agent behavior be allowed on the leaderboards?~~ [Docs] Should manually-defined agent behavior be allowed on the leaderboards? Aug 11, 2020

jkterry1 closed this as completed Aug 21, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Docs] Should manually-defined agent behavior be allowed on the leaderboards? #2021

[Docs] Should manually-defined agent behavior be allowed on the leaderboards? #2021

trevormcinroe commented Aug 11, 2020

jkterry1 commented Aug 21, 2021

[Docs] Should manually-defined agent behavior be allowed on the leaderboards? #2021

[Docs] Should manually-defined agent behavior be allowed on the leaderboards? #2021

Comments

trevormcinroe commented Aug 11, 2020

jkterry1 commented Aug 21, 2021