Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Docs] Should manually-defined agent behavior be allowed on the leaderboards? #2021

Closed
trevormcinroe opened this issue Aug 11, 2020 · 1 comment

Comments

@trevormcinroe
Copy link

There are a few solutions on the leaderboard with 0 episodes before solve.

Observing the code in these solutions shows that the agent behavior is simply a static, manually-defined policy. For example:

class Agent:
   def decide(self, observation):
       position, velocity, angle, angle_velocity = observation
       action = int(3. * angle + angle_velocity > 0.)
       return action

This algorithm is obviously not using any tenets of reinforcement learning, or even machine learning in general.
Even if this policy was jotted down from an actual RL-learned policy, it is not genuine to say that it was learned in 0 episodes.

The leaderboard itself is for fun, yes. But you could also argue that it is a place for RL beginners to read through code and learn.

This leaderboard is community-driven, so I ask the community: should solutions like this be allowed to occupy the top spot on the leaderboard?

@trevormcinroe trevormcinroe changed the title Should manually-defined agent behavior be allowed on the leaderboards? [Docs] Should manually-defined agent behavior be allowed on the leaderboards? Aug 11, 2020
@jkterry1
Copy link
Collaborator

It should be allowed. It's instructive for people to see useful things.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants