Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

python wrapper over sarsa libraries #51

Merged
merged 7 commits into from
Oct 23, 2017

Conversation

DurgeshSamant
Copy link
Contributor

This PR demonstrates how one can write thin python wrappers over the existing sarsa libraries. Such wrappers can allow one to code up high level agents in python that can directly call the fast C++ sarsa library functions.

As an example, this PR also contains a high level sarsa agent written in python. This agent is a python paraphrase of the existing high level sarsa offense agent written in C++. A script that runs this agent is also included.

In comparison to the C++ agent, the python agent obtains < 0.5 % difference in mean goal scoring % when averaged over 20 runs. This is when the fullstate flag is ON.

However, in case of partial observability, the python agent obtains 0 - 5% less goal scoring percentage when compared to the C++ agent. Despite my Best efforts, I am unable to pin down the reason for this.

It would be great if the maintainers and the broader community could replicate this issue and find ways to overcome it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants