Implement the RL environment (familiarize with the github examples) #25

Closed
aaronkurz opened this issue Nov 23, 2021 · 1 comment

aaronkurz commented Nov 23, 2021

Take a look at https://github.com/david-cortes/contextualbandits. Omar is already taking care of installing the package and its requirements in the Dockerfile. Work through the examples there and implement a basic version of the RL env/agent. Most importantly, understand how the RL env and agent are set up, which methods are important, and how they work together. Start with the examples and work your way up from there. The specific method does not matter right now. Define a VERY simple reward function. The focus is on starting the execution of batches of instances from within the RL env and on orchestrating the different components of the prototype.
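To make the env/agent split concrete, here is a minimal sketch of the loop described above, in plain Python with the standard library only. It does not use the contextualbandits API. `ToyEnv`, `EpsilonGreedyAgent`, `reward`, `train`, the context labels, and the success probabilities are all hypothetical names and values chosen for illustration, and the epsilon-greedy rule is just a placeholder method.

```python
import random
from collections import defaultdict


def reward(succeeded: bool) -> float:
    """Very simple reward (assumption): 1 for a successful instance, else 0."""
    return 1.0 if succeeded else 0.0


class ToyEnv:
    """Hypothetical environment that executes a batch of instances."""

    def __init__(self, success_prob, seed=0):
        # success_prob[context][action] -> probability that an instance succeeds
        self.success_prob = success_prob
        self.rng = random.Random(seed)

    def sample_contexts(self, batch_size):
        labels = sorted(self.success_prob)
        return [self.rng.choice(labels) for _ in range(batch_size)]

    def run_batch(self, contexts, actions):
        # Simulate one instance per (context, action) pair and score it.
        return [
            reward(self.rng.random() < self.success_prob[c][a])
            for c, a in zip(contexts, actions)
        ]


class EpsilonGreedyAgent:
    """Tabular epsilon-greedy agent with one value estimate per (context, action)."""

    def __init__(self, n_actions, epsilon=0.1, seed=0):
        self.n_actions = n_actions
        self.epsilon = epsilon
        self.rng = random.Random(seed)
        self.counts = defaultdict(int)    # (context, action) -> number of pulls
        self.values = defaultdict(float)  # (context, action) -> mean reward

    def choose(self, contexts):
        actions = []
        for c in contexts:
            if self.rng.random() < self.epsilon:
                # Explore: pick a random action.
                actions.append(self.rng.randrange(self.n_actions))
            else:
                # Exploit: pick the action with the highest estimate so far.
                best = max(range(self.n_actions), key=lambda a: self.values[(c, a)])
                actions.append(best)
        return actions

    def update(self, contexts, actions, rewards):
        for c, a, r in zip(contexts, actions, rewards):
            key = (c, a)
            self.counts[key] += 1
            # Incremental running mean of the observed rewards.
            self.values[key] += (r - self.values[key]) / self.counts[key]


def train(env, agent, n_batches=200, batch_size=20):
    """Orchestration loop: sample contexts, act, run the batch, learn."""
    for _ in range(n_batches):
        contexts = env.sample_contexts(batch_size)
        actions = agent.choose(contexts)
        rewards = env.run_batch(contexts, actions)
        agent.update(contexts, actions, rewards)
    return agent
```

In the prototype, `run_batch` would be the place where batches of instances are actually started, and the agent could later be swapped for one of the policies from the contextualbandits package without changing the shape of `train`.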

aaronkurz commented Dec 7, 2021

@tgrosman tgrosman moved this from In Progress to Review in Kanban board Dec 14, 2021
Kanban board automation moved this from Review to Closed Jan 18, 2022
5 participants