[REVIEW]: Generic reinforcement learning codebase in TensorFlow #1524

whedon · 2019-06-24T21:19:24Z

Submitting author: @alexanderimanicowenrivers (Alexander I. Cowen-Rivers)
Repository: https://github.com/for-ai/rl
Version: 1.0.0
Editor: @mbobra
Reviewer: @desilinguist, @paragkulkarni11
Archive: 10.5281/zenodo.3408453

Status

Status badge code:

HTML: <a href="http://joss.theoj.org/papers/a65da0f74f34be097b1c1189ae6abdc6"><img src="http://joss.theoj.org/papers/a65da0f74f34be097b1c1189ae6abdc6/status.svg"></a>
Markdown: [![status](http://joss.theoj.org/papers/a65da0f74f34be097b1c1189ae6abdc6/status.svg)](http://joss.theoj.org/papers/a65da0f74f34be097b1c1189ae6abdc6)

Reviewers and authors:

Please avoid lengthy details of difficulties in the review thread. Instead, please create a new issue in the target repository and link to those issues (especially acceptance-blockers) by leaving comments in the review thread below. (For completists: if the target issue tracker is also on GitHub, linking the review thread in the issue or vice versa will create corresponding breadcrumb trails in the link target.)

Reviewer instructions & questions

@desilinguist & @paragkulkarni11, please carry out your review in this issue by updating the checklist below. If you cannot edit the checklist please:

Make sure you're logged in to your GitHub account
Be sure to accept the invite at this URL: https://github.com/openjournals/joss-reviews/invitations

The reviewer guidelines are available here: https://joss.readthedocs.io/en/latest/reviewer_guidelines.html. Any questions/concerns please let @mbobra know.

✨ Please try and complete your review in the next two weeks ✨

Review checklist for @desilinguist

Conflict of interest

As the reviewer I confirm that I have read the JOSS conflict of interest policy and that there are no conflicts of interest for me to review this work.

Code of Conduct

I confirm that I read and will adhere to the JOSS code of conduct.

General checks

Repository: Is the source code for this software available at the repository url?
License: Does the repository contain a plain-text LICENSE file with the contents of an OSI approved software license?
Version: 1.0.0
Authorship: Has the submitting author (@alexanderimanicowenrivers) made major contributions to the software? Does the full list of paper authors seem appropriate and complete?

Functionality

Installation: Does installation proceed as outlined in the documentation?
Functionality: Have the functional claims of the software been confirmed?
Performance: If there are any performance claims of the software, have they been confirmed? (If there are no claims, please check off this item.)

Documentation

A statement of need: Do the authors clearly state what problems the software is designed to solve and who the target audience is?
Installation instructions: Is there a clearly-stated list of dependencies? Ideally these should be handled with an automated package management solution.
Example usage: Do the authors include examples of how to use the software (ideally to solve real-world analysis problems).
Functionality documentation: Is the core functionality of the software documented to a satisfactory level (e.g., API method documentation)?
Automated tests: Are there automated tests or manual steps described so that the function of the software can be verified?
Community guidelines: Are there clear guidelines for third parties wishing to 1) Contribute to the software 2) Report issues or problems with the software 3) Seek support

Software paper

Authors: Does the paper.md file include a list of authors with their affiliations?
A statement of need: Do the authors clearly state what problems the software is designed to solve and who the target audience is?
References: Do all archival references that should have a DOI list one (e.g., papers, datasets, software)?

Review checklist for @paragkulkarni11

Conflict of interest

As the reviewer I confirm that I have read the JOSS conflict of interest policy and that there are no conflicts of interest for me to review this work.

Code of Conduct

I confirm that I read and will adhere to the JOSS code of conduct.

General checks

Repository: Is the source code for this software available at the repository url?
License: Does the repository contain a plain-text LICENSE file with the contents of an OSI approved software license?
Version: 1.0.0
Authorship: Has the submitting author (@alexanderimanicowenrivers) made major contributions to the software? Does the full list of paper authors seem appropriate and complete?

Functionality

Installation: Does installation proceed as outlined in the documentation?
Functionality: Have the functional claims of the software been confirmed?
Performance: If there are any performance claims of the software, have they been confirmed? (If there are no claims, please check off this item.)

Documentation

A statement of need: Do the authors clearly state what problems the software is designed to solve and who the target audience is?
Installation instructions: Is there a clearly-stated list of dependencies? Ideally these should be handled with an automated package management solution.
Example usage: Do the authors include examples of how to use the software (ideally to solve real-world analysis problems).
Functionality documentation: Is the core functionality of the software documented to a satisfactory level (e.g., API method documentation)?
Automated tests: Are there automated tests or manual steps described so that the function of the software can be verified?
Community guidelines: Are there clear guidelines for third parties wishing to 1) Contribute to the software 2) Report issues or problems with the software 3) Seek support

Software paper

Authors: Does the paper.md file include a list of authors with their affiliations?
A statement of need: Do the authors clearly state what problems the software is designed to solve and who the target audience is?
References: Do all archival references that should have a DOI list one (e.g., papers, datasets, software)?

The text was updated successfully, but these errors were encountered:

whedon · 2019-06-24T21:19:29Z

Hello human, I'm @whedon, a robot that can help you with some common editorial tasks. @desilinguist, @paragkulkarni11 it looks like you're currently assigned to review this paper 🎉.

⭐ Important ⭐

If you haven't already, you should seriously consider unsubscribing from GitHub notifications for this (https://github.com/openjournals/joss-reviews) repository. As a reviewer, you're probably currently watching this repository which means for GitHub's default behaviour you will receive notifications (emails) for all reviews 😿

To fix this do the following two things:

Set yourself as 'Not watching' https://github.com/openjournals/joss-reviews:

You may also like to change your default settings for this watching repositories in your GitHub profile here: https://github.com/settings/notifications

For a list of things I can do to help you, just type:

@whedon commands

whedon · 2019-06-24T21:19:30Z

Attempting PDF compilation. Reticulating splines etc...

whedon · 2019-06-24T21:20:05Z

👉 Check article proof 📄 👈

mbobra · 2019-06-24T21:21:31Z

@desilinguist @paragkulkarni11 Thank you for agreeing to review this submission. Whedon generated a checklist for you above. Please let me know if you have any questions or comments!

mbobra · 2019-06-24T21:26:29Z

@alexanderimanicowenrivers Thank you for helping me find reviewers. I really appreciate it. Do you mind addressing the comments by @cervere (in #1502) in the manuscript (doesn't have to be long)?

alexanderimanicowenrivers · 2019-06-24T23:29:20Z

Thanks for agreeing to review, and I have already addressed these comments @mbobra in my reply here #1502 (comment)

mbobra · 2019-07-02T00:49:43Z

👋 @desilinguist, @paragkulkarni11 How is it going? Would you like more time to review? Do you have any questions? Please let me know!

alexanderimanicowenrivers · 2019-07-02T04:11:42Z

Let me know if there is anything you would like clarified! @desilinguist @paragkulkarni11

desilinguist · 2019-07-02T11:24:15Z

Yes, I’d like until next Friday (7/12) please.

mbobra · 2019-07-02T17:24:55Z

@desilinguist Absolutely. Thanks for your time.

alexanderimanicowenrivers · 2019-07-02T17:40:11Z

Thank you :)

alexanderimanicowenrivers · 2019-07-12T00:55:09Z

Looking forward to receiving the reviews :) @desilinguist and @paragkulkarni11

paragkulkarni11 · 2019-07-12T16:29:49Z

Hi,
I am sorry for late review. In my opinion, documentation could be more elaborate adhering to guidelines given above. In particular , I will expect to be more specific when we say that past work was more general in nature. This will clearly specify and support your claim. Also can we see any real life example of this codebase implementation ? Kindly include it in your documentation. In case,, if possible please add most frequently used words of your paper. I am interested to see references of the paper used for this research. Please let me know if any point is not clear from above discussion. Thanks.

desilinguist · 2019-07-14T13:59:13Z

I have completed my review and I think while the goal of the project is admirable – providing a generic codebase for reinforcement learning in Tensorflow that also provides nice logging and visualization features – the setup process and the overall documentation both leave a lot to be desired. I have filed two issues with suggestions on how to address both of these.

The software paper does have a bit of motivation but it isn't really clear what the target demographic is - is it RL researchers who are already experts and this is a way for them to run experiments or are novice users also included? I got the impression from my interaction with the authors that it was the latter but I was quite lost.

There also doesn't seem to be any real tests - the Travis CI script just runs a bunch of python train.py commands with a bunch of parameters to see whether anything would fail? Am I misreading that?

mbobra · 2019-07-15T23:46:51Z

👋 @alexanderimanicowenrivers Can you please address the comments by @paragkulkarni11 and @desilinguist? Please let me know if you want any help with this.

alexanderimanicowenrivers · 2019-07-20T02:30:09Z

Thanks for the reviews @desilinguist and @paragkulkarni11

@desilinguist - We have resolved and one issue -- an improved initialisation script. We are indeed targetting the RL researcher, rather than a novice. We are finishing up the providing clearer documentation with working examples task (as is also shown in our newer paper.md file). The idea is for this codebase to be used specifically for RL research, rather than for industrial systems (although it could be used for this too)!.

@paragkulkarni11 - Thanks for the feedback, as mentioned above we have addressed the issues of working example and clearer documentation. We have been using it for research, however, we haven’t published anything yet.

The reason our tests are just run scripts is due to the high stochasticity of experiments. However, we are working on developing deterministic settings which we can rigorously test. We are awaiting adding in a few more tests on the non-stochastic components. As well as finishing up the docs.

Thanks!

Alex and the For.ai team :)

mbobra · 2019-07-24T14:11:39Z

@desilinguist @paragkulkarni11 Do you recommend this paper for publication after @alexanderimanicowenrivers' improvements or are there still some outstanding issues? I see that the checklists are not complete and I would like some guidance on how you would like to proceed from here.

desilinguist · 2019-07-24T14:14:13Z

Thanks @alexanderimanicowenrivers for the modifications! Unfortunately, I am traveling to a conference outside the country today and won't be back until August 5th. I will be happy to re-review once I am back.

alexanderimanicowenrivers · 2019-08-01T05:35:27Z

Dear reviewer,

We thank you for your comments and suggestions and believe we have sufficiently addressed all points.

Regarding your comment on the installation script, we have added to the file setup.sh such that the installation script now supports Homebrews on macOS. In addition, we have written additional logging lines that will allow the user to know exactly what is being installed and any errors if they occur. Furthermore, we have added to the file README.md additional information on how the user can manually install all dependencies should they choose to do so.

Regarding your comment on additional documentation, we have added to the files README.md and paper.md. Furthermore, we have created a readthedocs documentation paper, located at: https://rl-codebase.readthedocs.io/en/latest/. Here, we detail the different modules provided as well as a tutorial and basic usage. In the tutorial, we have outlined the use of a Conda environment to run experiments, as you requested us to do.

Regarding your comment on tests, we have under the directory \tests provided testing for memory and models.

Thanks for the great feedback.
Sincerely,
the FOR.ai team

alexanderimanicowenrivers · 2019-08-04T23:09:32Z

I hope you had a nice holiday @desilinguist , I look forward to hearing back from you and @paragkulkarni11

desilinguist · 2019-08-05T13:46:24Z

@alexanderimanicowenrivers thanks so much for considering my suggestions! I think there has been great progress. However, I still see a couple of issues:

I don't see the changes you described in the README in setup.sh. For example, I don't see the mac_package_manaager variable anywhere in setup.sh in the master branch?
I don't see the external documentation linked anywhere from the repository README? It'd be nice to add a readthedocs badge from shields.io right at the top of the README next to the passing builds.
Thanks for adding more explicit tests. However, I see lots of deprecation warnings in the test output. Perhaps you can address those in the codebase or if you don't think they apply, you can turn off the deprecation warnings?

alexanderimanicowenrivers · 2019-08-06T01:16:22Z

Dear @desilinguist,

Apologies, below are the final amendments. We are just working out how to add the badge from shields.io as we speak.

Let me know what you think.

Bests,
Alex

desilinguist · 2019-08-06T13:02:39Z

@alexanderimanicowenrivers I think it would be better to review the changes all together after they are merged into the master branch.

In any case, I looked at the first PR for setup.sh and it looks like it hasn't really been tested with macports. I get the following error when running the command port install qt open-mpi pkg-config ffmpeg:

Error: Port qt not found

alexanderimanicowenrivers · 2019-08-10T08:21:29Z

The major changes have now been merged, as well as the bug for macports has now been fixed in this PR - for-ai/rl#27

Thanks @desilinguist

Alex

desilinguist · 2019-08-12T19:56:12Z

Thanks @alexanderimanicowenrivers! It looks much, much better now. FYI, the echo command in the PR doesn't match the actual command that's run so you should probably fix that :)

@mbobra please consider this approved from my side!

mbobra · 2019-09-30T22:46:50Z

@openjournals/joss-eics This paper is ready for acceptance! Nice work @hsezhiyan @alexanderimanicowenrivers @bryanlimy 🎉

kyleniemeyer · 2019-10-01T01:42:23Z

@whedon generate pdf

whedon · 2019-10-01T01:42:25Z

Attempting PDF compilation. Reticulating splines etc...

whedon · 2019-10-01T01:42:52Z

👉 Check article proof 📄 👈

kyleniemeyer · 2019-10-01T02:05:46Z

Hi @alexanderimanicowenrivers, some fixes needed in the paper before we publish:

@incollection{Sutton2000,
title = {Policy Gradient Methods for Reinforcement Learning with Function Approximation},
author = {Sutton, Richard S and David A. McAllester and Satinder P. Singh and Mansour, Yishay},
booktitle = {{Advances in Neural Information Processing Systems 12}},
editor = {S. A. Solla and T. K. Leen and K. M\"{u}ller},
pages = {1057--1063},
year = {2000},
publisher = {MIT Press},
}

hsezhiyan · 2019-10-04T07:23:21Z

Hi Kyle,

Thanks for you update! I apologize for the delayed response.

I've addressed your comments in PR#32

I've fixed all issues except the Silver 2014 reference. I checked other papers on arxiv, and they cite they cite it the exact same way. Let me know if you have other concerns.

kyleniemeyer · 2019-10-04T15:43:40Z

@whedon generate pdf

whedon · 2019-10-04T15:43:43Z

Attempting PDF compilation. Reticulating splines etc...

whedon · 2019-10-04T15:44:14Z

👉 Check article proof 📄 👈

kyleniemeyer · 2019-10-04T15:56:24Z

Hi @hsezhiyan, thanks for making those edits. I just submitted a PR that fixes two final things in the references: for-ai/rl#33

the Sutton 2000 reference was still missing some information; it looks like you added the BiBTeX citation I gave above, but your paper was referencing the original (incorrect) entry.
For the Silver 2014 reference, it looks like the other papers on arXiv are not citing in correctly (and as a journal, we need to be a bit more rigorous with references); I found the paper at http://proceedings.mlr.press/v32/silver14.html, where they give the correct BibTeX citation info.

hsezhiyan · 2019-10-04T18:01:27Z

@kyleniemeyer thanks for the review and the pull request! I have merged your PR.

Are we good to go?

kyleniemeyer · 2019-10-04T18:03:02Z

@whedon generate pdf

whedon · 2019-10-04T18:03:04Z

Attempting PDF compilation. Reticulating splines etc...

whedon · 2019-10-04T18:03:47Z

👉 Check article proof 📄 👈

kyleniemeyer · 2019-10-04T18:03:58Z

@whedon set 10.5281/zenodo.3408453 as archive

whedon · 2019-10-04T18:04:02Z

OK. 10.5281/zenodo.3408453 is the archive.

kyleniemeyer · 2019-10-04T18:04:22Z

@whedon accept

whedon · 2019-10-04T18:04:24Z

Attempting dry run of processing paper acceptance...

whedon · 2019-10-04T18:04:57Z


OK DOIs

- 10.5281/zenodo.1134899 is OK

MISSING DOIs

- None

INVALID DOIs

- None

whedon · 2019-10-04T18:05:16Z

Check final proof 👉 openjournals/joss-papers#1006

If the paper PDF and Crossref deposit XML look good in openjournals/joss-papers#1006, then you can now move forward with accepting the submission by compiling again with the flag deposit=true e.g.

@whedon accept deposit=true

kyleniemeyer · 2019-10-04T18:06:31Z

@whedon accept deposit=true

whedon · 2019-10-04T18:06:34Z

Doing it live! Attempting automated processing of paper acceptance...

whedon · 2019-10-04T18:07:21Z

🐦🐦🐦 👉 Tweet for this paper 👈 🐦🐦🐦

whedon · 2019-10-04T18:07:22Z

🚨🚨🚨 THIS IS NOT A DRILL, YOU HAVE JUST ACCEPTED A PAPER INTO JOSS! 🚨🚨🚨

Here's what you must now do:

Check final PDF and Crossref metadata that was deposited 👉 Creating pull request for 10.21105.joss.01524 joss-papers#1007
Wait a couple of minutes to verify that the paper DOI resolves https://doi.org/10.21105/joss.01524
If everything looks good, then close this review issue.
Party like you just published a paper! 🎉🌈🦄💃👻🤘

Any issues? notify your editorial technical team...

kyleniemeyer · 2019-10-04T18:15:03Z

Congrats @hsezhiyan on your article's publication in JOSS!

Many thanks to @desilinguist & @paragkulkarni11 for reviewing, and @mbobra for editing, this submission.

whedon · 2019-10-04T18:15:07Z

🎉🎉🎉 Congratulations on your paper acceptance! 🎉🎉🎉

If you would like to include a link to your paper from your README use the following code snippets:

Markdown:
[![DOI](https://joss.theoj.org/papers/10.21105/joss.01524/status.svg)](https://doi.org/10.21105/joss.01524)

HTML:
<a style="border-width:0" href="https://doi.org/10.21105/joss.01524">
  <img src="https://joss.theoj.org/papers/10.21105/joss.01524/status.svg" alt="DOI badge" >
</a>

reStructuredText:
.. image:: https://joss.theoj.org/papers/10.21105/joss.01524/status.svg
   :target: https://doi.org/10.21105/joss.01524

This is how it will look in your documentation:

We need your help!

Journal of Open Source Software is a community-run journal and relies upon volunteer effort. If you'd like to support us please consider doing either one (or both) of the the following:

Volunteering to review for us sometime in the future. You can add your name to the reviewer list here: https://joss.theoj.org/reviewer-signup.html
Making a small donation to support our running costs here: https://numfocus.org/donate-to-joss

hsezhiyan · 2019-10-05T00:01:12Z

Thank you very much @kyleniemeyer for your close collaboration at the last steps!

Thank you @mbobra for your guidance! And thanks @desilinguist and @paragkulkarni11 for your reviews!

whedon assigned mbobra Jun 24, 2019

whedon added the review label Jun 24, 2019

whedon mentioned this issue Jun 24, 2019

[PRE REVIEW]: Generic reinforcement learning codebase in TensorFlow #1502

Closed

whedon assigned desilinguist Jun 25, 2019

whedon assigned paragkulkarni11 Jul 13, 2019

kyleniemeyer closed this as completed Oct 4, 2019

whedon added published Papers published in JOSS recommend-accept Papers recommended for acceptance in JOSS. labels Mar 2, 2020

[REVIEW]: Generic reinforcement learning codebase in TensorFlow #1524

[REVIEW]: Generic reinforcement learning codebase in TensorFlow #1524

Comments

whedon commented Jun 24, 2019 • edited Loading

Status

Reviewer instructions & questions

Review checklist for @desilinguist

Conflict of interest

Code of Conduct

General checks

Functionality

Documentation

Software paper

Review checklist for @paragkulkarni11

Conflict of interest

Code of Conduct

General checks

Functionality

Documentation

Software paper

whedon commented Jun 24, 2019

whedon commented Jun 24, 2019

whedon commented Jun 24, 2019

mbobra commented Jun 24, 2019

mbobra commented Jun 24, 2019

alexanderimanicowenrivers commented Jun 24, 2019 • edited Loading

mbobra commented Jul 2, 2019

alexanderimanicowenrivers commented Jul 2, 2019

desilinguist commented Jul 2, 2019

mbobra commented Jul 2, 2019

alexanderimanicowenrivers commented Jul 2, 2019

alexanderimanicowenrivers commented Jul 12, 2019

paragkulkarni11 commented Jul 12, 2019

desilinguist commented Jul 14, 2019

mbobra commented Jul 15, 2019

alexanderimanicowenrivers commented Jul 20, 2019 • edited Loading

mbobra commented Jul 24, 2019

desilinguist commented Jul 24, 2019

alexanderimanicowenrivers commented Aug 1, 2019 • edited Loading

alexanderimanicowenrivers commented Aug 4, 2019

desilinguist commented Aug 5, 2019

alexanderimanicowenrivers commented Aug 6, 2019

desilinguist commented Aug 6, 2019

alexanderimanicowenrivers commented Aug 10, 2019

desilinguist commented Aug 12, 2019 • edited Loading

mbobra commented Sep 30, 2019

kyleniemeyer commented Oct 1, 2019

whedon commented Oct 1, 2019

whedon commented Oct 1, 2019

kyleniemeyer commented Oct 1, 2019

hsezhiyan commented Oct 4, 2019

kyleniemeyer commented Oct 4, 2019

whedon commented Oct 4, 2019

whedon commented Oct 4, 2019

kyleniemeyer commented Oct 4, 2019

hsezhiyan commented Oct 4, 2019

kyleniemeyer commented Oct 4, 2019

whedon commented Oct 4, 2019

whedon commented Oct 4, 2019

kyleniemeyer commented Oct 4, 2019

whedon commented Oct 4, 2019

kyleniemeyer commented Oct 4, 2019

whedon commented Oct 4, 2019

whedon commented Oct 4, 2019

whedon commented Oct 4, 2019

kyleniemeyer commented Oct 4, 2019

whedon commented Oct 4, 2019

whedon commented Oct 4, 2019

whedon commented Oct 4, 2019

kyleniemeyer commented Oct 4, 2019

whedon commented Oct 4, 2019

hsezhiyan commented Oct 5, 2019

whedon commented Jun 24, 2019 •

edited

Loading

alexanderimanicowenrivers commented Jun 24, 2019 •

edited

Loading

alexanderimanicowenrivers commented Jul 20, 2019 •

edited

Loading

alexanderimanicowenrivers commented Aug 1, 2019 •

edited

Loading

desilinguist commented Aug 12, 2019 •

edited

Loading