-
-
Notifications
You must be signed in to change notification settings - Fork 104
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
submission: healthdatacsv #358
Comments
Hi all: did a small update to include a COC, and a basic use vignette. |
Editor checks:
Editor commentsThanks for your submission. I have requests out to reviewers, and will let you know the reviewers and the review date when enough have agreed. In the meantime, could you please address these issues?
Reviewers: |
One other thing: while the package is undergoing peer review, could you please add the peer review badge? You can do this by running |
Thank you @lmullen - I've started addressing your comments. |
|
Thanks to our reviewers for agreeing to take on the peer review of this package. Reviewer: @czeildi You can find details about how to do the review in the reviewers' guide. |
Package ReviewPlease check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide
DocumentationThe package includes all the following forms of documentation:
Functionality
Final approval (post-review)
Estimated hours spent reviewing: 6
Review CommentsIn general, I found the package easy to learn and use, and the code is readable and simple. Below are my comments in three categories. Most of them are non-blocking. usability
code style
keywords <- jsonlite::flatten(parsed_dataset) %>%
as_tibble() %>%
select(
.data$publisher.name,
.data$keyword
) %>%
tidyr::unnest(cols = c(.data$keyword)) %>%
distinct()
if (!is.null(agency)) {
keywords <- keywords %>%
filter(.data$publisher.name == agency)
}
documentation
fetch_catalog("Centers for Disease Control and Prevention",
keyword = "built environment")
|
Thanks @czeildi! I appreciate the feedback and will start addressing individual items soon. |
In the interest of reducing load on reviewers and editors as we manage the COVID-19 crisis, rOpenSci is temporarily pausing new submissions for software peer review for 30 days (and possibly longer). Please check back here again after 17 April for updates. In this period new submissions will not be handled, nor new reviewers assigned. Reviews and responses to reviews will be handled on a 'best effort' basis, but no follow-up reminders will be sent. Other rOpenSci community activities continue. We express our continued great appreciation for the work of our authors and reviewers. Stay healthy and take care of one other. The rOpenSci Editorial Board |
In this period new submissions will not be handled, nor new reviewers assigned.
Reviews and responses to reviews will be handled on a 'best effort' basis, but
no follow-up reminders will be sent. Please check back here again after 25 May when we will be announcing plans to slowly start back up. We express our continued great appreciation for the work of our authors and reviewers. Stay healthy and take care of one other. The rOpenSci Editorial Board |
Hi, @cksun-usc. Obviously a lot happened in the world since you agreed to take on this review. Is this something that you would be able to pick up at some point? If you do not wish to go forward with the review, that's more than fine; just let me know. |
👋 @iecastro I'll be taking over as editor for this submission and will follow up on what's needed to move this along. Thank you for your patience. |
Hi @czeildi - hope all is well! I've addressed your comments below. Thank you for your time.
|
@iecastro I reviewed your changes, looks great! I happily recommend approving this package. I noticed one minor thing: you should no longer import |
Thanks very much for the review and acceptance recommendation @czeildi The package looks to be in good shape to me as well. Since the second reviewer has gone missing, I'll try to get someone else to do a quick review and move this to acceptance. |
👋 @dbuijs Would you be able to help us review this package? One reviewer has already completed a thorough review and recommended acceptance. You'll need to do a sanity check and make sure everything looks ok to you. There is a checklist to work through. Thanks for considering. 🙏 |
@iecastro I'll keep looking for a second reviewer but if you have any suggestions for potential reviewers who don't have a conflict of interest, I am all ears. |
Adding @richfitz as second reviewer. Thanks for volunteering Rich! Much appreciated. |
Package ReviewPlease check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide
DocumentationThe package includes all the following forms of documentation:
Functionality
Final approval (post-review)
Estimated hours spent reviewing: 1
Review CommentsLooked at 56bfd39 (Mon Jul 27 07:47:25 2020 -0400). I've not been able to complete the review ad the API is down, but the comments below should be addressed regardless. The three big issues are
Note that none of my comments really concern the implementation, which looks fine (modulo comments about dependencies, but that's a matter of taste). Problematic use of flakey API
All tests that use web requests should be opt-in, especially if the package will go to CRAN. Add
Similar, but harder, comments apply to the vignette if this is going on CRAN - I do not recommend at all trying to run a vignette that uses the internet or external service there. All alternative options are unpleasant though:
Testing issuesThe testing is minimal to the point of view that it may not be worth having, for example in
However, when looking at Instead, split these functions into network-using and network free components. You're basically there already, and fetch_catalog <- function(agency = NULL,
keyword = NULL) {
api_call <- healthdata_api("data.json")
fetch_catalogue_process(api_call, agency, keyword)
} and focus the bulk of the testing on the second half. What happens if If you split the code at this point you don't even need to bother formally mocking the api call, you can just keep a "reasonable" data set in the package for testing with. Similar comments apply throughout. You may want to use mocking (I like the The other coverage hole ( Documentation is minimalThe documentation for
This supposes that the user already knows what sort of data night be found there, and it's not clear what the restriction on csv is for. The (Other files have the same issues) In the DependenciesThis seems to me an excessive number of dependencies to pull in for what is a nicely streamlined package. Things like On the other hand, Minor commentRemove commented code (e.g., test-healthdata_api.R) - if part way through writing a test use Please note I will be offline from 4-19 September. I can look again before then but will be unresponsive in those two weeks. |
Thanks so much Rich! This is super helpful. 💯 |
@iecastro Checking in to see if you need any assistance. Please let us know when you're ready. No rush. 🙏 |
Hi @richfitz, @karthik - Some comments below, addressing the review items. I'm struggling with my travis build, and expanding the test suite. @richfitz - if you have the time, would you mind attempting the review again? Thanks, all!
|
Hi @karthik, suggest closing this issue. Have not done further updates in a while. |
ok thanks @iecastro |
Submitting Author Name: Ivan Castro
Due date for @czeildi: 2020-03-19Submitting Author Github Handle: @iecastro
Repository: https://github.com/iecastro/healthdatacsv
Version submitted: 0.0.1
Editor: @karthik
Reviewers: @czeildi, @richfitz
Due date for @richfitz: 2020-08-25
Archive: TBD
Version accepted: TBD
Scope
Please indicate which category or categories from our package fit policies this package falls under: (Please check an appropriate box below. If you are unsure, we suggest you make a pre-submission inquiry.):
Explain how and why the package falls under these categories (briefly, 1-2 sentences):
healthdatacsv
falls under data retrieval because it accesses and ingests data from an online source. This package focuses on health-related data made available from the HealthData.gov API.Who is the target audience and what are scientific applications of this package?
Target audience would be researchers, instructors, students, and analysts in health-related fields (i.e. Epidemiology, Health Economics, Health Services Research). The data accessed are collected by agencies of the US Dept. of Health and Human Services, and are often nationally representative. Scientific applications include, among others, the investigation of: disease patterns, trends in health behaviors, or deficits in health systems.
Are there other R packages that accomplish the same thing? If so, how does yours differ or meet our criteria for best-in-category?
There is rHealthDataGov; however, this package has not been updated and does not function properly with the current HealthData.gov API.
If you made a pre-submission inquiry, please paste the link to the corresponding issue, forum post, or other discussion, or @tag the editor you contacted.
presubmission: healthdatacsv #350
Technical checks
Confirm each of the following by checking the box. This package:
Publication options
JOSS Options
paper.md
matching JOSS's requirements with a high-level description in the package root or ininst/
.MEE Options
Code of conduct
The text was updated successfully, but these errors were encountered: