Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[REVIEW]: Sentiment Analysis of Twitter Data (SAoTD) #764

Closed
16 of 19 tasks
whedon opened this issue Jun 4, 2018 · 58 comments
Closed
16 of 19 tasks

[REVIEW]: Sentiment Analysis of Twitter Data (SAoTD) #764

whedon opened this issue Jun 4, 2018 · 58 comments
Labels
accepted published Papers published in JOSS recommend-accept Papers recommended for acceptance in JOSS. review

Comments

@whedon
Copy link

whedon commented Jun 4, 2018

Submitting author: @evan-l-munson (Evan Munson)
Repository: https://github.com/evan-l-munson/SAoTD
Version: 0.2.0
Editor: @arfon
Reviewer: @kbenoit
Archive: 10.5281/zenodo.2578973

Status

status

Status badge code:

HTML: <a href="http://joss.theoj.org/papers/e6002792b44f50039afc22dbe3d4a086"><img src="http://joss.theoj.org/papers/e6002792b44f50039afc22dbe3d4a086/status.svg"></a>
Markdown: [![status](http://joss.theoj.org/papers/e6002792b44f50039afc22dbe3d4a086/status.svg)](http://joss.theoj.org/papers/e6002792b44f50039afc22dbe3d4a086)

Reviewers and authors:

Please avoid lengthy details of difficulties in the review thread. Instead, please create a new issue in the target repository and link to those issues (especially acceptance-blockers) in the review thread below. (For completists: if the target issue tracker is also on GitHub, linking the review thread in the issue or vice versa will create corresponding breadcrumb trails in the link target.)

Reviewer instructions & questions

@kbenoit, please carry out your review in this issue by updating the checklist below. If you cannot edit the checklist please:

  1. Make sure you're logged in to your GitHub account
  2. Be sure to accept the invite at this URL: https://github.com/openjournals/joss-reviews/invitations

The reviewer guidelines are available here: https://joss.theoj.org/about#reviewer_guidelines. Any questions/concerns please let @leeper know.

Review checklist for @kbenoit

Conflict of interest

Code of Conduct

General checks

  • Repository: Is the source code for this software available at the repository url?
  • License: Does the repository contain a plain-text LICENSE file with the contents of an OSI approved software license?
    No, but many R packages using standard licenses only name the license in the DESCRIPTION file. The README.md refers to "All code is licensed GL", this should be "GPL-x".
  • Version: 0.2.0
    No: GitHub release is 6 commits behind the master. Master has v1.0.0.
  • Authorship: Has the submitting author (@evan-l-munson) made major contributions to the software?
    Yes, apparently all.
  • Does the full list of paper authors seem appropriate and complete?
    More than complete, as the 2nd through last author have an unknown contribution to the software. They have not made any GitHub commits, and there is no record of them having authored any of the functions (for instance via @author).

Functionality

  • Installation: Does installation proceed as outlined in the documentation?
  • Functionality: Have the functional claims of the software been confirmed?
  • Performance: If there are any performance claims of the software, have they been confirmed? (If there are no claims, please check off this item.)

Documentation

  • A statement of need: Do the authors clearly state what problems the software is designed to solve and who the target audience is?
  • Installation instructions: Is there a clearly-stated list of dependencies? Ideally these should be handled with an automated package management solution.
  • Example usage: Do the authors include examples of how to use the software (ideally to solve real-world analysis problems).
  • Functionality documentation: Is the core functionality of the software documented to a satisfactory level (e.g., API method documentation)?
  • Automated tests: Are there automated tests or manual steps described so that the function of the software can be verified?
  • Community guidelines: Are there clear guidelines for third parties wishing to 1) Contribute to the software 2) Report issues or problems with the software 3) Seek support

Software paper

  • Authors: Does the paper.md file include a list of authors with their affiliations?
  • A statement of need: Do the authors clearly state what problems the software is designed to solve and who the target audience is?
    No, the paper focuses more on what the package does than providing an explicit statement of need. This could be made more clear (and at little cost or effort).
  • References: Do all archival references that should have a DOI list one (e.g., papers, datasets, software)?
@whedon
Copy link
Author

whedon commented Jun 4, 2018

Hello human, I'm @whedon. I'm here to help you with some common editorial tasks. @kbenoit it looks like you're currently assigned as the reviewer for this paper 🎉.

⭐ Important ⭐

If you haven't already, you should seriously consider unsubscribing from GitHub notifications for this (https://github.com/openjournals/joss-reviews) repository. As a reviewer, you're probably currently watching this repository which means for GitHub's default behaviour you will receive notifications (emails) for all reviews 😿

To fix this do the following two things:

  1. Set yourself as 'Not watching' https://github.com/openjournals/joss-reviews:

watching

  1. You may also like to change your default settings for this watching repositories in your GitHub profile here: https://github.com/settings/notifications

notifications

For a list of things I can do to help you, just type:

@whedon commands

@whedon
Copy link
Author

whedon commented Jun 4, 2018

Attempting PDF compilation. Reticulating splines etc...

@whedon
Copy link
Author

whedon commented Jun 4, 2018

@kbenoit
Copy link

kbenoit commented Jun 7, 2018

Review

[Checklist moved above]

Comments

On the paper

This paper describes an R package that provides a workflow for analyzing sentiment and topics in Twitter text, wrapping around packages such as twitteR, tidytext, and topicmodels. The package contains a number of useful analytic functions for looking at Twitter data, and these are clearly demonstrated in the vignette. Some of these could be useful to other forms of text, but the package is specially designed to work with Twitter data, including not just the import of this data but also working with Twitter-specific handles such as hashtags and usernames

This paper makes a nice, short article that should be published, but could be improved by addressing a few relatively minor issues:

  • A few sentences at the outset describing the ideal use of the package and how it fits into other text analysis workflows using R would be welcome.
  • The package presumes a knowledge of the tidyverse approach to R, which is fine, but the paper (and package documentation) could mention this more explicitly.
  • Should the lexicon used for sentiment analysis be called the "Bing lexicon" or the Hu and Liu (2004) sentiment dictionary?
  • "Twitter" and "Tweet" should be capitalized in the text.

On the package

The package works, and I have seen much worse source code in widely-used R packages published on CRAN. These are some suggestions for improving the code and the package, not necessarily linked to the paper and whether it should be published. (I leave to the editor to decide.)

Naming. This is a matter of preference, although there are some emerging guidelines designed to reduce the chaos in the R world. This paper combines capitalized object names with lower-cased object names, and function names with . (e.g. e.g. PosNeg.Words()), which is generally discouraged due to the indeterminacy with the S3 dispatch system. Do the functions really need to be capitalized? Also naming is not consistent: Word.Corr() is "object.verb" but Number.Topics is verb.object.

The package name itself runs contrary to this advice from Hadley Wickham:

Avoid using both upper and lower case letters: doing so makes the package name hard to type and even harder to remember.

Unnecessary C++ code. Why is there a function rcpp_hello_world()? This looks like demonstration code that should be removed.

Data copyright issues. Can distribute the data in raw_tweets. That material is copyright of the authors of the Tweets. At the least, it may require some attribution. It would be worth reviewing the Twitter terms of service about this.

Data object loading. Set LazyLoad: true in DESCRIPTION else code like the (not run) examples for Scores() will not work.

(non-)Object orientation. None of the functions use generics and method dispatch, but rather check the class of the input objects using conditionals within each function (e.g. here. This makes extending the package harder, in addition to being more error-prone. The function names are very generic, furthermore, such as BoxPlot(), or Tidy(). Other packages have functions named tidy(), but they are defined for specific object classes. I suggest using more distinctive names to differentiate this package's functions from those found in other packages, and/or method dispatch for specific object classes. Simplyu capitalizing the function names is likely to confuse some users.

Code organization. Nearly all functions are in a single long .R file called Function.R. Splitting this into smaller files would make the code organization clearer.

Examples. Most examples are not run, due to the difficulties of connecting to the Twitter API using authentication. But this is not true for an functions that only use raw_tweets, such as Bigram(), Bigram.Network(), BoxPlot(), etc. Furthermore, this code does not run as written, because raw_tweets is not lazy loaded. In addition, there is no need to load the package in the examples (using library(SAoTD)) because the help functions should be only accessible if the package is already loaded.

Tests. The file tests/testthat/test_Acquire.R contains Twitter authentication keys. These should be removed (and changed, since they will remain visible in the git history).

@leeper
Copy link
Member

leeper commented Jun 7, 2018

Excellent review, @kbenoit! Thank so much!

@evan-l-munson Can you address the issues raised in the review - particularly the missing checked items from the review checklist and the other useful suggestions raised in the review?

@evan-l-munson
Copy link

evan-l-munson commented Jun 7, 2018 via email

@leeper
Copy link
Member

leeper commented Aug 2, 2018

@evan-l-munson Just a nudge on this.

@leeper
Copy link
Member

leeper commented Aug 26, 2018

@evan-l-munson Just another nudge on this.

@evan-l-munson
Copy link

evan-l-munson commented Aug 30, 2018 via email

@arfon
Copy link
Member

arfon commented Nov 2, 2018

Hi @evan-l-munson - please try and get to these updates when you get a chance.

@evan-l-munson
Copy link

I have made about half the corrections suggested above. In the next week, I hope to rename my functions to better fit with standard naming conventions, additionally, I am looking at the test dataset for the copyright issues and a couple of the other issues.

@evan-l-munson
Copy link

Good evening @arfon, I think I made all the requested corrections. Please let me know if you need anything else corrected/adjusted. Thanks for the patience and assistance in this process.

@arfon
Copy link
Member

arfon commented Nov 19, 2018

👋 @kbenoit - please come and take another look at this submission when you get a chance, the author has made some updates based on your feedback.

@arfon
Copy link
Member

arfon commented Nov 19, 2018

@whedon assign @arfon as editor

@kbenoit
Copy link

kbenoit commented Nov 19, 2018

@arfon Happy to do so.

@arfon
Copy link
Member

arfon commented Dec 10, 2018

Hi @kbenoit - have you had a chance to take another look at this submission?

@kbenoit
Copy link

kbenoit commented Dec 10, 2018

I've had a chance to look at the package again, and I am pleased to report that it and the paper are much improved. @evan-l-munson has done a very good job of addressing my concerns above, if not as good a job of summarizing in a PR or memo what these changes were 😉 .

Package. The code organization is much better (and will be easier to maintain or for potential contributors to absorb). I also like the function naming much better than before - the function index looks more tidy and sensible now.

There is still unnecessary C++ code in /src, which creates the function rcpp_hello_world() in the package index. This should simply be removed.

Paper. The paper does a much better job now of explaining the package and its purpose, and the vignette rounds this out nicely.

Subject to the Hello World change, 👍.

@arfon
Copy link
Member

arfon commented Dec 10, 2018

Thanks @kbenoit. @evan-l-munson - please make these final changes to your package and we can move forward accepting this submission.

@evan-l-munson
Copy link

Gentlemen, I missed your email by accident. I appreciate these comments and will try to get them corrected/accounted for this weekend or sometime during the holidays. Thank you!

@evan-l-munson
Copy link

Gentlemen, good morning. I have corrected rcpp_hello_world() function. I was running through everything to make sure it was working properly before I gave you the final word. Everything was looking good until I tried to view the vignette. For some reason, the vignette is not found when using utils::vignette("saotd"). I will troubleshoot that and hopefully get that working. Thanks!

@arfon
Copy link
Member

arfon commented Jan 2, 2019

Thanks for the update @evan-l-munson

@labarba
Copy link
Member

labarba commented Jan 20, 2019

👋 @evan-l-munson — How is it going? Have you been able to troubleshoot the issue? Give us an update when you can. Thanks!

@evan-l-munson
Copy link

Good morning, I appreciate your patience with me. Finishing up this package is ending up being more challenging and time-consuming than I anticipated. I was working on my vignette issue yesterday and am struggling to fix what I am seeing. Everything seems to be built properly, however, after I re-download and load the package from my Git page the vignettes are not found. I have used both, utils::vignette('saotd') and utils::browseVignettes('saotd') but receive the same error message: No vignettes found by utils::browseVignettes("saotd"). I have also sent the Git repository to a friend, who experienced the same issue. As I look at the R package structure I think I have everything correct (and I have been looking at it for months, trying to correct) but obviously I have something incorrect since they are not being found. If you have some insight as I what might be happening I would appreciate the any thoughts you might have so I can complete this. Thanks!

@arfon
Copy link
Member

arfon commented Feb 10, 2019

If you have some insight as I what might be happening I would appreciate the any thoughts you might have so I can complete this. Thanks!

I'm not sure sorry. Perhaps @kbenoit has some thoughts on this?

@evan-l-munson
Copy link

@arfon I have another colleague looking into the issue for me. I am hoping they will get back to me this week. I have ran out of ideas on my end and am not sure why I can view the vignettes after I re-download the package from GitHub. If the vignette issue isn't a big one I would say the package is ready for submission, but if that is a big issue, I will continue to work on it. Thanks!

@whedon
Copy link
Author

whedon commented Feb 25, 2019

@arfon
Copy link
Member

arfon commented Feb 25, 2019

Hi @evan-l-munson. I've made some slight tweaks to your paper in evan-l-munson/saotd#9 - let me know what you think.

In addition, please make sure you:

  • Update the version number based on the feedback from this review.
  • Add something to your README making it clear how users can contribute (if appropriate).

@evan-l-munson
Copy link

@arfon, Good evening. I have completed your suggested edits/additions and pushed to GitHub. If there is anything else, let me know and I will get working on it asap. Thanks for the help!

@arfon
Copy link
Member

arfon commented Feb 25, 2019

@whedon generate pdf

@whedon
Copy link
Author

whedon commented Feb 25, 2019

Attempting PDF compilation. Reticulating splines etc...

@whedon
Copy link
Author

whedon commented Feb 25, 2019

@arfon
Copy link
Member

arfon commented Feb 25, 2019

@arfon, Good evening. I have completed your suggested edits/additions and pushed to GitHub. If there is anything else, let me know and I will get working on it asap. Thanks for the help!

@evan-l-munson thanks. Please could you clarify what version the package is now at? Given the modifications during the review it would seem appropriate to make a new release and archive in Zenodo.

Once you've done this I'm happy to proceed with accepting this submission.

@evan-l-munson
Copy link

@arfon, I bumped the package to version 0.2.0. I will work to get resubmit to Zendo within the next day or two. Will let you know when I get that done. Thanks!

@evan-l-munson
Copy link

@arfon, I released the 0.2.0 version to zenodo this morning and have updated the DOI badge. Is there anything else you need? Thanks!

@arfon
Copy link
Member

arfon commented Feb 27, 2019

@whedon set 10.5281/zenodo.2578973 as archive

@whedon
Copy link
Author

whedon commented Feb 27, 2019

OK. 10.5281/zenodo.2578973 is the archive.

@arfon
Copy link
Member

arfon commented Feb 27, 2019

@whedon set 0.2.0 as version

@whedon
Copy link
Author

whedon commented Feb 27, 2019

OK. 0.2.0 is the version.

@arfon
Copy link
Member

arfon commented Feb 27, 2019

@whedon accept

@whedon
Copy link
Author

whedon commented Feb 27, 2019

Attempting dry run of processing paper acceptance...

@whedon
Copy link
Author

whedon commented Feb 27, 2019


OK DOIs

- http://doi.org/10.18637/jss.v040.i13 is OK

MISSING DOIs

- None

INVALID DOIs

- None

@whedon
Copy link
Author

whedon commented Feb 27, 2019

Check final proof 👉 openjournals/joss-papers#529

If the paper PDF and Crossref deposit XML look good in openjournals/joss-papers#529, then you can now move forward with accepting the submission by compiling again with the flag deposit=true e.g.

@whedon accept deposit=true

@arfon
Copy link
Member

arfon commented Feb 27, 2019

@whedon accept deposit=true

@whedon
Copy link
Author

whedon commented Feb 27, 2019

Doing it live! Attempting automated processing of paper acceptance...

@whedon
Copy link
Author

whedon commented Feb 27, 2019

🚨🚨🚨 THIS IS NOT A DRILL, YOU HAVE JUST ACCEPTED A PAPER INTO JOSS! 🚨🚨🚨

Here's what you must now do:

  1. Check final PDF and Crossref metadata that was deposited 👉 Creating pull request for 10.21105.joss.00764 joss-papers#530
  2. Wait a couple of minutes to verify that the paper DOI resolves https://doi.org/10.21105/joss.00764
  3. If everything looks good, then close this review issue.
  4. Party like you just published a paper! 🎉🌈🦄💃👻🤘

Any issues? notify your editorial technical team...

@arfon arfon closed this as completed Feb 27, 2019
@whedon
Copy link
Author

whedon commented Feb 27, 2019

🎉🎉🎉 Congratulations on your paper acceptance! 🎉🎉🎉

If you would like to include a link to your paper from your README use the following code snippets:

Markdown:
[![DOI](http://joss.theoj.org/papers/10.21105/joss.00764/status.svg)](https://doi.org/10.21105/joss.00764)

HTML:
<a style="border-width:0" href="https://doi.org/10.21105/joss.00764">
  <img src="http://joss.theoj.org/papers/10.21105/joss.00764/status.svg" alt="DOI badge" >
</a>

reStructuredText:
.. image:: http://joss.theoj.org/papers/10.21105/joss.00764/status.svg
   :target: https://doi.org/10.21105/joss.00764

This is how it will look in your documentation:

DOI

We need your help!

Journal of Open Source Software is a community-run journal and relies upon volunteer effort. If you'd like to support us please consider doing either one (or both) of the the following:

@arfon
Copy link
Member

arfon commented Feb 27, 2019

@kbenoit - many thanks for your review of this submission ✨

@evan-l-munson - your paper is now accepted into JOSS ⚡🚀💥

@arfon arfon reopened this Feb 27, 2019
@arfon arfon closed this as completed Feb 27, 2019
@whedon
Copy link
Author

whedon commented Feb 27, 2019

🎉🎉🎉 Congratulations on your paper acceptance! 🎉🎉🎉

If you would like to include a link to your paper from your README use the following code snippets:

Markdown:
[![DOI](http://joss.theoj.org/papers/10.21105/joss.00764/status.svg)](https://doi.org/10.21105/joss.00764)

HTML:
<a style="border-width:0" href="https://doi.org/10.21105/joss.00764">
  <img src="http://joss.theoj.org/papers/10.21105/joss.00764/status.svg" alt="DOI badge" >
</a>

reStructuredText:
.. image:: http://joss.theoj.org/papers/10.21105/joss.00764/status.svg
   :target: https://doi.org/10.21105/joss.00764

This is how it will look in your documentation:

DOI

We need your help!

Journal of Open Source Software is a community-run journal and relies upon volunteer effort. If you'd like to support us please consider doing either one (or both) of the the following:

@evan-l-munson
Copy link

@arfon and @kbenoit thank you both so much for your help! Take care!

@whedon whedon added published Papers published in JOSS recommend-accept Papers recommended for acceptance in JOSS. labels Mar 2, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
accepted published Papers published in JOSS recommend-accept Papers recommended for acceptance in JOSS. review
Projects
None yet
Development

No branches or pull requests

6 participants