Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Legal question in regard to copyrighted reviews. #1

Open
jcrodriguez1989 opened this issue Jan 8, 2021 · 2 comments
Open

Legal question in regard to copyrighted reviews. #1

jcrodriguez1989 opened this issue Jan 8, 2021 · 2 comments

Comments

@jcrodriguez1989
Copy link

Dear authors,
thanks for this great script, and for the accompanying publication.
I have a related question. I have web-scraped (through a public, non-documented API) a huge number of individual reviews in the structure of review = (customer, product, rating, review_text). So, these result in short texts (no books, chapters, etc.), what are your thoughts about the legality of transforming individual reviews to bag of words?

thanks,
Dr. Rodriguez, Juan Cruz.

@hawc2
Copy link
Member

hawc2 commented Jan 8, 2021

Hey @jcrodriguez1989, it's legal to transform copyrighted works for non-consumptive research purposes - this is well within fair use. You might like to read Matthew Sag's article, "The New Legal Landscape for Text Mining and Machine Learning.

If you're taking copyrighted reviews and converting them to disaggregated (disordered) bag of words datasets using this script, you can share the bags of words with anyone.

Depending on where you're getting your reviews, they may not even be under copyright restrictions - I would check with the platform/publisher hosting them for their policies on user privacy and what copyright they retain over reviews

@jcrodriguez1989
Copy link
Author

Hi @hawc2 ,
thank you very much for your fast reply, and for the provided article.
First, I will try to get the data owner's permission, but I am not even able to get a contact.
I will let you know how it goes.
thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants