No description, website, or topics provided.
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
.rubocop.yml
2018-10-20.qs
Gemfile
Gemfile.lock
README.md
scraper.rb

README.md

This scraper queries Wikidata for articles on the ScienceSource focus list with a PubMed Central ID, but where the license is unknown.

It then looks up the article on the PubMed website to see if there's a Creative Commons license mentioned on the article page, and if so generates QuickStatements-compatible commands for adding that license information to Wikidata.

History

This was originally written as part of the ContentMine/Cambridge Wikidata Workshop in October 2018.

Possible Expansions

Rather than ignoring any non-CC licenses, it would be useful to set something useful about the license onto the Wikidata item, and/or remove it from the focus list.