Database of citation relationships, sourced from CrossRef, web scraping, etc.
php crossref-to-db.php takes DOI and gets citation data from CrossRef
php pensoft-to-db.php takes local Pensoft JATS XML download, extracts citations and tries to enhance
php match-to-db.php does a SQL query to get a list of articles and tries to match them to the local database (and optionally to CrossRef)
Extract references from PDF
php pdf-extract.php
Match cited references to Wikidata items
php wikidata-match.php
Generate quick statements for Wikidata
php wikidata-cites-quickstatments.php
Wikidata example: http://opencitations.net/oci/01027931310-01022252312
cites.php calls external services to get data and dump to SQL.
Need to fetch HTMl for reference page, then parse.
Has citation data in Google Scholar tags.
Extract from web page.
Need to fetch HTMl for reference page, then parse.
Citations may not be linked to DOIs, so we may need to fix this.
match.php matches citations to DOIs in microcitations.publications table (i. E., does SQL query
match-to-db.php uses micro citation web service to match to any GUIDs in microcitations.publications table
Darwininitium – a new fully pseudosigmurethrous orthurethran genus from Nepal (Gastropoda, Pulmonata, Cerastidae) has very few linked references, but many can be linked.
Oryzomys couesi has a very detailed Wikipedia page, how much of this can be captured in Wikidata?