Skip to content
This repository was archived by the owner on Mar 1, 2023. It is now read-only.

1.2

Choose a tag to compare

@rabdill rabdill released this 23 Jul 05:21
· 44 commits to master since this release

Spider

  • Squashed bug that didn't update papers in an unknown collection (038eefa)
  • Outdated Crossref results are now not deleted until we verify we have valid data to replace it (0e1a398)
  • Author institution now recorded for each preprint, not just most recent (1cb5707)
  • Papers automatically updated if they have missing URLs (5a26612), missing dates (ec271cb) or missing authors (7fd60cc).
  • Paper abstracts are pulled from a different page location now, which is available more consistently.
  • Better retry logic for fetching data from Crossref (f6eb529)
  • Changes to accommodate the modified article metrics format on the bioRxiv website, which now includes download statistics for the "full-text HTML" in addition to the other metrics. (9d0077f)
  • Simplified get_publication_dates function (02fdbaf)
  • Squashed retry bug in fetching article stats (bf0dfb1)

API

  • Added endpoint at /v1/data/stats for stats reflecting data quality (c4940b6)