This repository was archived by the owner on Mar 1, 2023. It is now read-only.
1.2
Spider
- Squashed bug that didn't update papers in an unknown collection (038eefa)
- Outdated Crossref results are now not deleted until we verify we have valid data to replace it (0e1a398)
- Author institution now recorded for each preprint, not just most recent (1cb5707)
- Papers automatically updated if they have missing URLs (5a26612), missing dates (ec271cb) or missing authors (7fd60cc).
- Paper abstracts are pulled from a different page location now, which is available more consistently.
- Better retry logic for fetching data from Crossref (f6eb529)
- Changes to accommodate the modified article metrics format on the bioRxiv website, which now includes download statistics for the "full-text HTML" in addition to the other metrics. (9d0077f)
- Simplified
get_publication_datesfunction (02fdbaf) - Squashed retry bug in fetching article stats (bf0dfb1)
API
- Added endpoint at
/v1/data/statsfor stats reflecting data quality (c4940b6)