This repository was archived by the owner on Mar 1, 2023. It is now read-only.
0.8
Pre-release
Pre-release
- Added more code comments and tidied up function definitions
- General refactoring
Web crawler
- The old "Author" entities, and all references to them, are gone. Only the "DetailedAuthor" class and its attendant methods are present now, and are now called just
Author. - Author institutions are updated every time we find a new paper of theirs, so we'll always have the most recent entry.
- Cutting off semicolons that keep showing up at the end of author institution names in the bioRxiv HTML
- More error handling, better recovery from botched HTTP calls
API
- Added redirects for old author IDs. Lots and lots of authors were indexed by google, and the switch from
AuthortoDetailedAuthorobjects changed all the IDs. Now we have a translation to add 301 responses for the old ones to the (most likely) new one. - Changed URLs to not have
/api/in them anymore, since that will be in our hostname going forward. - Negative page numbers no longer allowed