Releases: jannisborn/paperscraper
v0.2.13
v0.2.12
What's Changed
- chore(deps): bump requests from 2.31.0 to 2.32.0 by @dependabot in #42
- Add retry logic in XRXivApi to handle request timeouts by @memray in #43
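The retry behavior mentioned above can be sketched generically. The helper below is an illustrative example only (not paperscraper's actual `XRXivApi` implementation, whose internals are not shown here): it retries a flaky callable with exponential backoff when a timeout occurs.

```python
import time

def with_retries(func, max_retries=3, backoff=0.1):
    """Call func(), retrying on TimeoutError with exponential backoff.

    Illustrative sketch only -- not the actual XRXivApi implementation.
    """
    for attempt in range(max_retries):
        try:
            return func()
        except TimeoutError:
            if attempt == max_retries - 1:
                raise  # exhausted all retries, propagate the error
            time.sleep(backoff * 2 ** attempt)

# A flaky stand-in for a network call: fails twice, then succeeds.
calls = {"n": 0}
def flaky_request():
    calls["n"] += 1
    if calls["n"] < 3:
        raise TimeoutError("request timed out")
    return "ok"

result = with_retries(flaky_request)
```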
Full Changelog: v0.2.11...v0.2.12
v0.2.11
What's Changed
- fix: lower default max_results in PubMed by @jannisborn in #41
Full Changelog: v0.2.10...v0.2.11
Impact factor restoration
0.2.9 was broken because the dependencies of paperscraper.impact
were not shipped via PyPI (installation from source worked fine).
This release fixes that and expands the tests to catch such cases in the future.
What's Changed
- Hotfix by @jannisborn in #39
Full Changelog: v0.2.9...v0.2.10
Impact factor integration
Fuzzy search of impact factors by journal name
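Fuzzy matching of journal names can be illustrated with the standard library's `difflib`. This is a sketch only: the journal table and function name below are hypothetical, not paperscraper's actual API or data.

```python
from difflib import get_close_matches

# Hypothetical journal -> impact factor table, for illustration only.
IMPACT_FACTORS = {
    "Nature Communications": 16.6,
    "Journal of Chemical Information and Modeling": 5.6,
    "Bioinformatics": 5.8,
}

def fuzzy_impact_factor(query, cutoff=0.4):
    """Return (matched journal, impact factor) for the closest journal name,
    or None if nothing is similar enough."""
    matches = get_close_matches(query, IMPACT_FACTORS, n=1, cutoff=cutoff)
    if not matches:
        return None
    return matches[0], IMPACT_FACTORS[matches[0]]

# A partial query still resolves to the full journal name.
match = fuzzy_impact_factor("Nature Comm")
```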
What's Changed
- Impact factor by @jannisborn in #37
Full Changelog: v0.2.8...v0.2.9
v0.2.8
What's Changed
- Graceful handling of connection errors by @jannisborn in #35
- chore(deps): bump requests from 2.24.0 to 2.31.0 by @dependabot in #30
New Contributors
- @dependabot made their first contribution in #30
Full Changelog: v0.2.7...v0.2.8
v0.2.7
What's Changed
- fix: OS agnostic urljoining by @jannisborn in #29
Fixes a bug that prevented Windows users from querying the chemrxiv API
Full Changelog: v0.2.6...v0.2.7
v0.2.6
What's Changed
- Save DOIs from arxiv papers by @jannisborn in #27
This also makes it possible to scrape PDFs from arxiv metadata
Full Changelog: v0.2.5...v0.2.6
v0.2.5
What's Changed
- Extract records from biorxiv and medrxiv based on start date and end date by @achouhan93 in #24
- Extract records from chemrxiv based on start date and end date by @achouhan93 and @jannisborn in #25
EXAMPLE
Since v0.2.5, paperscraper also allows scraping {med/bio/chem}rxiv for specific date ranges:
`medrxiv(begin_date="2023-04-01", end_date="2023-04-08")`
But watch out: the resulting `.jsonl` file will be labelled according to the current date, and all your subsequent searches will be based on this file only. If you use this option, you might want to keep an eye on the source files (`paperscraper/server_dumps/*jsonl`) to ensure they contain the metadata for all papers you're interested in.
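One way to "keep an eye" on a server dump is to load it and check which dates it covers. The sketch below builds a tiny mock dump and inspects it with pandas; the field names (`"title"`, `"doi"`, `"date"`) and the dump filename are assumptions for illustration, not the exact schema paperscraper writes.

```python
import json
import os
import tempfile

import pandas as pd

# Write a tiny mock dump; real dumps live under paperscraper/server_dumps/.
# The record fields here are illustrative assumptions.
records = [
    {"title": "Paper A", "doi": "10.1101/0001", "date": "2023-04-02"},
    {"title": "Paper B", "doi": "10.1101/0002", "date": "2023-04-07"},
]
path = os.path.join(tempfile.mkdtemp(), "medrxiv_2023-04-08.jsonl")
with open(path, "w") as f:
    for rec in records:
        f.write(json.dumps(rec) + "\n")

# Load the dump (one JSON object per line) and check its date coverage.
df = pd.read_json(path, lines=True, convert_dates=False)
coverage = (df["date"].min(), df["date"].max())
```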
New Contributors
- @achouhan93 made their first contribution in #24
Full Changelog: v0.2.4...v0.2.5
v0.2.4
v0.2.4 - release summary
- Support for scraping PDFs
- Harmonize return types of scraper classes to `pd.DataFrame` rather than `List[Dict]`.
1. Scraping PDFs
v0.2.4 now supports downloading PDFs. The core function is `paperscraper.pdf.save_pdf`, which receives a dictionary with the key `doi` and downloads the PDF for the desired DOI. There's also a wrapper function, `paperscraper.pdf.save_pdf_from_dump`, that can be called with the filepath of a `.jsonl` file previously obtained in a metadata search; it downloads the PDFs for all papers in that file. Examples are given in the README.
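The dump-to-PDFs pattern described above can be sketched locally. The code below is an illustrative mock, not paperscraper's implementation: the real `save_pdf` downloads a PDF over the network, so here a stub stands in for it, and the loop mimics what a `save_pdf_from_dump`-style wrapper does with each line of the `.jsonl` file.

```python
import json
import os
import tempfile

def save_pdf_stub(paper_metadata, filepath):
    """Stand-in for paperscraper.pdf.save_pdf: the real function downloads
    the PDF for paper_metadata["doi"]; this stub just records the call."""
    with open(filepath, "w") as f:
        f.write(f"PDF for {paper_metadata['doi']}")

def save_pdfs_from_dump(dump_path, out_dir, saver=save_pdf_stub):
    """Mimic the save_pdf_from_dump pattern: one saver call per dump entry."""
    os.makedirs(out_dir, exist_ok=True)
    saved = []
    with open(dump_path) as f:
        for i, line in enumerate(f):
            meta = json.loads(line)
            out = os.path.join(out_dir, f"paper_{i}.pdf")
            saver({"doi": meta["doi"]}, out)
            saved.append(out)
    return saved

# Build a two-entry mock dump and "download" its PDFs.
tmp = tempfile.mkdtemp()
dump = os.path.join(tmp, "dump.jsonl")
with open(dump, "w") as f:
    f.write(json.dumps({"doi": "10.1101/0001"}) + "\n")
    f.write(json.dumps({"doi": "10.1101/0002"}) + "\n")
saved = save_pdfs_from_dump(dump, os.path.join(tmp, "pdfs"))
```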
Thanks to @daenuprobst for suggestions!
2. Return types
With this version, all scraper classes return their results as a pandas DataFrame (one paper per row) rather than a list of dictionaries (one paper per dict).
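The difference between the two return types can be shown with plain pandas; the paper records below are made up for illustration.

```python
import pandas as pd

# Pre-v0.2.4 style: a list of dictionaries, one paper per dict.
papers = [
    {"title": "Paper A", "doi": "10.1101/0001"},
    {"title": "Paper B", "doi": "10.1101/0002"},
]

# v0.2.4 style: the same records as a DataFrame, one paper per row.
df = pd.DataFrame(papers)
```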
Full Changelog: v0.2.3...v0.2.4